- Superhuman AI
- Posts
- Cerebras unveils faster way to deploy AI
Cerebras unveils faster way to deploy AI
ALSO: Create AI images with Meta AI
Read time: under 4 minutes
Welcome back, Superhuman
All eyes are on Nvidia’s earnings report today, but relative newcomer Cerebras has an announcement that might just beat the world’s second-largest company’s numbers. And: We finally get a sneak peek into OpenAI’s “Strawberry.”
Today’s Insights
Cerebras releases the world’s fastest AI inference system
Tutorial: How to create AI images with Meta AI
From the Frontier: ‘Strawberry’ details revealed
Everything else you should know today
3 new AI tools to boost your productivity
AI-Generated Images: Old Books
NEXT IN AI
Cerebras releases the world’s fastest AI inference system
Source: Cerebras Systems
We’re still in the AI dial-up era, but a startup called Cerebras wants to do to LLMs what high-speed internet did for web browsing. Earlier this year, it showed off the world’s largest AI chip (it’s about the size of a dinner plate). Now, it’s releasing a new system that can run AI products via the cloud — and at unprecedented speeds.
How it works: Cerebras packed its record-setting chips onto a system called CS-3, then used that infrastructure to build some of the world’s largest supercomputers. Its latest release helps companies put those LLMs to use in the real world.
What’s inference anyway?
It’s the process of taking in new information, then running it against a dataset that the model was previously trained on
It can be used to spot patterns in large swathes of data, and it can help models make decisions much faster than other approaches
Inference already makes up about 40% of the AI hardware market, but that figure is steadily ticking up
Why is Cerebras so much faster than its rivals? Traditional GPUs have to interact with external memory each time they crunch a piece of data; but because Cerebras’ chips are so massive, there’s room to fit a ton of memory directly onto them, completely bypassing that step.
The results: Many performance-focused systems have to scale back their accuracy in order to boost speed. But Cerebras says its architecture runs at a native 16-bits, meaning its precision never drops off. When it comes to training Meta’s Llama 3.1, it’s around 20 times faster than comparable Nvidia GPU-based systems — at just one-fifth of the cost.
PRESENTED BY SPINACH AI
AI Agents now join your Zoom and run your meetings
Spinach AI runs daily standups and project meetings for thousands of companies.
Focused meetings - runs the meetings, keeps track of time
Accurate summaries - saved in Google Docs, Notion or Confluence
Ask questions - “what are the open action items from last week?”
Speaks 100 languages
Spinach offers a 14-day trial and takes 30 seconds to set up**.
THE AI ACADEMY
Create AI images on WhatsApp with Meta AI
Open WhatsApp and click on the Meta logo at the top of the screen.
It will open a new chat window for you.
Explain what you want to generate and watch the magic happen.
It will generate images for you in real-time.
You can share your creations with your friends and family on WhatsApp and enjoy.
Prompt used: Imagine a cute golden retriever in front view in a park with his 40-year-old owner, a lady, and children playing with him. It's golden hour, and beautiful sun rays are striking from the background.
FROM THE FRONTIER
Is it Strawberry Season?
Details about OpenAI’s “Strawberry” – a rumored model that could take AI reasoning to the next level – have finally been revealed.
Here they are:
Sources told The Information OpenAI might integrate Strawberry into ChatGPT, instead of releasing it as a standalone model
It’s reportedly so powerful that a team showed it off to American national security officials this summer
It would be able to perform high-level math and logic problems — even those it was never trained on, although it’d take longer to generate results
It could be released as soon as this fall, with a new model code-named Orion coming at a later date
The company is struggling to raise more capital, so this could be just the boost it needs to power through
PROMPT OF THE DAY
Act as an Accountant
Prompt: I want you to act as an accountant and come up with creative ways to manage finances. You’ll need to consider budgeting, investment strategies and risk management when creating a financial plan for your client. In some cases, you may also need to provide advice on taxation laws and regulations in order to help them maximize their profits. My first suggestion request is ‘Create a financial plan for a small business that focuses on cost savings and long-term investments’
Source: gptbot [dot] io
PRESENTED BY GUIDDE
Create video documentation 11x faster with AI
Tired of explaining the same thing over and over again to your colleagues? Guidde is a GPT-powered tool with AI-generated documentation that helps you explain the most complex tasks in seconds.
The best part? Our extension is free. Try it here
AI & TECH NEWS
Everything else you need to know today
Source: Samsung
Answers in a Flash: Google is rolling out three new Gemini variants, including a more powerful Pro model that can tackle complex coding and logic problems.
Unexpected Blessing: Elon Musk has endorsed California’s controversial AI safety bill, arguing governments should monitor LLMs “just as we regulate any product/technology that is a potential risk to the public.”
Appliance Upgrade: Samsung’s touchscreen refrigerators are getting new AI capabilities, while its AI TVs will now receive seven years of updates.
📈 Up & Up: The AI coding platform Cursor raised $60M last week, and already, it’s become the go-to programming app for AI enthusiasts. Users say that when combined with Claude 3.5 Sonnet, the platform can help create entire apps from scratch within minutes. Even former Tesla AI director Andrej Karpathy is a fan, admitting he “can’t imagine going back to ‘unassisted’ coding at this point.”
😎 Claude-voyant: In what’s considered an industry-first bid toward transparency, Anthropic has released the system prompts for its Claude models. This typically top-secret data shows how the model comes to certain decisions — and why it might avoid certain topics altogether.
PRODUCTIVITY
3 AI Tools to Supercharge Your Productivity
✅ Kerlig: An AI-powered writing assistant that can be used in Slack, Figma, Gmail, LinkedIn, and more.
✅ Arold: Use AI to reply to guests within your Airbnb inbox from a single tap.
✅ PackPack: An AI-driven bookmark management tool tailored for saving content from online resources like news and social media.
* indicates a promoted tool, if any
AI-GENERATED IMAGES
Old Books
Source: Reddit user u/MGS023
Prompt: A book cover for 'Lord of the Rings' by J.R.R. Tolkien. At the top, the title 'Lord of the Rings' in mystical, ancient-style letters. At the bottom, the author's name 'J.R.R. Tolkien' in smaller, elegant type. The central image features Mount Doom, depicted in a dramatic, dark landscape with swirling clouds and an ominous, glowing lava flow. This portrayal captures the foreboding and epic nature of the location within the story.
Models: flux1-dev as the model and LORA darkfantasyillustration
Acquire new customers and drive revenue by partnering with us
Superhuman is the world’s biggest AI newsletter for businesses and professionals with 600,000+ readers and 1.5 Million followers on socials working at the world’s leading startups and enterprises. Companies like Amazon, Hubspot, and Salesforce feature their products in Superhuman. You can learn more about partnering with us here.
🧞Your wish is my command
What did you think of today's email?Your feedback helps me create better emails for you! |
Got more feedback or just want to get in touch? Reply to this email and we’ll get back to you.
Thanks for reading.
Until next time!
Zain & the Superhuman AI team