Cerebras unveils faster way to deploy AI

ALSO: Create AI images with Meta AI

Read time: under 4 minutes

Welcome back, Superhuman

All eyes are on Nvidia’s earnings report today, but relative newcomer Cerebras has an announcement that might just beat the world’s second-largest company’s numbers. And: We finally get a sneak peek into OpenAI’s “Strawberry.”

Today’s Insights

  • Cerebras releases the world’s fastest AI inference system

  • Tutorial: How to create AI images with Meta AI

  • From the Frontier: ‘Strawberry’ details revealed

  • Everything else you should know today

  • 3 new AI tools to boost your productivity

  • AI-Generated Images: Old Books

NEXT IN AI

Cerebras releases the world’s fastest AI inference system

Source: Cerebras Systems

We’re still in the AI dial-up era, but a startup called Cerebras wants to do to LLMs what high-speed internet did for web browsing. Earlier this year, it showed off the world’s largest AI chip (it’s about the size of a dinner plate). Now, it’s releasing a new system that can run AI products via the cloud — and at unprecedented speeds.

How it works: Cerebras packed its record-setting chips onto a system called CS-3, then used that infrastructure to build some of the world’s largest supercomputers. Its latest release helps companies put those LLMs to use in the real world. 

What’s inference anyway? 

  • It’s the process of taking in new information, then running it against a dataset that the model was previously trained on

  • It can be used to spot patterns in large swathes of data, and it can help models make decisions much faster than other approaches

  • Inference already makes up about 40% of the AI hardware market, but that figure is steadily ticking up

Why is Cerebras so much faster than its rivals? Traditional GPUs have to interact with external memory each time they crunch a piece of data; but because Cerebras’ chips are so massive, there’s room to fit a ton of memory directly onto them, completely bypassing that step.

The results: Many performance-focused systems have to scale back their accuracy in order to boost speed. But Cerebras says its architecture runs at a native 16-bits, meaning its precision never drops off. When it comes to training Meta’s Llama 3.1, it’s around 20 times faster than comparable Nvidia GPU-based systems — at just one-fifth of the cost.

PRESENTED BY SPINACH AI

AI Agents now join your Zoom and run your meetings

Spinach AI runs daily standups and project meetings for thousands of companies.

  • Focused meetings - runs the meetings, keeps track of time

  • Accurate summaries - saved in Google Docs, Notion or Confluence

  • Ask questions - “what are the open action items from last week?”

  • Speaks 100 languages

Spinach offers a 14-day trial and takes 30 seconds to set up**.

THE AI ACADEMY

Create AI images on WhatsApp with Meta AI

  • Open WhatsApp and click on the Meta logo at the top of the screen.

  • It will open a new chat window for you.

  • Explain what you want to generate and watch the magic happen.

  • It will generate images for you in real-time.

  • You can share your creations with your friends and family on WhatsApp and enjoy.

Prompt used: Imagine a cute golden retriever in front view in a park with his 40-year-old owner, a lady, and children playing with him. It's golden hour, and beautiful sun rays are striking from the background.

FROM THE FRONTIER

Is it Strawberry Season?

Details about OpenAI’s “Strawberry” –  a rumored model that could take AI reasoning to the next level – have finally been revealed.

Here they are:

  • Sources told The Information OpenAI might integrate Strawberry into ChatGPT, instead of releasing it as a standalone model

  • It’s reportedly so powerful that a team showed it off to American national security officials this summer

  • It would be able to perform high-level math and logic problems — even those it was never trained on, although it’d take longer to generate results

  • It could be released as soon as this fall, with a new model code-named Orion coming at a later date

  • The company is struggling to raise more capital, so this could be just the boost it needs to power through

PROMPT OF THE DAY

Act as an Accountant

Prompt: I want you to act as an accountant and come up with creative ways to manage finances. You’ll need to consider budgeting, investment strategies and risk management when creating a financial plan for your client. In some cases, you may also need to provide advice on taxation laws and regulations in order to help them maximize their profits. My first suggestion request is ‘Create a financial plan for a small business that focuses on cost savings and long-term investments’

Source: gptbot [dot] io

PRESENTED BY GUIDDE

Create video documentation 11x faster with AI

Tired of explaining the same thing over and over again to your colleagues? Guidde is a GPT-powered tool with AI-generated documentation that helps you explain the most complex tasks in seconds.

The best part? Our extension is free. Try it here

AI & TECH NEWS

Everything else you need to know today

Source: Samsung

  • Answers in a Flash: Google is rolling out three new Gemini variants, including a more powerful Pro model that can tackle complex coding and logic problems.

  • Unexpected Blessing: Elon Musk has endorsed California’s controversial AI safety bill, arguing governments should monitor LLMs “just as we regulate any product/technology that is a potential risk to the public.”

  • Appliance Upgrade: Samsung’s touchscreen refrigerators are getting new AI capabilities, while its AI TVs will now receive seven years of updates.

📈 Up & Up: The AI coding platform Cursor raised $60M last week, and already, it’s become the go-to programming app for AI enthusiasts. Users say that when combined with Claude 3.5 Sonnet, the platform can help create entire apps from scratch within minutes. Even former Tesla AI director Andrej Karpathy is a fan, admitting he “can’t imagine going back to ‘unassisted’ coding at this point.”

😎 Claude-voyant: In what’s considered an industry-first bid toward transparency, Anthropic has released the system prompts for its Claude models. This typically top-secret data shows how the model comes to certain decisions — and why it might avoid certain topics altogether.

PRODUCTIVITY

3 AI Tools to Supercharge Your Productivity

 Kerlig: An AI-powered writing assistant that can be used in Slack, Figma, Gmail, LinkedIn, and more.

 Arold: Use AI to reply to guests within your Airbnb inbox from a single tap.

 PackPack: An AI-driven bookmark management tool tailored for saving content from online resources like news and social media.

* indicates a promoted tool, if any

AI-GENERATED IMAGES

Old Books

Source: Reddit user u/MGS023

Prompt: A book cover for 'Lord of the Rings' by J.R.R. Tolkien. At the top, the title 'Lord of the Rings' in mystical, ancient-style letters. At the bottom, the author's name 'J.R.R. Tolkien' in smaller, elegant type. The central image features Mount Doom, depicted in a dramatic, dark landscape with swirling clouds and an ominous, glowing lava flow. This portrayal captures the foreboding and epic nature of the location within the story.

Models: flux1-dev as the model and LORA darkfantasyillustration 

Acquire new customers and drive revenue by partnering with us

Superhuman is the world’s biggest AI newsletter for businesses and professionals with 600,000+ readers and 1.5 Million followers on socials working at the world’s leading startups and enterprises. Companies like Amazon, Hubspot, and Salesforce feature their products in Superhuman. You can learn more about partnering with us here.

🧞Your wish is my command

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.

Got more feedback or just want to get in touch? Reply to this email and we’ll get back to you.

Thanks for reading.

Until next time!

Zain & the Superhuman AI team