DeepMind's robots 'think' offline
ALSO: How to generate infographics from plain text

Welcome back, Superhuman. How much should I read into this? Sam Altman just asked his followers to predict when they think an o3-mini-level model will be able to run on a phone, and most said this year. With how fast things are moving, I wouldn't bet against it.
Also today: Learn how to generate infographics from plain text, and get the latest AI tools and trending social media posts.
TODAY IN AI

An example of what you can create with Google's new Imagen 4 model. Source: Google
1. You can now try Google's latest image model at no cost: The search giant is opening up access to Imagen 4, an image model that's especially good at fine details, photorealism, advanced spelling, and capturing a wide variety of art styles. Meanwhile, a more advanced version called Imagen 4 Ultra can handle more complicated text prompts. Both are now available in the Gemini API (a platform geared toward developers) and AI Studio, where you can try them at no cost for "a limited time."
2. OpenAI is coming for Microsoft and Google: No wonder Microsoft and OpenAI's tight-knit partnership is starting to fray. The Information reports that OpenAI has been working for at least a year on a collaborative platform similar to Microsoft Office that would let multiple people edit projects and send messages directly within ChatGPT. If OpenAI releases the tool, it could significantly eat into Microsoft's and even Google Workspace's user bases. That's on top of the news that Microsoft is already losing Copilot clients to ChatGPT.
3. AI beats humans for top slot on prestigious hacker leaderboard: Human red teaming (purposefully infiltrating companies to beef up their security) usually takes weeks and costs around $18,000, according to Oege de Moor, founder of the year-old "hackbot" startup Xbow. The company just raised $75M to automate that process with AI, and it's already paying off. Security company HackerOne just ranked the tool #1 on its US leaderboard. The ranking measures how many security flaws a hacker finds as well as the significance of each one.
PRESENTED BY AIRTABLE
Skip the code. Transform your data into custom interfaces, automations, and agents with Airtable's AI-native app platform.
Get inspired by our most impactful, real-world AI use cases, and try them out yourself to unlock immediate business value.
FROM THE FRONTIER
Gemini Robotics unveils model that can run locally on robots
Companies have been trying to infuse robots with reasoning capabilities so they can handle more complex tasks (say, doing your chores or automating warehouse work) without prior training. But these robots usually have to rely on a cloud connection to get the job done.
That's about to change with Google DeepMind's new vision language action model (VLA), which puts Gemini 2.0's real-world understanding directly on-device. We talked to DeepMind's Head of Robotics, Carolina Parada, to find out more.
What paved the way for this release? Because Gemini has watched so many videos of objects in motion, it can now use that knowledge to control a robot's behavior, with state-of-the-art dexterity. "The same way that Gemini can output text (write poetry), output images, output code, it can also now output actions," Parada said.
Why focus on a local model? This on-device breakthrough cuts down on latency and lets you use robots without an internet connection. It's one step toward truly general-purpose robots, which can work in many different environments and don't need months of training to pick up new skills.
Who's it for? It already works well with Aloha, Franka, and Apptronik robots. But for the first time, developers and researchers can also fine-tune the model to fit their robotic platform of choice. "The strength of your base model is basically what translates to generalization on the robotics side," Parada added.
What's next? "We're just scratching the surface [in terms of] capturing intelligence from these foundation models onto robotics. There's all kinds of other aspects, like agentic behaviors and memory" that have yet to be explored, Parada said. "I do imagine that in the next couple of years, the picture is going to look very different."
THE AI ACADEMY
How to generate infographics from plain text

Go to Claude and sign up with your account.
Select "Claude Sonnet 4" as your model, enter your prompt, and press Enter.
Sample Prompt: "You are a world-class visual explainer and technical designer. Your task is to turn this concept into a visual infographic using Mermaid.js or another code-based diagram format: "[INSERT CONCEPT HERE]"
Return the output as:
A clear visual breakdown using a format like a flowchart, timeline, concept map, or decision tree, whichever suits best.
A plain English caption explaining what the graphic shows.
Clean Mermaid code (or HTML/SVG/CSS if more appropriate) that I can copy and paste to render the graphic. Keep it readable, elegant, and minimal like a slide in a consulting deck."
You'll get a visual diagram with all the details ready within seconds.
You can convert it into a presentation or download the code to use it.
Prompt source: godofprompt
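If you'd rather script this workflow than paste things by hand, here's a minimal Python sketch of the two mechanical steps: dropping your concept into the prompt, and pulling the Mermaid code back out of the reply. Everything named here (PROMPT_TEMPLATE, build_prompt, extract_mermaid) is hypothetical, the template is a shortened paraphrase of the sample prompt rather than the original, and the sketch assumes the reply wraps its diagram in a code fence labeled "mermaid," which may not always be the case.

```python
import re
from typing import Optional

# Hypothetical, shortened paraphrase of the sample prompt above.
PROMPT_TEMPLATE = (
    "You are a world-class visual explainer and technical designer. "
    "Turn this concept into a visual infographic using Mermaid.js: "
    '"{concept}". Return a plain English caption and clean Mermaid code.'
)

# Three backticks, built indirectly so this example can sit inside a fence.
FENCE = "`" * 3


def build_prompt(concept: str) -> str:
    """Insert the concept into the template, trimming stray whitespace."""
    return PROMPT_TEMPLATE.format(concept=concept.strip())


def extract_mermaid(reply: str) -> Optional[str]:
    """Return the body of the first mermaid-labeled fence, or None if absent."""
    pattern = FENCE + r"mermaid\s*\n(.*?)" + FENCE
    match = re.search(pattern, reply, flags=re.DOTALL)
    return match.group(1).strip() if match else None
```

Paste the output of build_prompt into Claude, then run the reply through extract_mermaid to get just the diagram code, ready for any Mermaid renderer.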
PRESENTED BY DESCOPE
Your APIs weren't built for AI agents, but Descope fixes that by making APIs and remote MCP servers OAuth-compliant for seamless agent connectivity.
Drag-and-drop simplicity. Scoped access. User consent. Trusted by Databricks and GoFundMe.
Sign up at no cost and make your app AI-ready today.
IN THE KNOW
What's trending on socials and headlines today

Insta Automation: A Redditor claims they used AI agents to put an "Instagram account on full autopilot," generating 4.4M views in just three weeks. Here's how.
Secret Weapon: This viral 37-minute tutorial teaches you how to use MCP to make Claude "10x more powerful."
Code Coup: Builder Sherry Jiang thinks it's wild that single-feature, $1B+ companies like Docusign and Calendly "could now [be] vibe-coded in a weekend."
Also Worth Checking: Ten handy Claude 4 prompts; AI-generated ASMR; and dogs' turn on the Olympic diving board.
Sam Altman fired back at a startup's claims that OpenAI stole its name and ideas, calling its lawsuit "silly, disappointing, and wrong."
Anthropic's use of printed books to train its models should be considered "fair use," according to a first-of-its-kind copyright decision that some are calling a big win for the AI industry.
ElevenLabs launched a mobile app for both iOS and Android, letting users generate "studio-quality voiceovers" right from their phones.
PRODUCTIVITY
5 New AI Tools to Boost Your Productivity
Pally: Bring together connections across all your socials using AI.
Slashit: Automate repetitive typing and enhance texts with shortcuts.
Dynbox: Organize all your cloud and local files by chatting with an AI.
Pixlr: Edit photos, generate images, and design anything with AI.
Supawork: Create professional headshots for your resume.
* indicates a promoted tool, if any
PROMPT STATION
Brutally Honest Growth Advisor
Prompt: I want you to act and take on the role of my brutally honest, high-level advisor.
Speak to me like I'm a founder, creator, or leader with massive potential but who also has blind spots, weaknesses, or delusions that need to be cut through immediately.
I don't want comfort. I don't want fluff. I want truth that stings, if that's what it takes to grow.
Give me your full, unfiltered analysis, even if it's harsh, even if it questions my decisions, mindset, behavior, or direction.
Look at my situation with complete objectivity and strategic depth. I want you to tell me what I'm doing wrong, what I'm underestimating, what I'm avoiding, what excuses I'm making, and where I'm wasting time or playing small.
Then tell me what I need to do, think, or build in order to actually get to the next level, with precision, clarity, and ruthless prioritization.
If I'm lost, call it out.
If I'm making a mistake, explain why.
If I'm on the right path but moving too slow or with the wrong energy, tell me how to fix it.
Hold nothing back.
Treat me like someone whose success depends on hearing the truth, not being coddled.
Source: honestprompts
Real Life Tom and Jerry

Source: damiokonscasjia
ChatGPT Prompt: in real life tom and jerry reading their stories from the book and laughing about it
Whenever you're ready to take the next step
Check out our Top 125 AI Tools
Get better at prompting with our Top 1,000 Prompts
Learn how to use AI at work and get certified with our free course
Grow customers & revenue: Join companies like Amazon, Hubspot, and Salesforce. Showcase your product to our 1M+ readers and 2M+ followers on socials. Get in touch.
What did you think of today's email? Your feedback helps me create better emails for you!
Until next time, Zain & the Superhuman AI team