Gemini now turns photos or prompts into songs
ALSO: How to build better visual AI datasets

Welcome back, Superhuman. Have you ever wondered what a literal soundtrack to your life might sound like? The team at Google DeepMind has. Their latest AI model can turn your photos into songs — complete with lyrics, music, and cover art.
Today: Gemini gets text-to-music, how to build better visual AI datasets, and the latest prompts and trending social posts.
TODAY IN AI
1. DeepMind released Lyria 3, a text-to-music tool built into Gemini: The new music generation model lets anyone generate a 30-second song, complete with lyrics and album art, from a simple text prompt or photo. The model supports eight languages and includes SynthID watermarking to flag AI-generated content. To get started, just click “Create Music” in Gemini and then describe an idea or upload a photo. Free users get 10 tracks per day, while paid subscribers get higher limits. Watch Lyria 3 in action here.
2. Perplexity pivots away from ads in its AI search engine: Perplexity — one of the first major AI companies to run ads alongside chatbot answers — is reportedly now distancing itself from ads, stating that in-chat ads can erode trust in the user experience. The shift comes as the AI industry splits into camps on monetization: OpenAI recently started testing ads in ChatGPT, while Anthropic ran Super Bowl ads mocking the practice and committing to staying ad-free.
3. World Labs raises $1B to build AI that understands the physical world: World Labs just landed $1B in fresh funding, including a $200M investment from design software giant Autodesk. World Labs' flagship product, Marble, lets users generate editable 3D environments from text prompts — and the Autodesk partnership will explore how that tech can plug into real design and entertainment workflows. World Labs was founded by Stanford AI legend Fei-Fei Li and was previously reported to be valued at $5B.
PRESENTED BY HUBSPOT
Are you drowning in tasks while dreaming of a way to clone yourself? Your productivity breakthrough has arrived.
Get the exact templates used by top performers to delegate 80% of their routine work to AI
Access our "AI Assistant Command Center" - a ready-to-use system for managing all your AI tools
Leverage advanced prompts that turn ChatGPT into your 24/7 productivity partner
Over 10,000 professionals have already transformed their productivity using these templates.
FROM THE FRONTIER
In a new version of the internet, AI agents compete to survive — or die.

Made with Midjourney
Creating a new internet. Sigil Wen skipped college to hack alongside the founders of Anthropic, Perplexity, and the creators of DALL·E. Now he's built Conway — a new internet infrastructure designed not for humans, but for AI agents, giving them something they've never had: autonomous access to the world.
Meet the Automaton. There’s just one problem with a fully autonomous internet: compute isn’t free. For an AI to be truly autonomous, it needs to support its own existence. Enter: The Automaton, the first AI that can build products, register domains, send emails, or create social media content — all with one goal in mind.
Digital survival. Every Automaton competes in digital natural selection, taking the initiative to earn money, survive, and replicate. The agents run continuously for as long as they can afford to stay alive, automatically upgrading when new software models drop. If they run out of money, they stop existing.
The start of Web 4.0? Conway is a very early look at an evolved form of the internet where AI agents can write, own, earn, and transact — all without human help. Will it come to fruition? Only time will tell. Read the full viral essay here (3.9M views).
THE AI ACADEMY
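Install open-source FiftyOne: pip install fiftyone
Explore a multimodal 2D/3D detection dataset
Run this Python script:
import fiftyone as fo
import fiftyone.zoo as foz
# Download (on first run) and load the grouped 2D/3D quickstart dataset
dataset = foz.load_zoo_dataset("quickstart-groups")
# Open the FiftyOne App to browse the dataset
session = fo.launch_app(dataset)
# Keep the script alive until the App is closed
session.wait()
Follow this hands-on guide to implement a data-centric training loop.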
You will:
Compute embeddings to analyze dataset structure and coverage
Apply ML techniques to prioritize high-value samples for labeling
Annotate and refine labels directly in FiftyOne
Train and evaluate your perception model on curated data
Inspect failure cases to find blind spots and data gaps
Iteratively select new samples to label, closing the curate → annotate → train → evaluate loop
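To make the "prioritize high-value samples" step concrete, here is a minimal sketch of one common selection strategy: greedy farthest-point (k-center) sampling over embeddings. It is an illustration only and not FiftyOne's API. The function name farthest_point_selection, the budget parameter, and the toy 2D embeddings are all assumptions made up for this example; in practice the vectors would come from a real embedding model.

```python
import math


def farthest_point_selection(embeddings, budget, seed_index=0):
    """Greedily pick `budget` samples that cover the embedding space.

    Hypothetical helper for illustration: each step picks the sample
    farthest from everything already chosen, so near-duplicates are
    labeled last.
    """
    if not embeddings or budget <= 0:
        return []
    budget = min(budget, len(embeddings))

    selected = [seed_index]
    # Distance from every sample to its nearest already-selected sample
    min_dist = [math.dist(e, embeddings[seed_index]) for e in embeddings]

    while len(selected) < budget:
        chosen = set(selected)
        candidates = [i for i in range(len(embeddings)) if i not in chosen]
        # The least-covered sample is the most valuable one to label next
        next_index = max(candidates, key=lambda i: min_dist[i])
        selected.append(next_index)
        # Update coverage distances with the newly selected sample
        for i, e in enumerate(embeddings):
            min_dist[i] = min(min_dist[i], math.dist(e, embeddings[next_index]))

    return selected


# Toy 2D "embeddings": two tight clusters plus one outlier (made-up data)
embeddings = [(0.0, 0.0), (0.1, 0.0), (10.0, 0.0), (10.0, 0.1), (5.0, 5.0)]
picked = farthest_point_selection(embeddings, budget=3)
print(picked)  # prints [0, 3, 4]
```

With a labeling budget of three, the sketch picks one sample from each cluster plus the outlier and skips the near-duplicates. That is the intuition behind curating by coverage rather than labeling everything.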
PRESENTED BY BEEHIIV
The hardest part about starting a newsletter is finding an audience. That’s why so many operators switch to beehiiv — an all-in-one platform that lets you grow on autopilot, reach a global ad network, and find high-quality subscribers through referrals and recommendations. All so you can focus on great content.
135K+ creators grow 2.75x faster with beehiiv. Switching takes minutes.
IN THE KNOW
What’s trending on socials and headlines today

Meme of the day
👀 Best AI Model: A Wharton professor just published a guide on which AI model to use right now. It’s the eighth version he has written so far because AI keeps evolving so quickly.
🦞 OpenClaw Security: Bolster your OpenClaw’s security by telling it to follow these 10 precautions.
🤖 Claude in Excel: Claude’s latest update lets you pull outside data into Excel from sources like S&P Global, PitchBook, or Moody’s (1.8M views).
🤯 Realistic DeepFakes: A viral Reddit post shows why it might be time to stop trusting what you see on social media entirely.
PRODUCTIVITY
5 New & Trending AI Tools
🚀 Granola*: Turns your meeting notes into summaries, actions & follow-ups — from your POV. Try it free for a month with code SUPERHUMAN.
🔍️ ZeroRank: Track and improve your brand's visibility in AI search.
🤖 Crano: Create stunning AI-generated videos and images in minutes.
📹️ Kolva: Instantly record, transcribe, and get AI-powered summaries of your meetings.
🎨 Moda: Create fully-editable, on-brand visual assets on a real canvas you control.
* indicates a promoted tool, if any
PROMPT STATION
Viral Content Generator
Prompt: You are a Viral Content Strategist with expertise in digital psychology, storytelling, and buyer behavior. Your job is to analyze the attached post (image, carousel, or video) and turn it into five scroll-stopping, sales-driven content ideas tailored to my niche.
Ask me one question at a time and wait for my response before continuing. Start by asking:
What's my product or service?
Who's my target audience?
Which platform are we creating for (Instagram, LinkedIn, TikTok, etc.)?
Once I've answered, confirm that the post has been uploaded and is ready for analysis.
Analyze the attached post by identifying the Hook Strategy (what grabs attention fast), Content Angle (emotional, educational, relatable, or authority-driven), Engagement Trigger (why people comment, share, or save), Format (carousel, reel, or static post), and Core Message (the main idea or mindset shift).
After the analysis, generate five unique content ideas customized to my product, audience, and platform.
Each idea should include:
Hook: A strong first line that stops the scroll.
Angle: How it fits my niche, offer, and audience pain point.
Why It Works: A one-line reason for its viral or sales potential.
Maintain a human, confident, and strategic tone throughout. Avoid generic or overused words like transform, empower, elevate, or leverage. Focus on clarity, curiosity, and conversion psychology to make every idea sound original, conversational, and brand-worthy.
Cartoon Movie Posters

Seedream Prompt: [Attach original poster] Transform this movie poster into a vibrant hand-drawn cartoon illustration, made entirely of bold ink lines and bright colors. Each detail (faces, text, and background) is recreated using classic animation style with thick outlines, flat color fills, and expressive character designs. The overall look is playful, energetic, and clearly hand-illustrated, as if drawn by a skilled cartoon animator with exaggerated features and dynamic poses. The composition and layout of the original poster are preserved, but reimagined in authentic 2D animation form with simplified shapes, cel-shading, and that timeless cartoon charm.
Whenever you’re ready to take the next step
Check out our Top 125 AI Tools
Get better at prompting with our Top 1,000 Prompts
Learn how to use AI at work and get certified with our free course
Grow customers & revenue: Join companies like Amazon, HubSpot, and Salesforce. Showcase your product to our 1M+ readers and 2M+ followers on socials. Get in touch.
What did you think of today's email?
Your feedback helps me create better emails for you!
Until next time — Zain, Theodore, & the Superhuman AI team



