ChatGPT can now see, hear, and speak

ALSO: Over $4 billion in new AI funding announcements

Read time: 2.5 minutes

Welcome back, superhuman

Another day, another big announcement from OpenAI. This time, the company has unveiled major ChatGPT upgrades that let the chatbot see your images, hear your voice, and speak to you in a human-like voice.

TODAY’S MENU

  • ChatGPT can now see, hear, and speak

  • 5 new AI tools to boost your productivity

  • Funding Roundup: Over $4 billion in funding announcements

  • Tutorial: Control image composition in Midjourney

  • AI Generated Images: Pets as popular TV shows

TODAY IN AI & TECH

  • Hola Listeners: Spotify will use AI to clone podcaster voices and translate them into other languages.

  • DJ GPT: Spotify also said they won’t be banning AI-generated music.

  • Catch That: The NFL and Amazon are using AI to create new football stats.

  • Phew: Sam Altman reportedly says GPT-5 and GPT-6 will remain short of AGI.

  • Stocked and Loaded: Getty Images to release new AI tool to generate stock images.

INSIGHT

OpenAI launches a new chapter of ChatGPT with multimodal features

Source: OpenAI

ChatGPT can now see your images, hear your voice, and answer your questions in a human-like voice. Multimodality — the ability to process images and voice, in addition to text — is finally here.

Here’s what you need to know:

Voice

  • You can now use voice to engage in back-and-forth conversations with ChatGPT and use it as an assistant.

  • You can enable voice from the New Features section in Settings. You can then choose between 5 different voices.

  • You can check out voice samples here.

Images

  • You’ll be able to take or upload images into ChatGPT and ask questions about the image.

  • Questions can range from why your grill isn’t working to what a graph means.

  • You’ll be able to focus on a specific part of the image with the drawing tool and ask questions about it.

  • You will also be able to upload screenshots and documents containing both text and images.

  • You can watch a sample video of the feature here.

Both features will roll out to ChatGPT Plus and Enterprise users over the next two weeks. Developers will also be given access in the future.

You can get more details on the new features here.

TOGETHER WITH AE STUDIO

The AI team you didn’t know your company needed — until now

Hire world-class AI experts from Harvard, Stanford and Princeton

Not sure how to implement the right AI strategy for your product? Hire AE Studio's world-class team of software builders to craft and implement the optimal AI solution for your business.

Our development, design, and data science teams work closely with founders and executives to create custom software and AI solutions.

From custom-built MVPs to bespoke AI/ML solutions, see how you can leverage AI to achieve your business objectives.

5 AI Tools to Supercharge Your Productivity

FireCut: Speed up your video editing by automating time-consuming tasks so you can focus on the creative stuff.

Datatera: Transform your diverse data formats or websites into structured forms for analysis and save time.

Akooda (sponsored): Discover Akooda's AI Rev-Ops Intelligence Platform. Unlock the potential of your processes, team, and resources to make your business win! Request a demo

Vespio: Increase your win rate and revenue by catching hidden customer desires from every conversation.

Zing DataGPT: Query your data using AI. Use natural language to create fully interactive charts and tables.

Careers

👋 Come work with us at Superhuman

Want to level up your career and work at one of the biggest and fastest growing technology newsletters in the world? Check out our openings below:

Interested? Send us your resume at [email protected] and we’ll get back to you before the end of the week!

FUNDING AND ACQUISITIONS

A roundup of the biggest deals in AI this week

  • Pryon raises $100 million to index and analyze enterprise data. Read more →

  • Amazon steps up AI race with up to $4 billion investment in Anthropic. Read more →

  • Levelpath secures $44.5 million in funding for AI-powered procurement platform. Read more →

  • HiddenLayer raises $50 million to bolster defenses of enterprise AI models. Read more →

  • Corti, an AI 'co-pilot' for healthcare clinicians, raises $60 million. Read more →

  • Secoda raises $14 million to bring AI-driven, Google-like search to enterprise data. Read more →

AI TUTORIAL

How to control image composition in Midjourney

Want to create professional-looking shots? Master composition, make each image pop, and let your photos stand out. Try the prompt below, replacing (image composition) with one of the tokens listed underneath:

Spiderman, (image composition) --v 5.2

You can use any of the tokens given below to get your desired image composition:

  • Wide Angle

  • Diagonal Angle

  • Oblique Angle

  • Macro

  • Aerial

  • High Angle

  • Low Angle

  • Close-up

Source: @ciguleva on X
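
If you want to try every composition token in one go, the template above can be filled in with a few lines of Python. This is only a rough sketch: the helper name build_prompt and the COMPOSITION_TOKENS list are our own illustrative choices, not part of Midjourney, and the script just prints prompt text for you to paste into Midjourney yourself.

```python
# Composition tokens copied from the tutorial list above.
COMPOSITION_TOKENS = [
    "Wide Angle",
    "Diagonal Angle",
    "Oblique Angle",
    "Macro",
    "Aerial",
    "High Angle",
    "Low Angle",
    "Close-up",
]


def build_prompt(subject: str, composition: str, version: str = "5.2") -> str:
    """Hypothetical helper: fill in the tutorial's prompt template.

    Produces text of the form "<subject>, <composition> --v <version>".
    """
    if composition not in COMPOSITION_TOKENS:
        raise ValueError(f"Unknown composition token: {composition!r}")
    return f"{subject}, {composition} --v {version}"


if __name__ == "__main__":
    # Print one prompt per composition token for the example subject.
    for token in COMPOSITION_TOKENS:
        print(build_prompt("Spiderman", token))
```

Each printed line is one ready-to-paste prompt, so you can compare how the same subject looks across all eight compositions.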

AI-GENERATED IMAGES

Pets as popular TV shows

Source: u/babumoshaiiii on Reddit

📈 Feature your product in the world’s biggest AI newsletter

Superhuman is the world’s biggest and fastest-growing AI newsletter with 400,000+ readers working at companies like Apple, Meta, Amazon, Google, Microsoft, and many more. Companies like Masterworks, Brave, SEMRush, and 1Password have featured their products in Superhuman. You can book ad spots here.  

🧞 Your wish is my command

What did you think of today's email?

Your feedback helps me create better emails for you!

Reviews of the day

Thanks for reading.

Until next time!

P.S. If you want to sign up for this newsletter or share it with a friend or colleague, you can find us here.