🖼️ Grok-2 gets an image generator
ALSO: How to distinguish between different chatbots
Read time: under 4 minutes
Welcome back, Superhuman
We’ve all gotten used to typing in prompts, but soon, interacting with AI could feel more like having a conversation with a friend. In today’s newsletter, we’ll explore why that reality might be coming sooner than you think.
Today’s Insights
xAI unveils Grok-2
Tutorial: Comparing different chatbots
Gemini Live early impressions
Everything else you should know today
5 new AI tools to boost your productivity
AI-Generated Images: Vincent Van Dogh
NEXT IN AI
xAI releases its new Grok-2 model on X
Source: CoinGape
If most AI models resemble a friendly colleague, xAI’s latest model might be more “like [your] cool (largely uncensored) uncle,” as one X user put it. A beta version of xAI’s Grok-2 is now available for X Premium subscribers, and for better or worse, early users are having a field day.
What’s it capable of?
The Elon Musk-led startup says its new frontier model is “more intuitive, steerable, and versatile”
It now sits in third place on the LMSYS Chatbot Arena, behind only Gemini 1.5 Pro and the latest version of ChatGPT-4o
It’ll be able to incorporate real-time information from X, and also carries new vision capabilities
The base model, as well as the slimmed-down Grok-2 mini, will be made available to developers later this month
The thing that has users most excited: A new image generator powered by none other than Black Forest Labs’ state-of-the-art Flux model, which is known for its realism and high fidelity. But early experiments suggest xAI’s version lacks the safeguards that limit what you can create on most text-to-image generators.
The evidence: X timelines are already filling up with bizarre creations, featuring pregnant celebrities, gun-toting presidents, and copyrighted TV characters. The fear is that these silly images could quickly veer into darker territory, with few moderators available to filter them out. For now, there are also no watermarks to help users differentiate between what’s real and what’s not.
Taking a step back: It remains unclear whether xAI will eventually make Grok-2 open-source, as it did with Grok-1. If so, it’d potentially be the most powerful LLM fully accessible to developers. In the meantime, the startup is building what may be the “world’s largest supercomputer” in Memphis, which will be fully functional in 2025.
PRESENTED BY PLAYPLAY
How to create high-quality videos with AI in 5 minutes
Create high-quality videos in minutes with AI, and boost engagement with less time and money spent. No editing skills required.
Sign up for a 14-day trial of PlayPlay (no cc required)
Pick from 300+ templates
Upload photos/videos, or choose from Getty stock content
Add logos, brand colors, and key text to communicate your message
Choose from 120+ languages and add AI voiceovers
Click “Create” and share it everywhere!
PlayPlay handles all the animations, transitions, and formatting instantly.
(Need a video even faster? PlayPlay’s new AI Video Assistant creates professional videos from a single sentence.)
THE AI ACADEMY
How to compare different AI models and chatbots
Learn how to compare different AI models and chatbots with Poe here.
AI & VOICEBOTS
Can Gemini Live compete with OpenAI’s Voice Mode?
Source: PCMag
The first one out of the gate isn’t always the winner. That’s the argument the world’s leading search giant is making with Gemini Live, a new voicebot that rivals OpenAI’s Voice Mode. OpenAI showed off its eerily lifelike voice functionality in May, but it’s underdog Gemini Live that will be the first to see a wide release.
Overlapping timelines: ChatGPT Plus subscribers are starting to get antsy. While some got access to Voice Mode in late July, others are still waiting. Alphabet’s equivalent, meanwhile, is already rolling out to Gemini Advanced users with Android phones, while iOS functionality is coming within weeks.
But how does it perform?
Most early users are impressed, with one Wall Street Journal columnist admitting she “almost forgot it was a bot”
The consensus is that Gemini Live is a skilled conversationalist who can engage in all kinds of open-ended discussions, including brainstorming and interview prep
Although it doesn’t yet have the ability to interact with the real world (say, setting an alarm on your behalf), it does reportedly sound human-like thanks to minimal lag and the ability to simply cut it off each time it veers in a direction you’re not interested in
Plot twist: The latest voicebots are apparently so convincing that some people can’t help but see them as companions. For its part, OpenAI said in its latest safety report that it appeared some people had begun to form emotional bonds with its voicebot, à la the 2013 movie “Her.” With more voice assistants flooding the market, we’re entering uncharted territory that could fundamentally alter our social dynamics.
PRESENTED BY GUIDDE
Reduce training time with AI How-To videos
Onboarding multiple new hires?
With Guidde, you can turn training documents into step-by-step videos instantly. Just record an SOP/upload a PDF, edit your AI-generated video, and share. You can even add logos and choose from 35 languages and 100 voices.
Try Guidde today at no cost
AI & TECH NEWS
Everything else you need to know today
Source: Getty Images
Long-Term Memory: Claude now lets developers cache their prompts, meaning they’ll be able to write one elaborate prompt and easily refer back to it again in the future, reducing costs by up to 90%.
Green Light: A judge has ruled that a group of artists can move forward with their copyright case against Stable Diffusion, Midjourney, and other text-to-image generators.
AI Guardian: Sahara AI, a startup co-founded by a USC professor, has raised $43M to help companies like Microsoft and Amazon navigate safety issues while training their AI models.
War Chest: Radical Ventures has raised nearly $800 million for a fund that will invest in new AI startups.
One Fun Thing: Milliseconds can make all the difference during NASCAR races. Now, Lenovo is helping Richard Childress Racing use AI to make its pit stops more efficient. The model has been fine-tuned to know exactly how much fuel a car is expected to burn through, helping pit crews time refueling stops with much more precision.
🧠 Brain Food: Researchers at MIT have compiled what may be the world’s most comprehensive AI risk repository. With more than 700 listed risks, AI companies can reference the database while building safety features for their models.
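For a rough sense of where the "up to 90%" figure in the Long-Term Memory item could come from, here is a minimal back-of-the-envelope sketch. It assumes a cached read is billed at about 10% of the normal input price and the first, cache-writing call at about 125%. The $3-per-million-token price, the 100,000-token prompt, and the 50 calls are made-up illustration values, and the function name is hypothetical. It ignores output tokens and cache expiry, so treat it as an estimate, not a pricing reference.

```python
def input_cost(prompt_tokens: int, calls: int, price_per_mtok: float,
               cached: bool, write_mult: float = 1.25,
               read_mult: float = 0.10) -> float:
    """Estimated input cost (dollars) of sending the same prompt `calls` times.

    write_mult and read_mult are assumed billing multipliers for the
    cache-writing call and for later cache-reading calls.
    """
    if calls <= 0:
        return 0.0
    per_call = prompt_tokens / 1_000_000 * price_per_mtok
    if not cached:
        return per_call * calls
    # The first call writes the cache; every later call reads from it.
    return per_call * write_mult + per_call * read_mult * (calls - 1)


# Illustration values only: a 100,000-token prompt reused 50 times at $3/MTok.
baseline = input_cost(100_000, 50, 3.00, cached=False)
with_cache = input_cost(100_000, 50, 3.00, cached=True)
savings = 1 - with_cache / baseline
print(f"without caching: ${baseline:.2f}")
print(f"with caching:    ${with_cache:.2f}")
print(f"savings:         {savings:.0%}")
```

Under these assumptions the savings approach, but never quite reach, 90% as the number of reuses grows, which fits the "up to" wording.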
PRODUCTIVITY
5 AI Tools to Supercharge Your Productivity
Venturekit: Generate a winning business plan that includes market research, operational tasks, and financial projections.
Minimap: A cartography tool that uses AI to spatially arrange news topics, revealing trends and the breadth of coverage at a glance.
Spinach AI*: The world’s first AI Project Manager. Joins your Zoom, Meet, and Teams calls and captures tasks in Jira, Asana, Monday, Trello, ClickUp, and Linear. Try it here.
Vola Mail: Write email templates with the help of AI, and send them with an API call.
Tusk: Save time and effort by assigning smaller tickets to an AI agent.
PS: Want more? Check out our Top 100 AI Tools.
* indicates a promoted tool, if any
PROMPT OF THE DAY
Sleep Better
Prompt:
List the common sleep disorders that may affect your quality of sleep, such as insomnia, sleep apnea, restless leg syndrome, and narcolepsy, along with their symptoms and potential treatments.
Follow-up prompt:
Compose a personalized sleep routine tailored to your specific needs and schedule, considering factors such as your ideal bedtime, wake-up time, and duration of sleep.
You can adapt the prompt to your specific needs.
Source: Scaz
AI-GENERATED IMAGES
Barky Night
Source: Inspired by @tuanbk20790 on Midjourney
Midjourney Prompt: A cartoon [insert dog breed here] playing the drums, with swirling stars and vibrant colors in the style of Van Gogh's Starry Night. The background is a detailed landscape of rolling hills under a starlit night sky.
--ar 105:128 --v 6.1
Acquire new customers and drive revenue by partnering with us
Superhuman is the world’s biggest AI newsletter for businesses and professionals, with 600,000+ readers working at the world’s leading startups and enterprises. Companies like Amazon, HubSpot, and Salesforce feature their products in Superhuman. You can learn more about partnering with us here.