- Tech Momentum
- Posts
- Amazon Nova Sonic & Wiz of OZ
Amazon Nova Sonic & Wiz of OZ
Google reboots The Wizard of Oz with AI, while Amazon launches Nova Sonic for real-time voice. Visual search gets a big AI boost.

Welcome to Tech Momentum!
Hey there! What if classic cinema met cutting-edge AI? Google just turned The Wizard of Oz into an immersive tech spectacle, Amazon gave AI a voice, and search is no longer just about typingāwelcome to the new era of human-machine magic. Donāt blink, the future is talking back.
Letās break it all down!
Updates and Insights for Today
Google Enhances AI Mode with Multimodal Search Capabilities
Amazon Unveils Nova Sonic
Google Uses AI to Reimagine āThe Wizard of Ozā for Las Vegas Sphere
The latest in AI tech
AI Tutorials: Crafting an AI-Powered Startup Pitch Generator with Gemini Pro
AI tools to checkout
AI finding/resources
AI News
Google Enhances AI Mode with Multimodal Search Capabilities

Quick Summary:
Google has updated its AI Mode in Search to include multimodal capabilities, allowing users to interact using images alongside text. This integration combines Google Lens with the Gemini AI model, enabling comprehensive responses to image-based queries.
Key Insights:
AI Mode now interprets and responds to image inputs, enhancing search versatility.
The integration of Google Lens and Gemini AI facilitates a deeper understanding of visual content.
Users can receive detailed, context-aware information by uploading or capturing images.
This feature is accessible via the Google app on both Android and iOS platforms.
Why Itās Relevant:
This advancement signifies a shift towards more intuitive search experiences, accommodating various input forms beyond text. By enabling image-based queries, Google enhances user engagement and accessibility, reflecting a broader trend of integrating AI to interpret and process diverse data types effectively.
š Read More: Google
Our Partner Today
Find out why 1M+ professionals read Superhuman AI daily.
In 2 years you will be working for AI
Or an AI will be working for you
Here's how you can future-proof yourself:
Join the Superhuman AI newsletter ā read by 1M+ people at top companies
Master AI tools, tutorials, and news in just 3 minutes a day
Become 10X more productive using AI
Join 1,000,000+ pros at companies like Google, Meta, and Amazon that are using AI to get ahead.
Amazon Unveils Nova Sonic for Real-Time Conversational AI

Quick Summary: Amazon has introduced Nova Sonic, a speech-to-speech foundation model designed to enable real-time, human-like voice interactions in AI applications. This model integrates speech recognition and generation into a single architecture, enhancing conversational AI experiences.
Key Insights:
Nova Sonic unifies speech understanding and generation, streamlining the development of voice-enabled applications.
The model supports function calling and knowledge grounding using Retrieval-Augmented Generation (RAG).
Developers can access Nova Sonic through Amazon Bedrockās new bidirectional streaming API for real-time interactions.
Initially, it offers support for American and British English, with plans to include additional languages.
Why Itās Relevant:
Nova Sonic represents a significant advancement in conversational AI, offering developers a streamlined approach to creating natural and responsive voice interfaces. Its integration into Amazon Bedrock simplifies deployment, making sophisticated voice interactions more accessible across various applications.
š Read More: Amazon
Google Uses AI to Reimagine āThe Wizard of Ozā for Las Vegas Sphere

Quick Summary:
Google, in collaboration with Sphere Entertainment, has reimagined the classic 1939 film āThe Wizard of Ozā for the Las Vegas Sphere, utilizing advanced AI techniques to enhance and expand the original footage for an immersive viewing experience.
Key Insights:
Google DeepMind and Google Cloud employed AI methods like āperformance generationā and āoutpaintingā to upscale and extend scenes beyond their original frames.
Over 90% of the film has been AI-modified using generative models such as Veo 2 and Imagen 3, creating wider shots and complete character renderings not visible in the original.
The project involved collaboration with professional filmmakers, including Oscar-nominated producer Jane Rosenthal, to ensure authenticity and quality.
The enhanced film is set to debut at the Sphere on August 28, 2025, offering audiences a novel, immersive interpretation of the beloved classic.
Why Itās Relevant:
This initiative showcases the transformative potential of AI in the entertainment industry, enabling the revitalization of classic films for modern, immersive experiences. It exemplifies how technology can bridge the past and present, offering audiences innovative ways to engage with timeless stories.
š Read More: Google
AI Tutorials
Crafting an AI-Powered Startup Pitch Generator with Gemini Pro

Source: marktechpost.com
Quick Summary:
This tutorial guides you through creating an AI application that generates startup pitch ideas using Googleās Gemini Pro model, integrated with LiteLLM, Gradio, and FPDF in Google Colab.
Key Insights:
LiteLLM Integration: Facilitates seamless interaction with the Gemini Pro model.
Gradio Interface: Provides a user-friendly platform for input and output operations.
FPDF Utilization: Enables exporting generated pitches as PDF documents.
Google Colab Environment: Offers an accessible, cloud-based coding platform.
What Can I Learn?
How to integrate the Gemini Pro model using LiteLLM.
Setting up a Gradio interface for interactive user inputs and outputs.
Utilizing FPDF to export generated content as PDFs.
Implementing the entire application within Google Colab.
Which Benefits Do I Get?
Acquire skills to develop AI-powered applications for business purposes.
Learn to create user-friendly interfaces for AI models.
Understand the process of exporting AI-generated content into professional formats.
Gain experience working within cloud-based coding environments like Google Colab.
Why Itās Relevant:
For entrepreneurs and developers, this guide offers a practical approach to harnessing AI for crafting compelling startup pitches. By combining powerful tools like Gemini Pro and Gradio, users can streamline the pitch creation process, enhancing efficiency and presentation quality.
Read More About The AI Tutorial š Click Here
The latest in AI tech
AI in Mental Health Therapy
AI bots can now offer therapy as effectively as human clinicians, according to a new study. They help treat anxiety and depression with significant results.
š Read More: https://www.npr.org/sections/shots-health-news/2025/04/07/nx-s1-5351312/artificial-intelligence-mental-health-therapy
Shopify Makes AI Mandatory
Shopify CEO Tobias Lütke demands all employees adopt AI as a baseline skill. Itās a strategic push to keep the company AI-competitive.
š Read More: https://www.pymnts.com/artificial-intelligence-2/2025/shopify-ceo-tobias-lutke-employees-must-learn-to-use-ai-effectively/
Nurture Beats Nature for Robot Hands
USC researchers found that training sequences (not sensors) are key for robotic hand dexterity. Itās all about the right learning path.
š Read More: https://viterbischool.usc.edu/news/2025/04/nurture-more-important-than-nature-for-robotic-hands/
Deepfake Law in New Jersey
New Jersey criminalizes harmful deepfakes with up to 5 years in prison. Victims can now also sue, setting a new standard for AI regulation.
š Read More: https://apnews.com/article/new-jersey-deepfake-videos-criminal-civil-penalties-276ca23b00b10a7ee7e7303ead8b4260
AI Tools to check out
3D AI Studio: Type a prompt or upload an image & get 3D models instantly.
AI Agents: Empower your team with on-demand AI teammates
Arcwise: Your AI Data Analyst, built into your current spreadsheet
OssaAI: Chat to Video for Social Media
Thanks for sticking with us to the end!
We'd love to hear your thoughts on today's email!
Your feedback helps us improve our content
āāāSuperb
āāNot bad
ā Could've been better
Not subscribed yet? Sign up here and send it to a colleague or friend!
See you in our next edition!
Tom