Google's Gemini AI can now analyse your audio files: All you need to know

Google's Gemini app now lets Android, iOS, and web users upload audio files for AI to do a comprehensive analysis, opening new use cases like transcribing interviews, lectures, and voice memos

Gemini
Gemini
Aashish Kumar Shrivastava New Delhi
2 min read Last Updated : Sep 09 2025 | 12:56 PM IST
Gemini can now analyse your audio files. Google has enabled the option across platforms, including Android, iOS and web, for users to upload audio files for analysis on the Gemini app. The vice president of Google Labs and Gemini, Josh Woodward, announced this development on X (formerly Twitter). As per a Google support page, free-tier users can upload up to 10 minutes of audio and have five daily prompts.
 
This opens Google Gemini AI to new use cases – think transcribing interviews, analysing voice memos, or converting lecture audio into searchable AI dialogue. It lifts the app beyond typed or spoken prompts, offering multimodal input.
 
Along with this, Google has added support for five languages, including Hindi, to Search’s AI mode. Furthermore, as per Woodward’s reply to a comment on this post revealed that now Google is working to build an Auto mode in Gemini so that users won’t have to manually switch Gemini Flash and Pro mode. 

Limit for uploading audio files in Gemini

Free-tier users can upload up to 10 minutes of audio and have five daily prompts. Those subscribed to Gemini AI Pro or AI Ultra can use uploads up to three hours, across up to 10 files, even packaged within a single ZIP archive.

Expanding multilingual reach in Google Search

 
Also on Monday, Google enhanced its Search AI Mode by adding support for five new languages: Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese. Powered by Gemini 2.5 technology, this expansion allows more users globally to ask complex questions in their native tongues while using Search’s AI-driven tools to explore content. 

NotebookLM gets smarter

Furthermore, Google’s NotebookLM – a Gemini-powered research assistant – has gained the ability to generate structured reports such as blog posts, study guides, quizzes, and flashcards, all from uploaded documents or media. These are now available in over 80 languages. Users can select the desired style and tweak tone or format, with full rollout expected by end of week, according to a comment shared on X.
 
While audio upload is entirely new to the Gemini app, NotebookLM had already offered audio input, maintaining its positioning as a deep-dive research assistant rather than a conversational interface.
*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

More From This Section

Topics :GoogleGemini AIAI Models

First Published: Sep 09 2025 | 10:43 AM IST

Next Story