Google's Gemini AI can now analyse your audio files: All you need to know

Google's Gemini app now lets Android, iOS, and web users upload audio files for AI to do a comprehensive analysis, opening new use cases like transcribing interviews, lectures, and voice memos

2 min read Last Updated : Sep 09 2025 | 12:56 PM IST

Gemini can now analyse your audio files. Google has enabled the option across platforms, including Android, iOS and web, for users to upload audio files for analysis on the Gemini app. The vice president of Google Labs and Gemini, Josh Woodward, announced this development on X (formerly Twitter). As per a Google support page, free-tier users can upload up to 10 minutes of audio and have five daily prompts.

This opens Google Gemini AI to new use cases – think transcribing interviews, analysing voice memos, or converting lecture audio into searchable AI dialogue. It lifts the app beyond typed or spoken prompts, offering multimodal input.

Along with this, Google has added support for five languages, including Hindi, to Search’s AI mode. Furthermore, as per Woodward’s reply to a comment on this post revealed that now Google is working to build an Auto mode in Gemini so that users won’t have to manually switch Gemini Flash and Pro mode.

ALSO READ: iPhone 17 series launch: Where to watch Apple's 'Awe dropping' event live

Limit for uploading audio files in Gemini

Free-tier users can upload up to 10 minutes of audio and have five daily prompts. Those subscribed to Gemini AI Pro or AI Ultra can use uploads up to three hours, across up to 10 files, even packaged within a single ZIP archive.

Expanding multilingual reach in Google Search

Also on Monday, Google enhanced its Search AI Mode by adding support for five new languages: Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese. Powered by Gemini 2.5 technology, this expansion allows more users globally to ask complex questions in their native tongues while using Search’s AI-driven tools to explore content.

ALSO READ: Google's Gemini AI reportedly unsafe for teens despite guardrails

NotebookLM gets smarter

Furthermore, Google’s NotebookLM – a Gemini-powered research assistant – has gained the ability to generate structured reports such as blog posts, study guides, quizzes, and flashcards, all from uploaded documents or media. These are now available in over 80 languages. Users can select the desired style and tweak tone or format, with full rollout expected by end of week, according to a comment shared on X.

While audio upload is entirely new to the Gemini app, NotebookLM had already offered audio input, maintaining its positioning as a deep-dive research assistant rather than a conversational interface.

Already subscribed? Log in

Subscribe to read the full story →

^*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%

*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

In-depth market analysis & insights with access to The Smart Investor

Ad-free Reading

Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

Access Business Standard across devices — mobile, tablet, or PC, via web or app