The Gemini app, developed by Google, has introduced a highly anticipated feature that allows users to upload audio files for analysis. Previously, while the app excelled at processing text, images, and video, it lacked the capability to interpret audio content. This significant enhancement is now available across devices, including Android, iOS, and the web.
How the New Feature Works
Using the audio upload function is straightforward. Users can access the ‘plus’ menu in any Gemini chat, then choose “Files” on mobile or “Upload files” on the web. Once there, users can select audio formats such as MP3, M4A, or WAV. This feature is not just a simple addition; it opens up a range of practical applications that can greatly improve user productivity.
Transcribing audio is one of the most immediate benefits, making it easier to convert spoken content into written form. For instance, users can upload a recording of a meeting and request a bulleted summary along with a list of action items. Students can upload a college lecture to extract key concepts for a study guide, while journalists might find it useful to analyze interviews and highlight essential quotes.
Usage Limitations and Accessibility
As with many advanced features, there are restrictions based on user subscription tiers. Free users can upload audio files totaling up to 10 minutes in length. In contrast, subscribers of the Google AI Pro or Ultra plans receive a substantial upgrade, permitting uploads of up to 3 hours of audio. This enhancement significantly increases the versatility of the Gemini app, responding directly to user feedback regarding their needs for improved functionality.
The introduction of the audio upload feature marks a notable advancement for the Gemini app, reinforcing its role as a powerful digital assistant. This development illustrates Google’s commitment to enhancing user experience by integrating the most requested functionalities. The response to this update highlights the importance of user input in shaping technological advancements.
For more details, visit the source of this information, 9to5 Google.
