
Gemini app finally expands to audio files
Google has introduced significant updates to its Gemini-powered products, enhancing user experience and broadening functionality across various platforms.
Expansion of the Gemini App to Audio Files
On Monday, Google announced that the Gemini app now supports audio file uploads, a feature users had long anticipated. According to Josh Woodward, vice president of Google Labs and Gemini, this capability was the “#1 request” from the app’s user base. Audio support extends the range of media formats users can work with in the app.
Audio File Compatibility Details
The new audio file feature allows users to upload audio files with varying limitations based on their subscription type. Free users can upload audio files up to 10 minutes in length and are limited to five prompts each day. In contrast, users subscribed to AI Pro or AI Ultra plans can upload audio files that are up to three hours long. This tiered approach to audio file uploads ensures that a broader range of users can benefit from the new functionality, catering to both casual users and those requiring more extensive capabilities.
Furthermore, the Gemini app accepts up to 10 audio files at once in a range of formats, including files contained within ZIP archives. This flexibility matters for users who want to analyze multiple audio clips or combine several audio resources in a single project, and it makes the app more useful across sectors such as education, research, and content creation.
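The tiered limits described above can be summarized as a small lookup table with a validation check. The following is only an illustrative sketch, not Google's implementation: the plan keys, the `PlanLimits` class, and the `check_upload` function are hypothetical names, and the sketch assumes the length limit applies to each file individually, which the announcement does not spell out. The article gives no daily prompt limit for paid plans, so that value is left unset.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence

# From the article: up to 10 audio files can be uploaded at once.
MAX_FILES_PER_UPLOAD = 10


@dataclass(frozen=True)
class PlanLimits:
    max_audio_minutes: int        # longest audio file the plan accepts
    daily_prompts: Optional[int]  # None means the article states no limit


# Plan keys are hypothetical labels; the numbers come from the article.
LIMITS = {
    "free": PlanLimits(max_audio_minutes=10, daily_prompts=5),
    "ai_pro": PlanLimits(max_audio_minutes=180, daily_prompts=None),
    "ai_ultra": PlanLimits(max_audio_minutes=180, daily_prompts=None),
}


def check_upload(
    plan: str,
    durations_min: Sequence[float],
    prompts_used_today: int = 0,
) -> List[str]:
    """Return the reasons an upload would be rejected (empty list if none)."""
    limits = LIMITS[plan]
    problems: List[str] = []

    if len(durations_min) > MAX_FILES_PER_UPLOAD:
        problems.append(
            f"too many files: {len(durations_min)} > {MAX_FILES_PER_UPLOAD}"
        )

    # Assumption: the length limit is checked per file, not on the total.
    for index, minutes in enumerate(durations_min):
        if minutes > limits.max_audio_minutes:
            problems.append(
                f"file {index} is {minutes} min, "
                f"limit is {limits.max_audio_minutes} min"
            )

    if (
        limits.daily_prompts is not None
        and prompts_used_today >= limits.daily_prompts
    ):
        problems.append("daily prompt limit reached")

    return problems
```

For example, `check_upload("free", [12])` reports one problem because the clip exceeds the 10-minute free-tier limit, while the same clip passes under `"ai_pro"`.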
Enhancements to Google Search’s AI Mode
In addition to the Gemini app’s audio capabilities, Google has also expanded its Search functionality by integrating five new language options into its AI Mode. The newly supported languages are Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese. This expansion is made possible through the integration of Gemini 2.5 with Google Search, allowing users to engage with the platform in their preferred language.
Implications of Language Expansion
This update is significant as it broadens accessibility for non-English speakers, enabling a more inclusive experience for users around the globe. With the ability to ask complex questions in their native languages, users can explore the web more deeply and effectively. This move aligns with Google’s ongoing commitment to making information accessible to everyone, regardless of language barriers.
The integration of additional languages into Google Search’s AI Mode also reflects the growing importance of multilingual capabilities in technology. As the world becomes increasingly interconnected, demand for tools that work across languages continues to grow. This update positions Google as a leader in providing AI solutions that cater to a diverse user base.
Updates to NotebookLM: New Report Styles and Features
Another noteworthy update is the enhancement of the Gemini-powered NotebookLM software, which now offers new report styles in over 80 languages. This feature is based on the documents, files, and other media uploaded by users, allowing for a more tailored and personalized report generation experience.
Standard Report Formats
The standard report formats available in NotebookLM include study guides, briefing documents, and blog posts. The latest update also introduces flashcards and quizzes, making NotebookLM a more comprehensive tool for educators and students alike. Users can also create custom formats, adjusting the structure, tone, and style of their reports to suit specific needs and preferences.
According to a company comment on X, this feature is expected to be “100%” available by the end of the week, indicating a swift rollout to users. The flexibility in report generation not only enhances the user experience but also positions NotebookLM as a valuable research tool that helps users identify patterns in various file formats.
Comparison with Previous Capabilities
While the Gemini app has only now introduced audio capabilities, NotebookLM already offered similar functionality, which allowed it to serve as a robust research tool. This difference highlights the distinct strengths of each platform within the Gemini ecosystem. NotebookLM’s ability to analyze uploaded files and generate reports from them makes it the more specialized tool for users engaged in research and academic work.
Google’s Recent AI Developments
The updates to the Gemini app, Google Search, and NotebookLM are part of a broader trend of rapid advancements in AI-related features from Google. Over the past month, the company has rolled out several significant updates, demonstrating its commitment to enhancing user experience through innovative technology.
Recent Features and Enhancements
In August, Gemini began automatically recalling user details and preferences from past conversations, making interactions more personalized and efficient. This feature allows the AI to provide more relevant responses based on previous user interactions, enhancing the overall user experience.
Additionally, free users gained access to Workspace’s video generation software, Vids, which allows for the creation of engaging video content. This capability is particularly beneficial for educators and content creators looking to produce multimedia resources quickly and efficiently.
In September, Google Photos upgraded to the latest video generation software, Veo 3, enabling free users to create silent, four-second-long videos from their personal still images. This feature adds a new dimension to photo management, allowing users to create dynamic content from their existing photo libraries.
Stakeholder Reactions and Future Implications
The recent updates have garnered positive reactions from stakeholders, including educators, content creators, and researchers. The introduction of audio file compatibility in the Gemini app, in particular, has been met with enthusiasm, as it addresses a significant user request. The ability to upload and analyze audio files opens up new possibilities for educational content, research projects, and creative endeavors.
Moreover, the expansion of language support in Google Search’s AI Mode is likely to be well-received by global users who have long sought more inclusive technology solutions. This move not only enhances accessibility but also positions Google as a leader in the multilingual AI space, potentially attracting a more diverse user base.
The updates to NotebookLM also reflect a growing trend toward personalized and adaptive learning tools. As education increasingly shifts toward digital platforms, the ability to generate tailored reports and study materials will likely be invaluable for students and educators alike.
Conclusion
Google’s recent updates to its Gemini-powered products signify a substantial leap forward in enhancing user experience and functionality. With the introduction of audio file compatibility in the Gemini app, the expansion of language options in Google Search, and the enhancements to NotebookLM, Google is positioning itself as a frontrunner in the AI landscape. These developments not only cater to existing user needs but also pave the way for future innovations that could further transform how users interact with technology.
Last Modified: September 9, 2025 at 12:44 am