OpenAI will soon introduce enhanced capabilities in ChatGPT, including the ability to recognise photos and a voice option for iPhone and Android app users. These options will be available for subscribers within the next two weeks.
With the voice feature activated, users can say their query, and ChatGPT will convert the voice into a text message. After that, the engine will process the query, formulate the answer, and present the response in the form of speech. OpenAI aims to replicate Alexa and Google Assistant with better and more accurate results.
Moreover, users will get five voice options — Juniper, Sky, Cove, Ember and Breeze. OpenAI is also planning to partner with Spotify to translate podcasts into various languages while maintaining the frequency and pitch of the original podcaster’s voice.
Users must navigate to Settings > New Features on their ChatGPT mobile app to use voice conversations.
The image search is reminiscent of Google Lens. Users will take a photo of a landmark and then can ask questions about that landmark with ChatGPT. Or, users can take a photo of a maths or reasoning problem, circle the problem with the help of the in-built drawing tool and then let ChatGPT do the work.
Integrating voice and image into the text-based ChatGPT will raise some eyebrows, especially with those with a keen sense of cybersecurity. To ally their features, OpenAI has limited the AI’s ability to say something about people.
“We’ve also taken technical measures to significantly limit ChatGPT’s ability to analyze and make direct statements about people since ChatGPT is not always accurate, and these systems should respect individuals’ privacy,” said OpenAI. “Real-world usage and feedback will help us make these safeguards even better while keeping the tool useful.”
In the News: Security breach costs Mixin Network $200 million in crypto assets