ChatGPT, a language model developed by OpenAI, announces the implementation of new functionsincluding the ability to allow users to participate in a voice conversation with the chatbot.
Until now you could only interact with the different versions of ChatGPT in writing, but soon Users can have a live conversation and hear the machine’s responses. This is how the company is implementing voice and images in ChatGPT for Plus and Enterprise users over the next two weeks. The voice will be available on iOS and Android (activate in your settings) and the images will be available on all platforms.
Voice and image offer more ways to use ChatGPT in our lives. We can take a photo of a landmark during our trip and have a live conversation about what’s interesting in that area. When we’re home, we can take photos of the fridge and pantry to find out what’s for dinner (and ask more questions for a step-by-step recipe).
How do you use this language ability?
To get started with voice, you need to go to settings and then click on new features in the mobile application and thus opt for voice conversations. Next, touch the headphone button in the top right corner of the home screen and select the voice you want from five different voices. However, the default voice is one that imitates that of a young woman, as is common with this type of tool.
The The new voice feature is based on a new text-to-speech model, which is capable of producing human-like audio from just text and a few seconds of voice sample. To achieve this, they worked with professional voice actors to create each individual voice. They also use to whisperits open source speech recognition system to convert spoken words into text.
On the other hand, Spotify is leveraging the power of this technology to test its language translation feature, which helps podcasters expand the reach of their storytelling by translating podcasts into additional languages using the podcasters’ own voices.