/pcq/media/post_banners/wp-content/uploads/2023/09/OpenAIs-chatbot-added-new-Voice-and-Image-based-features.jpg)
A new iteration of its chatbot that can talk and see has been introduced by San Francisco-based artificial intelligence firm OpenAI. The ChatGPT chatbot can now converse verbally with users and reply to their uploaded photographs.
With the addition of two new features, ChatGPT's latest version becomes more human-like than before. First, it can now respond to users' questions by speaking to them in a voice synthesized from human speech, according to reports. There are five various voice selections available to users, including male and female voices.
Second, it can now react to images uploaded by users. For example, users can send a photo of the inside of their fridge, and ChatGPT can suggest dishes they can make using those ingredients.
How will the new features facilitate human conversation?
ChatGPT is powered by the Large Language Model, or LLM, which has learned to generate natural language by analyzing billions of words from the Internet. With the addition of voice support, ChatGPT can feel similar to voice assistants like Siri and Alexa. But it is actually different because it is equipped with LLM technology and therefore can handle various topics and tasks without pre-programming. It can write and (now) even read emails, poetry, treatises and jokes.
OpenAI said these features are designed to make ChatGPT more accessible and useful for everyone. ChatGPT voices are also said to be more convincing than others used with popular digital assistants. The tool can be considered a more natural way to interact with this chat, especially for people who are not comfortable writing or reading.
New features of OpenAI
Users can start using voice by going to Settings > New features in the mobile app and then selecting the voice chat option. At the same time, the image function is also very convenient. For example, users can upload a photo, graphic, or diagram, and ChatGPT can provide a detailed description of the image and answer questions about its content.
This can be a useful tool for people with visual impairments or people who want to know more about what they see. Although OpenAI introduced the image processing tool back in the spring, it halted the release due to fears of misuse.
The company feared that the product could become a facial recognition device that quickly recognizes, for example, people in photos. The new version of ChatGPT will be available over the next two weeks to everyone who subscribes to ChatGPT Plus, which costs $20 a month, and Enterprise.
However, the audio feature only works on iPhone, iPad and Android devices. The image feature works both on the web and on mobile devices. OpenAI has been rapidly releasing its AI tools in recent weeks.
Earlier, it announced a new version of its DALL-E image generator, which it integrated with ChatGPT so that users can ask the chat rat to generate images for them as well.
ChatGPT has attracted hundreds of millions of users since its launch last November. It also inspired several other companies to create similar services, such as Google Bard and Microsoft Bing. With the new version of ChatGPT, OpenAI advances the competition in the field of conversational artificial intelligence, at the same time competing with older technologies such as Alexa and Siri.