ChatGPT gains voice and image search abilities

2023-09-25 20:43:03

OpenAI today announced a series of updates to ChatGPT. The main changes are in the ways users interact with the AI, since recent developments had focused on improving the AI's knowledge.

One of the key additions is voice control, which will allow users to interact with the AI via verbal commands. The feature, according to OpenAI, provides a more intuitive and natural experience, almost like a virtual assistant.

ChatGPT can now see, hear and speak. Over the next two weeks, Plus users will be able to have voice conversations with ChatGPT (on iOS and Android) and include images in conversations (on all platforms).
openai.com/blog/chatgpt-c…

What sets this development apart is OpenAI’s commitment not only to making the AI talk, but also to ensuring more accurate responses, the result of substantial improvements in the technology underlying the feature.

According to the company, the key components that drive the functionality include Whisper, an OpenAI system that transcribes spoken words into text, and a new text-to-speech model capable of generating human-like audio from plain text and brief speech samples.

This innovation in text-to-speech technology has resulted in collaborations with several companies, including Spotify, with the aim of translating podcasts into multiple languages while preserving the hosts’ original voices.

Still, OpenAI says it is aware of the potential dangers associated with synthetic voices. It is therefore taking a cautious approach, with plans to restrict availability to carefully selected partnerships.

Another feature that will debut on ChatGPT is image search. With it, users will be able to take a photo of an object, scene or item of interest, and the chatbot will analyze the image to provide relevant information or answers to the query.

Additionally, the platform will offer a versatile drawing tool and allow users to complement their image with spoken or typed questions, enabling a dynamic and interactive experience.

Also according to OpenAI, the voice feature will be available on iOS and Android, and the image feature on all platforms. The rollout, according to the company, will reach Plus and Enterprise users “over the next two weeks”. Other user groups, including developers, are expected to receive the features “soon”.

ChatGPT
by OpenAI

Version 1.2023.264 (36.4 MB)
Requires iOS 16.1 or later

via The Verge
