Revolutionary GPT-4o: OpenAI’s Newest Technology Mannequin Takes Voice Interplay to the Subsequent Degree

2024-05-14 07:51:40

OpenAI held a presentation of the GPT-4o generative mannequin final evening. The letter “O” within the title represents the abbreviation of the phrase omni – “full”. The neural community responds to voice in 320 milliseconds on common, which is corresponding to the response throughout a dialog. New GPT mannequin works with speech, textual content and video. She communicates in a pure voice, even is aware of the right way to joke and perceive feelings, and in addition pauses in her speech when you ask her one thing.

Writer: @OpenAI/YouTube

Throughout the presentation, the corporate’s technical director, Mira Murati, mentioned that GPT-4o is way quicker than earlier variations: the neural community will be capable to analyze the content material of paperwork, movies and pictures, in addition to orally translating the phrase.

The presenters requested GPT-4o to inform a fairy story about robots, then made it clear that it ought to sound extra dramatic. Then they requested the generative mannequin to sing the identical story.

Writer: @OpenAI/YouTube

The presenter additionally hand-wrote an arithmetic instance on a sheet of paper. He confirmed it with the GPT-4o digicam and gave a voice command to resolve it. The neural community introduced the answer algorithm.

Writer: @OpenAI/YouTube

Moreover, throughout the presentation, the audio system communicated in English and Italian – GPT-4o helped them perceive one another.

Writer: @OpenAI/YouTube

1715678151
#GPT4o #neural #community

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.