How ChatGPT becomes an interpreter

The chosen date was anything but a coincidence. Just 24 hours before Google's long-awaited I/O developer conference, OpenAI, creator of the artificial intelligence (AI) chatbot ChatGPT, invited the public to a rather short-notice presentation. The event was opened by OpenAI CTO Mira Murati.

The company introduced a new AI model, GPT-4o, where the “o” stands for “omni”: the model works multimodally, both on the input side and in the content it then generates. This time, the ability to talk to ChatGPT received a particularly positive response. Voice control of the chatbot was possible before, but three AI models had to work together: one transcribed speech into text, a second answered that text with text, and a third converted the written answer back into speech. This process caused waits of several seconds after each question and made conversation cumbersome.
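To make that three-stage pipeline concrete, here is a minimal sketch using OpenAI's Python SDK. The specific model names (whisper-1, gpt-4, tts-1), the voice, and the file names are illustrative assumptions, not details from the presentation:

    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    # Step 1: speech-to-text with a separate transcription model
    with open("question.wav", "rb") as audio_file:
        transcript = client.audio.transcriptions.create(
            model="whisper-1",
            file=audio_file,
        )

    # Step 2: a text model answers the transcribed question with text
    chat = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": transcript.text}],
    )
    answer = chat.choices[0].message.content

    # Step 3: text-to-speech turns the written answer back into audio
    speech = client.audio.speech.create(
        model="tts-1",
        voice="alloy",
        input=answer,
    )
    with open("answer.mp3", "wb") as out:
        out.write(speech.content)

Each of the three network calls adds its own latency, which is why the old voice mode felt sluggish in conversation.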

One model is enough

With GPT-4o, a single neural network handles the whole exchange, and the spoken answer now arrives within roughly 300 milliseconds. The AI's voice has also taken on a more human tone. During the demo, OpenAI showed plenty of application examples for the new model: ChatGPT told dramatic bedtime stories, served as an interpreter between Italian and English, and even began to sing.
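In the single-model setup, the three calls above collapse into one. A hedged sketch of the text path, which already runs on GPT-4o; direct audio-in/audio-out was not yet exposed in the public API at the time of the announcement, so the prompt here is only an illustrative assumption:

    from openai import OpenAI

    client = OpenAI()

    # One multimodal model replaces the transcribe -> answer -> speak chain.
    # The model name "gpt-4o" is taken from the announcement.
    chat = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": "Translate to Italian: Good morning!"}],
    )
    print(chat.choices[0].message.content)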

German technology expert Philipp Kloeckner speaks of a potential “Siri killer”, alluding to Apple's voice assistant. OpenAI itself has been keen to invoke comparisons with Spike Jonze's 2013 hit film “Her”, in which the shy Theodore falls in love with Samantha, the voice of a particularly likeable AI.

It is still unclear when the new functions will actually become available. According to OpenAI CEO Sam Altman, the current voice mode does not yet run on GPT-4o, although the new model is already writing text responses.