OpenAI’s GPT-4o-Powered ChatGPT Is Now More (Terrifyingly) Conversational and Life-Like

OpenAI on Monday announced GPT-4o, a new flagship generative AI model that expands on the capabilities of its predecessor, GPT-4. The “o” in GPT-4o stands for “omni,” reflecting the model’s ability to handle multiple modalities, including text, speech, and video. 

One of the key improvements in GPT-4o is its ability to reason across voice, text, and vision, which OpenAI CTO Mira Murati believes is crucial for the future of human-machine interaction. The model’s integration into OpenAI’s AI-powered chatbot, ChatGPT, allows users to interact with the platform more naturally, as if it were a personal assistant. 

Users can now interrupt ChatGPT while it’s answering, and the model responds in real time, picking up on nuances in the user’s voice and generating responses in various emotive styles, including singing.

GPT-4o also enhances ChatGPT’s vision capabilities, enabling it to analyze photos or desktop screens and answer related questions on topics ranging from software code to clothing brands. Murati suggests that future iterations of the model could allow ChatGPT to watch live events, such as sports games, and explain the rules to users.

In addition to its improved ease of use, GPT-4o boasts enhanced performance in around 50 languages and faster processing times at lower costs compared to GPT-4 Turbo. However, due to the risk of misuse, OpenAI plans to initially launch GPT-4o’s new audio capabilities to “a small group of trusted partners.”

Alongside the release of GPT-4o, OpenAI announced a refreshed ChatGPT UI on the web, a desktop version for macOS, and the expansion of previously paywalled features to free users, such as the ability to upload files and photos and search the web for answers.


Information for this story was found via TechCrunch, Ars Technica, and the sources and companies mentioned. The author has no securities or affiliations related to the organizations discussed. Not a recommendation to buy or sell. Always do additional research and consult a professional before purchasing a security. The author holds no licenses.
