
Beyond ChatGPT: The Next Wave of AI Can See, Hear, and Create Worlds
OpenAI released GPT-4o in May 2024 as a natively multimodal model that accepts text, audio, image, and video input and can generate text, audio, or images, delivering GPT-4-level performance at roughly 50% lower cost and faster speed. Google DeepMind Gemini, unveiled in December 2023,