Browse Category

Multimodal AI News 29 July 2025

Beyond ChatGPT: The Next Wave of AI Can See, Hear, and Create Worlds

OpenAI released GPT-4o in May 2024 as a natively multimodal model that accepts text, audio, image, and video input and can generate text, audio, or images, delivering GPT-4-level performance at roughly 50% lower cost and faster speed. Google DeepMind Gemini,

29 July 2025

Go toTop