GPT-4o: A free large mixed-input and mixed-output model
Overview
GPT-4 “o”.
GPT-4o is released.
You can talk to it through voice and let it sing for you.
Features and advantages of GPT-4o
Mixed input and mixed output
Mixed input means you can input text, images, voice, video, etc. at the same time
Mixed output means that GPT-4o can output text, images, voice, video, etc. at the same time.
In this way, input and output are richer and more user-friendly.
faster
GPT-4o is much faster than GPT-4 Turbo and is almost as fast as human reaction speed.
The above two points make conversations with GPT-4o very similar to conversations with real people.
GPT-4o price
GPT-4o is free to use.
Its API usage fee has also been reduced by half compared to before.
GPT-4o VS Gmini1.5 Pro
Gemini1.5 Pro is also a large model with mixed input and mixed output. It was released earlier than GPT-4o, and its technology and performance in all aspects are actually almost the same as GPT-4o.
However, in terms of conversation experience, Gemini1.5 Pro is obviously inferior to GPT-4o.
In fact, this is understandable. After all, there are far more users using GPT than using Gemini, which leads to the daily conversation volume of GPT being far greater than that of Gemini. Over time, there will naturally be a gap.
Of course, if we just generate content, the gap won’t be that obvious.
Application scenarios of GPT-4o
robot
The emergence of large models such as GPT has directly promoted the development of the robotics industry.
GPT-4o makes the conversation experience almost the same as that of a real person, which will surely bring new development to the robotics industry.
AIGC Application
GPT-4o provides a good interface for various AIGC applications. Many AIGC applications that can generate mixed content such as text, pictures, and voice at the same time may appear in large numbers.
in conclusion
The emergence of large models with mixed input and mixed output such as GPT-4o and Gemini1.5 Pro is a major step forward for generative AI.