#162 OpenAI Launches GPT-4o Mini, Provides Access to Free ChatGPT Users
Fresh & Hot curated AI happenings in one snack. Never miss a byte 🍔
This snack byte will take approx 3 minutes to consume.
AI BYTE # 📢: OpenAI Launches GPT-4o Mini, Provides Access to Free ChatGPT Users
OpenAI has released a lighter, low-cost model for developers to tinker with called GPT-4o Mini. It costs significantly less than full-sized models and is said to be more capable than GPT-3.5.
Building apps using OpenAI’s models can add up to huge bills. Developers without the means to afford to tinker with it can get priced out of it entirely and may opt for cheaper models like Google’s Gemini 1.5 Flash or Anthropic’s Claude 3 Haiku. Now, OpenAI is entering the light model game (think of it like Bud Lite, though its not a great beer)
Starting today, ChatGPT users on Free, Plus, and Team plans can use GPT-4o Mini instead of GPT-3.5 Turbo, with Enterprise users getting access next week.
That means GPT-3.5 will no longer be an option for ChatGPT users, but it will still be available for developers via the API if they prefer not to switch to GPT-4o Mini.
Key Highlights:
Affordable Intelligence: GPT-4o Mini is designed to be highly cost-effective, priced at just 15 cents per million input tokens and 60 cents per million output tokens. This makes it more than 60% cheaper than GPT-3.5 Turbo, significantly lowering the barrier for developers and businesses to integrate advanced AI into their applications.
Enhanced Performance: Despite its smaller size, GPT-4o Mini outperforms many existing models on key benchmarks. It scores 82% on the MMLU (Measuring Massive Multitask Language Understanding) benchmark, which measures reasoning capabilities, surpassing models like Gemini Flash and Claude Haiku. Additionally, it excels in mathematical reasoning and coding tasks, with an impressive 87% on MGSM and 87.2% on HumanEval.
Multimodal Capabilities: GPT-4o Mini supports both text and vision inputs, with plans to include video and audio capabilities in the future. This makes it a versatile tool for a wide range of applications, from customer support chatbots to complex data analysis.
Large Context Window: The model boasts a context window of 128,000 tokens, allowing it to handle extensive inputs such as full codebases or long conversation histories. This feature is particularly beneficial for applications requiring detailed and context-rich interactions.
Speed and Efficiency: GPT-4o Mini is not only affordable but also fast. It has a median output speed of 202 tokens per second, making it more than twice as fast as its predecessors. This speed is crucial for real-time applications and high-volume tasks.
The new, lightweight model will also support text and vision in the API, and the company says it will soon handle all multimodal inputs and outputs like video and audio.
With all these capabilities, this could look like more capable virtual assistants that can understand your travel itinerary and create suggestions. However, the model is meant for simple tasks, so no one is exactly building Siri for cheap.