GPT-4o mini is exactly what I’ve been looking forward to.
It supports 128,000 input tokens (both images and text) and an impressive 16,000 output tokens. Most other models are still 4,000, and Claude 3.5 Sonnet got an upgrade to 8,000 just a few days ago. This makes it a good fit for translation and transformation tasks where the expected output more closely matches the size of the input. (View Highlight)
GPT-4o mini is 15 cents per millions input tokens and 60 cents per million output tokens - a 60% discount on GPT-3.5, and cheaper than Claude 3 Haiku’s 25c/125c and Gemini 1.5 Flash’s 35c/70c. Or you can use the OpenAI batch API for 50% off again, in exchange for up-to-24-hours of delay in getting the results. (View Highlight)
OpenAI point out that “the cost per token of GPT-4o mini has dropped by 99% since text-davinci-003, a less capable model introduced in 2022.” (View Highlight)
GPT-4o mini in the API is the first model to apply our instruction hierarchy(opens in a new window) method, which helps to improve the model’s ability to resist jailbreaks, prompt injections, and system prompt extractions. (View Highlight)