Pelayo Arbués

Recent Notes

Why Software Engineers Should Learn a Bit of Data Science
Apr 01, 2025
A recommender beast
Feb 05, 2025
The next generation of weak learners
Jan 28, 2025

See 89 more →

❯

Literature Notes

❯

❯

GPT-4o Mini

Apr 16, 20252 min read

articles
literature-note

Metadata

Author: Simon Willison
Full Title: GPT-4o Mini
URL: https://simonwillison.net/2024/Jul/18/gpt-4o-mini/#atom-everything

Highlights

GPT-4o mini is exactly what I’ve been looking forward to. It supports 128,000 input tokens (both images and text) and an impressive 16,000 output tokens. Most other models are still 4,000, and Claude 3.5 Sonnet got an upgrade to 8,000 just a few days ago. This makes it a good fit for translation and transformation tasks where the expected output more closely matches the size of the input. (View Highlight)
GPT-4o mini is 15 cents per millions input tokens and 60 cents per million output tokens - a 60% discount on GPT-3.5, and cheaper than Claude 3 Haiku’s 25c/125c and Gemini 1.5 Flash’s 35c/70c. Or you can use the OpenAI batch API for 50% off again, in exchange for up-to-24-hours of delay in getting the results. (View Highlight)
OpenAI point out that “the cost per token of GPT-4o mini has dropped by 99% since text-davinci-003, a less capable model introduced in 2022.” (View Highlight)
GPT-4o mini in the API is the first model to apply our instruction hierarchy(opens in a new window) method, which helps to improve the model’s ability to resist jailbreaks, prompt injections, and system prompt extractions. (View Highlight)

Graph View

Metadata
Highlights

Now Reading

![CDATA[Not Boring by Packy McCormick]]>
Apr 16, 2025

See 1293 more →

Created with Quartz, © 2025

Bluesky
Linkedin
Mastodon
Twitter
Unsplash
GitHub
RSS