@OpenAIDevs: Remember reinforcement fine-tu...
@OpenAIDevs
28 views
May 09, 2025
1
Remember reinforcement fine-tuning? We’ve been working away at it since last December, and it’s available today with OpenAI o4-mini! RFT uses chain-of-thought reasoning and task-specific grading to improve model performance—especially useful for complex domains. Take @AccordanceAI, which used RFT to fine-tune a model that’s SOTA for their tax and accounting purposes.
And in supervised fine-tuning news: you can now fine-tune GPT-4.1 nano. Get even more from our fastest, cheapest model by training it specifically for your use-case.
And in supervised fine-tuning news: you can now fine-tune GPT-4.1 nano. Get even more from our fastest, cheapest model by training it specifically for your use-case.
View Tweet
2
RFT is available to verified organizations today. Share your datasets with us to receive a 50% discount and help improve future OpenAI models.
Get started with our reinforcement fine-tuning guide: platform.openai.com/docs/guides/rf…
Get started with our reinforcement fine-tuning guide: platform.openai.com/docs/guides/rf…