Question 1

How much does it cost to fine-tune GPT-4o?

Accepted Answer

GPT-4o fine-tuning costs $25/M training tokens. For a typical dataset of 1,000 examples (500 tokens each, 3 epochs), that's about $37.50. Inference costs increase to $3.75/$15 per million input/output tokens.

Question 2

What is LoRA fine-tuning and is it cheaper?

Accepted Answer

LoRA (Low-Rank Adaptation) updates only a small subset of model parameters instead of all weights. It's 60-70% cheaper than full fine-tuning and supported by Together AI, Fireworks, and Google. Quality is comparable for most use cases.

Question 3

How many examples do I need for fine-tuning?

Accepted Answer

OpenAI requires a minimum of 10 examples but recommends 50-100 for noticeable improvement. For production quality, 500-1,000 high-quality examples across 3 epochs is a good starting point.

Question 4

Is fine-tuning or prompt engineering cheaper?

Accepted Answer

Prompt engineering is free upfront, but fine-tuning can reduce inference costs by enabling shorter prompts and using smaller models. Fine-tuning becomes more cost-effective at higher volumes (1,000+ requests/day).

Model	Training	Inference/mo	Total (6mo)
Llama 3.1 8BBest Value Together AI	$0.7200	$0.2400	$2.16
Mistral 7B Together AI	$0.7200	$0.2400	$2.16
Llama 3.1 8B Fireworks AI	$0.9000	$0.2400	$2.34
Gemini 2.0 Flash Google	$3.00	$0.7650	$7.59
GPT-4o mini OpenAI	$4.50	$1.53	$13.68
Llama 3.3 70B Together AI	$7.50	$1.60	$17.08
Llama 3.3 70B Fireworks AI	$9.00	$1.60	$18.58
GPT-4.1 mini OpenAI	$6.00	$4.08	$30.48
GPT-4o OpenAI	$37.50	$19.13	$152.25

AI Fine-Tuning Cost Calculator

Training Data

Post-Training Inference

Cost Comparison

Total Cost Breakdown (6-Month Ownership)

Model Details

Frequently Asked Questions

Related Calculators