1.0 KiB
1.0 KiB
Token Optimization - Quick Start
Quick Reference for Daily Use
Rule of Thumb
| What you need | Use this | Cost |
|---|---|---|
| Quick answer, formatting, lookup | Gemini Flash | FREE |
| Code review, analysis | Gemini Pro | FREE |
| Complex problem solving | Claude Haiku / Qwen | $ |
| Critical architecture decision | GPT-4o | $$ |
Free Tier Limits (Daily)
| Provider | Tokens | Requests | Reset |
|---|---|---|---|
| Google AI Studio | 300,000 | 60/min | Daily |
| OpenRouter Free | Varies | Limited | - |
Current Recommendation
→ Use Google Gemini exclusively until hitting 250K tokens/day → Then add OpenRouter fallback → Only use GPT-4 for final reviews
This will reduce token costs by ~90%
Next Steps
- Configure Gemini as primary (already partially done)
- Add quota tracking
- Set alerts at 80% of free limits
- Implement tiered routing