#+TITLE: Token Optimization - Quick Start #+author: Amero Garcia #+created: [2026-03-16 Mon 14:28] #+DATE: 2026-03-04 * Quick Reference for Daily Use ** Rule of Thumb | What you need | Use this | Cost | |---------------|----------|------| | Quick answer, formatting, lookup | Gemini Flash | FREE | | Code review, analysis | Gemini Pro | FREE | | Complex problem solving | Claude Haiku / Qwen | $ | | Critical architecture decision | GPT-4o | $$ | ** Free Tier Limits (Daily) | Provider | Tokens | Requests | Reset | |----------|--------|----------|-------| | Google AI Studio | 300,000 | 60/min | Daily | | OpenRouter Free | Varies | Limited | - | ** Current Recommendation → *Use Google Gemini exclusively* until hitting 250K tokens/day → Then add OpenRouter fallback → Only use GPT-4 for final reviews ** This will reduce token costs by ~90% ** Next Steps 1. Configure Gemini as primary (already partially done) 2. Add quota tracking 3. Set alerts at 80% of free limits 4. Implement tiered routing ** Savings Potential: $100-500/month → $10-50/month