Files
memex/projects/token-optimization/quick-start.org

1.0 KiB

Token Optimization - Quick Start

Quick Reference for Daily Use

Rule of Thumb

What you need Use this Cost
Quick answer, formatting, lookup Gemini Flash FREE
Code review, analysis Gemini Pro FREE
Complex problem solving Claude Haiku / Qwen $
Critical architecture decision GPT-4o $$

Free Tier Limits (Daily)

Provider Tokens Requests Reset
Google AI Studio 300,000 60/min Daily
OpenRouter Free Varies Limited -

Current Recommendation

Use Google Gemini exclusively until hitting 250K tokens/day → Then add OpenRouter fallback → Only use GPT-4 for final reviews

This will reduce token costs by ~90%

Next Steps

  1. Configure Gemini as primary (already partially done)
  2. Add quota tracking
  3. Set alerts at 80% of free limits
  4. Implement tiered routing

Savings Potential: $100-500/month → $10-50/month