#+TITLE: Token Optimization - Quick Start
#+author: Amero Garcia
#+created: [2026-03-16 Mon 14:28]
#+DATE: 2026-03-04

* Quick Reference for Daily Use

** Rule of Thumb

| What you need                    | Use this            | Cost |
|----------------------------------+---------------------+------|
| Quick answer, formatting, lookup | Gemini Flash        | FREE |
| Code review, analysis            | Gemini Pro          | FREE |
| Complex problem solving          | Claude Haiku / Qwen | $    |
| Critical architecture decision   | GPT-4o              | $$   |
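
The table above can be read as a lookup from task type to model. A minimal sketch of that lookup, assuming hypothetical names (=Task=, =ROUTES=, =pick_model=) that are not part of any existing setup; the model labels are copied from the table and are display names, not verified API model identifiers:

#+begin_src python
from enum import Enum


class Task(Enum):
    """Task categories taken from the rule-of-thumb table."""
    QUICK = "quick answer, formatting, lookup"
    REVIEW = "code review, analysis"
    COMPLEX = "complex problem solving"
    CRITICAL = "critical architecture decision"


# (model label, cost tier) per task, copied from the table.
# The labels are illustrative, not real API model identifiers.
ROUTES = {
    Task.QUICK: ("Gemini Flash", "FREE"),
    Task.REVIEW: ("Gemini Pro", "FREE"),
    Task.COMPLEX: ("Claude Haiku / Qwen", "$"),
    Task.CRITICAL: ("GPT-4o", "$$"),
}


def pick_model(task: Task) -> tuple[str, str]:
    """Return the (model label, cost tier) suggested for a task."""
    return ROUTES[task]
#+end_src

For example, =pick_model(Task.REVIEW)= returns the Gemini Pro row, so routine reviews stay on the free tier.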

** Free Tier Limits (Daily)

| Provider         | Tokens  | Requests | Reset |
|------------------+---------+----------+-------|
| Google AI Studio | 300,000 | 60/min   | Daily |
| OpenRouter Free  | Varies  | Limited  | -     |

** Current Recommendation

→ *Use Google Gemini exclusively* until hitting 250K tokens/day
→ Then add OpenRouter fallback
→ Only use GPT-4 for final reviews
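
The three arrows above amount to a small decision rule. One way it could be sketched, assuming a hypothetical =choose_provider= function and a running count of tokens used today (how that count is obtained is left open here):

#+begin_src python
# Switch-over point taken from the recommendation above.
GEMINI_DAILY_BUDGET = 250_000


def choose_provider(tokens_used_today: int, final_review: bool = False) -> str:
    """Pick a provider label following the recommendation above.

    The returned strings are plain labels for illustration,
    not API model identifiers.
    """
    if final_review:
        # Final reviews are the only case routed to GPT-4.
        return "GPT-4"
    if tokens_used_today < GEMINI_DAILY_BUDGET:
        # Below the daily budget, stay on Gemini exclusively.
        return "Gemini"
    # Budget reached: fall back to OpenRouter.
    return "OpenRouter"
#+end_src

With these assumptions, =choose_provider(120_000)= stays on Gemini, while =choose_provider(250_000)= falls back to OpenRouter.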

** Expected Result: ~90% Lower Token Costs

** Next Steps
1. Configure Gemini as primary (already partially done)
2. Add quota tracking
3. Set alerts at 80% of free limits
4. Implement tiered routing
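
Steps 2 and 3 could start from something as small as the sketch below. =QuotaTracker= and its fields are hypothetical names; the 300,000-token figure in the comment comes from the free-tier table and should be checked against the provider's current limits before relying on it:

#+begin_src python
from dataclasses import dataclass


@dataclass
class QuotaTracker:
    """Counts tokens used today and flags when an alert threshold is reached."""
    daily_limit: int          # e.g. 300_000 tokens, per the free-tier table
    alert_percent: int = 80   # alert at 80% of the free limit
    used: int = 0

    def record(self, tokens: int) -> bool:
        """Add tokens to today's total; return True once the alert level is reached."""
        self.used += tokens
        # Integer arithmetic avoids floating-point rounding at the boundary.
        return self.used * 100 >= self.alert_percent * self.daily_limit

    def reset(self) -> None:
        """Clear the counter, e.g. when the daily quota resets."""
        self.used = 0
#+end_src

A caller would invoke =record= after each request and send a notification the first time it returns =True=; with a 300,000-token limit that happens at 240,000 tokens.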

** Savings Potential: $100-500/month → $10-50/month