refactor: moved org-agent to its own repository as a submodule
This commit is contained in:
39
projects/token-optimization/quick-start.org
Normal file
39
projects/token-optimization/quick-start.org
Normal file
@@ -0,0 +1,39 @@
|
||||
#+TITLE: Token Optimization - Quick Start
|
||||
#+author: Amero Garcia
|
||||
#+created: [2026-03-16 Mon 14:28]
|
||||
#+DATE: 2026-03-04
|
||||
|
||||
* Quick Reference for Daily Use
|
||||
|
||||
** Rule of Thumb
|
||||
|
||||
| What you need | Use this | Cost |
|
||||
|---------------|----------|------|
|
||||
| Quick answer, formatting, lookup | Gemini Flash | FREE |
|
||||
| Code review, analysis | Gemini Pro | FREE |
|
||||
| Complex problem solving | Claude Haiku / Qwen | $ |
|
||||
| Critical architecture decision | GPT-4o | $$ |
|
||||
|
||||
** Free Tier Limits (Daily)
|
||||
|
||||
| Provider | Tokens | Requests | Reset |
|
||||
|----------|--------|----------|-------|
|
||||
| Google AI Studio | 300,000 | 60/min | Daily |
|
||||
| OpenRouter Free | Varies | Limited | - |
|
||||
|
||||
** Current Recommendation
|
||||
|
||||
→ *Use Google Gemini exclusively* until hitting 250K tokens/day
|
||||
→ Then add OpenRouter fallback
|
||||
→ Only use GPT-4 for final reviews
|
||||
|
||||
** This will reduce token costs by ~90%
|
||||
|
||||
** Next Steps
|
||||
|
||||
1. Configure Gemini as primary (already partially done)
|
||||
2. Add quota tracking
|
||||
3. Set alerts at 80% of free limits
|
||||
4. Implement tiered routing
|
||||
|
||||
** Savings Potential: $100-500/month → $10-50/month
|
||||
Reference in New Issue
Block a user