26 lines
733 B
Org Mode
26 lines
733 B
Org Mode
#+TITLE: Token Optimization
|
|
#+AUTHOR: Amr
|
|
#+CREATED: [2026-03-17 Tue]
|
|
#+BEGIN_COMMENT
|
|
Cost-effective LLM usage through smart routing, context compression, and multi-provider strategies.
|
|
#+END_COMMENT
|
|
|
|
* Token Optimization
|
|
|
|
Strategy and implementation for minimizing LLM costs while maintaining quality.
|
|
|
|
* Project Tasks
|
|
|
|
See the actionable tasks for this project in [[file:../../gtd.org::*Token Optimization][GTD.org > Projects > Token Optimization]]
|
|
|
|
* Key Documents
|
|
|
|
- [[file:plan.org][Optimization Plan]]
|
|
- [[file:token-optimization.yaml][Configuration]]
|
|
|
|
* Current Focus
|
|
|
|
- Multi-provider setup (Gemini primary, OpenRouter fallback)
|
|
- Usage tracking and budget alerts
|
|
- Smart routing by task type
|
|
- Context compression techniques |