Performance
Token Budget
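The planned allocation of a model's context window among system instructions, retrieved context, conversation history, and expected output. Because these components typically share one fixed window, space reserved for one is unavailable to the others. Managing the token budget prevents truncation of inputs or responses and, where usage is billed per token, controls API costs.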
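As a rough illustration of the definition above, the sketch below reserves fixed allocations for system instructions, retrieved context, and expected output, then gives whatever remains to conversation history. The class name `TokenBudget`, the helper `trim_history`, and all of the numbers are hypothetical. Token counts are passed in as plain integers; in practice they would come from the tokenizer of whichever model is in use.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class TokenBudget:
    """Hypothetical split of a context window, measured in tokens."""
    context_window: int  # total tokens the model accepts (assumed value)
    system: int          # reserved for system instructions
    retrieved: int       # reserved for retrieved context
    output: int          # reserved for the expected response

    @property
    def history(self) -> int:
        # Whatever is left after the fixed reservations goes to history.
        remaining = self.context_window - (self.system + self.retrieved + self.output)
        if remaining < 0:
            raise ValueError("fixed allocations exceed the context window")
        return remaining


def trim_history(turn_tokens: List[int], limit: int) -> List[int]:
    """Keep the most recent turns whose combined token count fits in limit."""
    kept = []
    total = 0
    # Walk from newest to oldest and stop at the first turn that does not fit.
    for count in reversed(turn_tokens):
        if total + count > limit:
            break
        kept.append(count)
        total += count
    kept.reverse()  # restore chronological order
    return kept


budget = TokenBudget(context_window=8000, system=500, retrieved=3000, output=1000)
print(budget.history)  # 3500
print(trim_history([1200, 900, 1500, 1400], budget.history))  # [1500, 1400]
```

Dropping the oldest turns first is only one possible policy; summarizing older history or shrinking the retrieved context are alternatives that the same budget check would accommodate.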