Reasoning
Orpheus has built-in support for reasoning models by configuring the level of reasoning effort or setting a reasoning budget, which are both available through the with_reasoning
configurator. Both methods cannot be used at the same time.
Learn more about how reasoning is controlled internally.
Configuring Reasoning Effort
You can use the Effort
enum to set the reasoning according to this guide.
let response = client
.chat("Are zebras black with white stripes, or white with black stripes?")
.model("google/gemini-2.5-flash-lite-preview-06-17")
.with_reasoning(|reasoning| reasoning.effort(Effort::Low))
.send()?;
Setting a Reasoning Budget
You can use the max_tokens
method to set the reasoning according to this guide.
let response = client
.chat("Are zebras black with white stripes, or white with black stripes?")
.model("google/gemini-2.5-flash-lite-preview-06-17")
.with_reasoning(|reasoning| reasoning.max_tokens(100))
.send()?;
Last updated
Was this helpful?