qwen/qwen3.6-flash
Alibaba Cloud Qwen 3.6 Flash is a fast, cost-efficient language model in the Qwen 3.6 series with a 1-million-token context window. Supports chain-of-thought reasoning, tool calling, and streaming. Optimised for high-throughput, latency-sensitive workloads. Accessed via OpenAI-compatible interface.
Source: official_x0.5 · Verified 2026-06-17
Cache read $0.013/M · Cache write $—/M
Use either ID to call this model via the API.
qwen/qwen3.6-flashReplace the ONEHOP_KEY placeholder with your API key. Create one →
from openai import OpenAI
client = OpenAI(
base_url="https://api.onehop.ai/v1",
api_key="<ONEHOP_KEY>",
)
completion = client.chat.completions.create(
model="qwen/qwen3.6-flash",
messages=[{"role": "user", "content": "What is the meaning of life?"}],
)
print(completion.choices[0].message.content)