Qwen 3.6 Flash

Official

qwen/qwen3.6-flash

Alibaba Cloud's Qwen 3.6 Flash is a fast, cost-efficient language model in the Qwen 3.6 series with a 1-million-token context window. It supports chain-of-thought reasoning, tool calling, and streaming, and is optimised for high-throughput, latency-sensitive workloads. It is accessed through an OpenAI-compatible interface.

1049K context · Reasoning · Tool calling

Pricing

Source: official_x0.5 · Verified 2026-06-17

Input: $0.125 / M tokens (official $0.250 / M tokens, save 50%)

Output: $0.750 / M tokens (official $1.50 / M tokens, save 50%)

Cache read $0.013/M · Cache write $—/M
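The per-token arithmetic behind these prices can be sketched in a few lines. This is a minimal illustration, assuming the discounted rates listed above ($0.125 and $0.750 per million tokens) and ignoring cache pricing; the function name `estimate_cost` is hypothetical and not part of any OneHop SDK.

```python
# Discounted prices from the Pricing section, in USD per million tokens.
INPUT_PRICE_PER_M = 0.125
OUTPUT_PRICE_PER_M = 0.750
TOKENS_PER_M = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (cache pricing ignored)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a request with 10,000 input tokens and 1,000 output tokens.
    print(f"${estimate_cost(10_000, 1_000):.6f}")
```

Output tokens cost six times as much as input tokens at these rates, so long completions tend to dominate the bill even when prompts are large.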


Protocols

OpenAI Chat Completions · Status: Available · Streaming · https://api.onehop.ai/v1
OpenAI Responses · Status: Not supported
Anthropic Messages · Status: Available · Streaming · https://api.onehop.ai/anthropic
Google Vertex AI · Status: Not supported
OpenAI Images · Status: Not supported
OpenAI Sora · Status: Not supported
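Since the Anthropic Messages protocol is listed as available, a call through the official `anthropic` Python SDK might look like the sketch below. It assumes the SDK's `base_url` option can point at the listed endpoint, that the same model ID is accepted there, and that the endpoint accepts the key the way the SDK sends it; none of this is confirmed by the page. The helper names `build_messages_request` and `ask` are hypothetical.

```python
BASE_URL = "https://api.onehop.ai/anthropic"  # endpoint listed under Protocols
MODEL_ID = "qwen/qwen3.6-flash"               # assumed to be valid on this endpoint


def build_messages_request(prompt: str, max_tokens: int = 256) -> dict:
    """Build keyword arguments for an Anthropic Messages call."""
    return {
        "model": MODEL_ID,
        "max_tokens": max_tokens,  # required by the Messages API
        "messages": [{"role": "user", "content": prompt}],
    }


def ask(prompt: str, api_key: str) -> str:
    """Send one prompt and return the concatenated text of the reply."""
    # Imported lazily so the request builder works without the SDK installed.
    from anthropic import Anthropic

    client = Anthropic(base_url=BASE_URL, api_key=api_key)
    message = client.messages.create(**build_messages_request(prompt))
    # The reply is a list of content blocks; keep only the text blocks.
    return "".join(b.text for b in message.content if b.type == "text")


if __name__ == "__main__":
    print(ask("What is the meaning of life?", "<ONEHOP_KEY>"))
```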


Call IDs

Use this ID to call the model via the API.

OneHop name: qwen/qwen3.6-flash

Try it

Replace the <ONEHOP_KEY> placeholder with your API key.

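from openai import OpenAI

# Point the OpenAI SDK at the OneHop endpoint instead of api.openai.com.
client = OpenAI(
    base_url="https://api.onehop.ai/v1",
    api_key="<ONEHOP_KEY>",
)

# Send a single-turn chat request to Qwen 3.6 Flash.
completion = client.chat.completions.create(
    model="qwen/qwen3.6-flash",
    messages=[{"role": "user", "content": "What is the meaning of life?"}],
)
# Print the text of the first (and only) returned choice.
print(completion.choices[0].message.content)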

base_url: https://api.onehop.ai/v1
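Because the Chat Completions protocol is listed with streaming available, the same request can also be consumed incrementally. The following is a minimal sketch, assuming the endpoint follows the OpenAI SDK's standard `stream=True` behaviour; the helper names `build_stream_request` and `stream_reply` are hypothetical.

```python
BASE_URL = "https://api.onehop.ai/v1"
MODEL_ID = "qwen/qwen3.6-flash"


def build_stream_request(prompt: str) -> dict:
    """Build keyword arguments for a streaming chat completion."""
    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,  # ask the server to send the reply in chunks
    }


def stream_reply(prompt: str, api_key: str) -> str:
    """Print the reply as it arrives and return the full text."""
    # Imported lazily so the request builder works without the SDK installed.
    from openai import OpenAI

    client = OpenAI(base_url=BASE_URL, api_key=api_key)
    stream = client.chat.completions.create(**build_stream_request(prompt))
    parts = []
    for chunk in stream:
        if not chunk.choices:  # some chunks may carry no choices
            continue
        delta = chunk.choices[0].delta.content
        if delta:  # skip role-only or empty deltas
            print(delta, end="", flush=True)
            parts.append(delta)
    print()
    return "".join(parts)


if __name__ == "__main__":
    stream_reply("What is the meaning of life?", "<ONEHOP_KEY>")
```

Streaming does not change the token cost; it only lets a latency-sensitive client start showing output before the full completion is ready.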

Other variants in this family

Qwen 3.6 Plus

Mid-tier reasoning model with strong agentic coding and a 1M context window.

Input $0.250/M · Output $1.50/M