deepseek/deepseek-v4-flash
The low-cost member of the DeepSeek V4 family. Supports both thinking and non-thinking modes under one id, a native 1M-token context window, and the cheapest per-token rate in the lineup. Use for high-volume, latency-sensitive tasks.
Source: launch_0019 · Verified 2026-06-01
Cache read $0.001/M · Cache write $—/M
Replace the ONEHOP_KEY placeholder with your API key. Create one →
from openai import OpenAI
client = OpenAI(
base_url="https://api.onehop.ai/v1",
api_key="<ONEHOP_KEY>",
)
completion = client.chat.completions.create(
model="deepseek/deepseek-v4-flash",
messages=[{"role": "user", "content": "What is the meaning of life?"}],
)
print(completion.choices[0].message.content)