google/gemini-3.1-flash-lite
Google's cheapest GA model in the 3.x series. Matches Gemini 2.5 Flash quality at a fraction of the cost. Optimized for low-latency, high-volume workloads: classification, summarization, simple generation, and RAG at scale.
Source: gemini_x0.5 · Verified 2026-06-05
Cache read $0.013/M · Cache write $—/M
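As a rough illustration of the cache-read rate above, the arithmetic can be sketched as follows. This assumes "/M" means per one million tokens; the function name `cache_read_cost_usd` is hypothetical and not part of any API.

```python
# Assumption: "/M" in the listing means "per one million tokens".
CACHE_READ_USD_PER_MILLION = 0.013


def cache_read_cost_usd(cached_tokens: int) -> float:
    """Estimate the USD cost of reading `cached_tokens` tokens from the cache."""
    return cached_tokens / 1_000_000 * CACHE_READ_USD_PER_MILLION


# Example: 2 million cached tokens read at $0.013 per million.
print(f"${cache_read_cost_usd(2_000_000):.3f}")  # prints "$0.026"
```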
Use either ID to call this model via the API.
google/gemini-3.1-flash-lite
Replace the ONEHOP_KEY placeholder with your API key.
from openai import OpenAI

# Point the OpenAI-compatible client at the OneHop endpoint.
client = OpenAI(
    base_url="https://api.onehop.ai/v1",
    api_key="<ONEHOP_KEY>",
)

# Send a single-turn chat request to the model.
completion = client.chat.completions.create(
    model="google/gemini-3.1-flash-lite",
    messages=[{"role": "user", "content": "What is the meaning of life?"}],
)

# Print the text of the first returned choice.
print(completion.choices[0].message.content)
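
Since classification is listed among the intended workloads, a classification request might be sketched like this. The helper names `build_classification_messages` and `classify`, the prompt wording, and the choice of `temperature=0` are all assumptions made for illustration; only the model ID and the `chat.completions.create` call come from the snippet above.

```python
def build_classification_messages(text: str, labels: list[str]) -> list[dict]:
    """Build chat messages asking for exactly one label (hypothetical helper)."""
    system = (
        "Classify the user's text. Reply with exactly one of these labels "
        "and nothing else: " + ", ".join(labels)
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": text},
    ]


def classify(client, text: str, labels: list[str]) -> str:
    """Send the request through an OpenAI-compatible client and return the reply text."""
    completion = client.chat.completions.create(
        model="google/gemini-3.1-flash-lite",
        messages=build_classification_messages(text, labels),
        temperature=0,  # assumption: keeps label output as stable as possible
    )
    # The content may be None for some responses, so fall back to an empty string.
    return (completion.choices[0].message.content or "").strip()
```

With a `client` constructed as in the quickstart, a call such as `classify(client, "The checkout page keeps timing out", ["bug", "billing", "feature request"])` would return the model's reply, which should be checked against the label list before use because the model is not guaranteed to follow the instruction.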