Groq
Cloud API provider for fast inference on Groq LPU hardware via api.groq.com.
Groq provides extremely fast inference using custom LPU hardware. The API is OpenAI-compatible at api.groq.com.
Configuration
[backend]
name = "groq"
url = "https://api.groq.com"
model = "llama-3.3-70b-versatile"
api_key = "gsk_..."
Notes
Groq uses the /openai/v1/chat/completions path prefix (not /v1/chat/completions).
Default Endpoint
POST https://api.groq.com/openai/v1/chat/completions