Groq

Fastest LLM inference on LPUs

About

Hardware-accelerated inference for Llama, Mixtral, and other models. Known for generation speeds of hundreds of tokens per second.

inference, lpu, fast, openai-compatible

