Groq
Fastest LLM inference on LPUs
About
Hardware-accelerated inference for Llama, Mixtral, and more. Known for generation speeds of hundreds of tokens per second.
inference, lpu, fast, openai-compatible