Liutong

Fast, affordable LLM inference with a fully OpenAI-compatible API. Powered by a custom Rust engine.

Four model families, one API

Drop-in replacement for the OpenAI API. Just change the base URL.

Ready to get started?

Explore the docs or jump straight to the quickstart guide