Inferact is a startup founded by creators and core maintainers of vLLM, the most popular open-source LLM inference engine.
Our mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster.
Member of Technical Staff
CharacterAI
Jun 2024 - Oct 2025 (1 year 5 months)
Landing GPU efficiency and inference optimizations for the leading LLM consumer app