
Open-source self-hosted alternative to GitHub Copilot
FauxPilot is a self-hosted, open-source code completion server that replicates GitHub Copilot functionality using Salesforce CodeGen models on NVIDIA Triton Inference Server. It lets developers run AI-powered code suggestions entirely on their own hardware, ensuring source code never leaves local infrastructure.
Runs entirely on local hardware, providing AI code suggestions without sending code to external servers
Exposes an OpenAI-compatible API endpoint for easy integration with existing tools
Works with existing GitHub Copilot editor extensions by redirecting the endpoint URL
Provides a standard REST API for custom integrations and tooling beyond IDE plugins
Supports CodeGen models from 350M to 16B parameters based on available GPU VRAM
Larger models can be split across multiple GPUs when they do not fit in a single GPU's VRAM
Deploys as a Docker Compose stack for a reproducible, containerized setup
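
As a rough illustration of the OpenAI-compatible endpoint listed above, the following Python sketch builds and sends a completion request using only the standard library. The host and port (`localhost:5000`), the engine name (`codegen`), and the exact URL path are assumptions about a typical local setup, not values taken from this listing; adjust them to match your own deployment.

```python
import json
import urllib.request

# Assumed values for a local deployment; adjust to your own setup.
BASE_URL = "http://localhost:5000/v1"
ENGINE = "codegen"


def build_completion_request(prompt, max_tokens=64, temperature=0.1):
    """Build an OpenAI-style completion request without sending it."""
    payload = {
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
        "stop": ["\n\n"],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/engines/{ENGINE}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


def complete(prompt, timeout=30):
    """Send the request and return the text of the first completion choice."""
    request = build_completion_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # OpenAI-style responses carry the generated text in choices[i]["text"].
    return body["choices"][0]["text"]


if __name__ == "__main__":
    # Requires a running server reachable at BASE_URL.
    print(complete("def fibonacci(n):"))
```

Building the request separately from sending it keeps the payload easy to inspect or log before any code leaves the process, which fits the privacy-focused use of a self-hosted server.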
Organizations with strict data governance can get AI coding assistance while keeping all data on-premises
Teams in classified or regulated environments deploy offline AI code completion
Academics study vulnerabilities in AI-generated code and develop safer models
Developers with NVIDIA GPUs avoid recurring Copilot subscription fees by self-hosting
Offers both Python-specialized and multi-language model variants
ML engineers swap in fine-tuned models to benchmark different code generation approaches
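
To make the link between model size and available VRAM more concrete, here is a back-of-the-envelope sizing sketch in Python. It assumes 16-bit weights (2 bytes per parameter), counts only the weights, and treats multiple GPUs as one evenly shared memory pool, so real requirements will be higher. The model table uses the nominal CodeGen size names, and the helper names are hypothetical and not part of FauxPilot.

```python
import math

# Nominal CodeGen sizes, in parameters (actual counts differ slightly).
MODEL_PARAMS = {"350M": 350e6, "2B": 2e9, "6B": 6e9, "16B": 16e9}
BYTES_PER_PARAM = 2  # assumption: fp16 weights, no activations or cache


def weight_gib(params):
    """Approximate memory taken by the weights alone, in GiB."""
    return params * BYTES_PER_PARAM / 2**30


def gpus_needed(model, vram_gib_per_gpu):
    """Lower bound on identical GPUs needed to hold the weights."""
    return math.ceil(weight_gib(MODEL_PARAMS[model]) / vram_gib_per_gpu)


def largest_model_that_fits(vram_gib_per_gpu, num_gpus=1):
    """Largest model whose weights fit in the combined VRAM, or None."""
    total = vram_gib_per_gpu * num_gpus
    fitting = [m for m, p in MODEL_PARAMS.items() if weight_gib(p) <= total]
    return max(fitting, key=MODEL_PARAMS.get) if fitting else None


if __name__ == "__main__":
    for name, params in MODEL_PARAMS.items():
        print(f"{name}: ~{weight_gib(params):.1f} GiB of weights")
    print(largest_model_that_fits(24))      # single 24 GiB GPU
    print(largest_model_that_fits(24, 2))   # two 24 GiB GPUs
```

Treat the result as a lower bound: inference also needs memory for activations and runtime overhead, so leave headroom when choosing a model.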

Open-source, AI-first business automation