FauxPilot

Open-source, self-hosted GitHub Copilot server using SalesForce CodeGen models on NVIDIA Triton — fully offline, zero subscription cost, and complete code privacy.

CLI Agents

Free Open Source Self-hosted

FauxPilot: A GitHub Copilot Alternative for Self-Hosted, Privacy-First AI Code Completion

FauxPilot is an open-source, self-hosted server that replicates the GitHub Copilot API using the SalesForce CodeGen models running on NVIDIA's Triton Inference Server. It allows development teams and organizations to run AI code completion entirely on their own hardware, with no data leaving their infrastructure. As a GitHub Copilot alternative, it is best suited for privacy-conscious developers, security-focused organizations, and self-hosted infrastructure enthusiasts who want AI code completion without sending code to external cloud services.

FauxPilot vs. GitHub Copilot: Quick Comparison

	FauxPilot	GitHub Copilot
Type	Self-Hosted Open-Source Copilot Server	IDE Extension / CLI
IDEs	Any IDE that supports the OpenAI API or GitHub Copilot plugin protocol	VS Code, JetBrains, Vim, Neovim, Visual Studio, Xcode
Pricing	Free (open source; hardware costs only)	Free for students/OSS; Individual $10/mo; Business $19/mo; Enterprise $39/mo
Models	SalesForce CodeGen models (various sizes); GPT-J format	OpenAI GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro
Privacy / hosting	Fully self-hosted (your own servers, no external data transfer)	Cloud (GitHub/Microsoft)
Open source	Yes	No
Offline / local models	Yes (fully offline once set up)	No

Key Strengths

Complete Data Privacy and Air-Gap Capability: Because FauxPilot runs entirely on your own hardware, your source code never leaves your infrastructure. This is a critical advantage for organizations working on sensitive, proprietary, or regulated code that cannot be sent to third-party cloud services. Air-gapped environments — such as government systems, defense contractors, or high-security financial institutions — can deploy FauxPilot without any external internet connectivity requirement for code completion.
Zero Ongoing Subscription Cost: FauxPilot is completely free to download, deploy, and use. The only costs are the hardware required to run the models (NVIDIA GPU with at least Compute Capability 6.0) and any infrastructure operational costs. For organizations with existing GPU infrastructure, FauxPilot eliminates per-seat subscription fees entirely. This makes it economically attractive for large development teams where GitHub Copilot's per-user pricing adds up significantly.
Compatible with the GitHub Copilot Plugin Protocol: FauxPilot implements the same API interface as GitHub Copilot, meaning existing Copilot plugin configurations can be pointed at a self-hosted FauxPilot instance with minimal reconfiguration. It also supports the OpenAI API protocol and REST API, giving teams multiple integration paths. This compatibility lowers the barrier to adoption for teams already familiar with Copilot-style tooling.
Open Source and Auditable: The entire FauxPilot codebase is publicly available on GitHub under an open-source license. Security teams can audit the code, verify behavior, and modify it to suit organizational requirements. Developers can contribute improvements, add support for new models, or adapt the server for specific use cases — something impossible with proprietary commercial tools.

Known Limitations

Requires NVIDIA GPU Hardware: FauxPilot depends on NVIDIA's Triton Inference Server with FasterTransformer backend, which requires an NVIDIA GPU with Compute Capability 6.0 or higher and sufficient VRAM for the chosen model size. Organizations without existing NVIDIA GPU infrastructure must invest in hardware before deployment. This hardware requirement makes FauxPilot unsuitable for developers running on CPU-only machines, AMD GPUs, or Apple Silicon without significant additional setup.
Model Quality Trails Frontier Models: FauxPilot uses SalesForce CodeGen models, which are capable but significantly less powerful than the frontier models (GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro) available in GitHub Copilot's premium tiers. The quality of code completions, especially for complex logic or multi-file context, will generally be lower than cloud-based competitors. Organizations prioritizing top suggestion quality over privacy may find the capability trade-off significant.

Best For

FauxPilot is best suited for security-conscious organizations, government agencies, defense contractors, and regulated financial or healthcare institutions that cannot send source code to external cloud services for compliance or security reasons. It is also ideal for teams and individual developers who have existing NVIDIA GPU infrastructure and want to eliminate per-seat subscription costs for AI code completion at scale. Open-source advocates and developers interested in self-hosting AI infrastructure will find FauxPilot an excellent learning and production project. Teams operating in air-gapped environments where external internet access is restricted or prohibited will benefit most from FauxPilot's fully offline operation.

Pricing

Free (Open Source): No subscription cost. Hardware requirements: NVIDIA GPU with Compute Capability ≥ 6.0, Docker, docker-compose ≥ 1.28, nvidia-docker, curl, and zstd. Model weights downloaded from Hugging Face during setup.

FauxPilot is open-source and free to use. Check the official GitHub repository for setup instructions and current requirements.

Tech Details

Type: Self-Hosted Open-Source Copilot API Server
IDEs: Any IDE or editor that supports the GitHub Copilot plugin protocol or OpenAI API (VS Code via Copilot plugin, REST API clients, custom integrations)
Key features: Self-hosted AI code completion, GitHub Copilot API compatibility, OpenAI API compatibility, REST API, Docker-based deployment, multi-GPU model sharding, SalesForce CodeGen models in GPT-J format via Hugging Face
Privacy / hosting: Fully self-hosted; no data sent externally; air-gap compatible; your hardware and infrastructure
Models / context window: SalesForce CodeGen models (2B, 6B, 16B parameters); GPT-J format via Hugging Face/Moyix; model choice depends on available VRAM

When to Choose FauxPilot Over GitHub Copilot

Your organization has strict data residency or code privacy requirements that prohibit sending source code to third-party cloud services
You operate in an air-gapped or offline environment where external internet connectivity for AI inference is not permitted
You have existing NVIDIA GPU infrastructure and want to eliminate per-seat subscription costs for AI code completion across a large development team
You require a fully auditable, open-source AI coding tool where the codebase and model behavior can be independently inspected and modified

When GitHub Copilot May Be a Better Fit

You need state-of-the-art frontier model quality for code completion (GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro) — FauxPilot uses CodeGen models that trail the latest frontier models in capability
You do not have NVIDIA GPU hardware available and want a plug-and-play solution that requires no infrastructure setup or maintenance — GitHub Copilot works immediately with a subscription and a supported IDE

Conclusion

FauxPilot is the definitive GitHub Copilot alternative for organizations that require complete data sovereignty and are willing to invest in the hardware and setup required to run AI code completion on their own infrastructure. Its open-source nature, zero subscription cost, and full offline capability make it a uniquely powerful option for security-sensitive environments, regulated industries, and large teams seeking to eliminate recurring AI tooling costs. Developers and organizations for whom privacy, auditability, and self-hosted control are non-negotiable will find FauxPilot the most trustworthy path to AI-assisted coding.

Sources

FAQ

Is FauxPilot free?

Yes, FauxPilot is completely free and open source. There are no subscription fees or usage costs. The only costs involved are the hardware to run it (an NVIDIA GPU with sufficient VRAM) and any associated infrastructure expenses such as electricity and server maintenance.

Does FauxPilot work with VS Code?

Yes, FauxPilot can be used with VS Code through the GitHub Copilot plugin by pointing the plugin's API endpoint to your self-hosted FauxPilot server. It also supports REST API and OpenAI API-compatible client integrations. Setup documentation is available on the FauxPilot GitHub wiki.

How does FauxPilot compare to GitHub Copilot?

FauxPilot replicates the GitHub Copilot API but runs entirely on your own hardware using open-source SalesForce CodeGen models. This means complete data privacy and zero subscription costs, but with lower model capability compared to GitHub Copilot's frontier models (GPT-4o, Claude 3.5 Sonnet). GitHub Copilot is far easier to set up (just install an extension), while FauxPilot requires NVIDIA GPU hardware, Docker, and technical infrastructure knowledge to deploy.

What GPU do I need to run FauxPilot?

FauxPilot requires an NVIDIA GPU with Compute Capability 6.0 or higher. The VRAM required depends on the CodeGen model size you choose: larger models (6B, 16B parameters) need more VRAM. You can split a model across multiple GPUs if needed — for example, two NVIDIA RTX 3080 GPUs can run the 6B model by splitting it evenly. The setup script (setup.sh) provides guidance on model selection based on your available VRAM.