Amazon Q Developer
AWS-native AI assistant for building, securing, and operating software across the development lifecycle.
Open-source, self-hosted GitHub Copilot server using SalesForce CodeGen models on NVIDIA Triton — fully offline, zero subscription cost, and complete code privacy.
FauxPilot is an open-source, self-hosted server that replicates the GitHub Copilot API using the SalesForce CodeGen models running on NVIDIA's Triton Inference Server. It allows development teams and organizations to run AI code completion entirely on their own hardware, with no data leaving their infrastructure. As a GitHub Copilot alternative, it is best suited for privacy-conscious developers, security-focused organizations, and self-hosted infrastructure enthusiasts who want AI code completion without sending code to external cloud services.
| FauxPilot | GitHub Copilot | |
|---|---|---|
| Type | Self-Hosted Open-Source Copilot Server | IDE Extension / CLI |
| IDEs | Any IDE that supports the OpenAI API or GitHub Copilot plugin protocol | VS Code, JetBrains, Vim, Neovim, Visual Studio, Xcode |
| Pricing | Free (open source; hardware costs only) | Free for students/OSS; Individual $10/mo; Business $19/mo; Enterprise $39/mo |
| Models | SalesForce CodeGen models (various sizes); GPT-J format | OpenAI GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro |
| Privacy / hosting | Fully self-hosted (your own servers, no external data transfer) | Cloud (GitHub/Microsoft) |
| Open source | Yes | No |
| Offline / local models | Yes (fully offline once set up) | No |
FauxPilot is best suited for security-conscious organizations, government agencies, defense contractors, and regulated financial or healthcare institutions that cannot send source code to external cloud services for compliance or security reasons. It is also ideal for teams and individual developers who have existing NVIDIA GPU infrastructure and want to eliminate per-seat subscription costs for AI code completion at scale. Open-source advocates and developers interested in self-hosting AI infrastructure will find FauxPilot an excellent learning and production project. Teams operating in air-gapped environments where external internet access is restricted or prohibited will benefit most from FauxPilot's fully offline operation.
FauxPilot is open-source and free to use. Check the official GitHub repository for setup instructions and current requirements.
FauxPilot is the definitive GitHub Copilot alternative for organizations that require complete data sovereignty and are willing to invest in the hardware and setup required to run AI code completion on their own infrastructure. Its open-source nature, zero subscription cost, and full offline capability make it a uniquely powerful option for security-sensitive environments, regulated industries, and large teams seeking to eliminate recurring AI tooling costs. Developers and organizations for whom privacy, auditability, and self-hosted control are non-negotiable will find FauxPilot the most trustworthy path to AI-assisted coding.
Yes, FauxPilot is completely free and open source. There are no subscription fees or usage costs. The only costs involved are the hardware to run it (an NVIDIA GPU with sufficient VRAM) and any associated infrastructure expenses such as electricity and server maintenance.
Yes, FauxPilot can be used with VS Code through the GitHub Copilot plugin by pointing the plugin's API endpoint to your self-hosted FauxPilot server. It also supports REST API and OpenAI API-compatible client integrations. Setup documentation is available on the FauxPilot GitHub wiki.
FauxPilot replicates the GitHub Copilot API but runs entirely on your own hardware using open-source SalesForce CodeGen models. This means complete data privacy and zero subscription costs, but with lower model capability compared to GitHub Copilot's frontier models (GPT-4o, Claude 3.5 Sonnet). GitHub Copilot is far easier to set up (just install an extension), while FauxPilot requires NVIDIA GPU hardware, Docker, and technical infrastructure knowledge to deploy.
FauxPilot requires an NVIDIA GPU with Compute Capability 6.0 or higher. The VRAM required depends on the CodeGen model size you choose: larger models (6B, 16B parameters) need more VRAM. You can split a model across multiple GPUs if needed — for example, two NVIDIA RTX 3080 GPUs can run the 6B model by splitting it evenly. The setup script (setup.sh) provides guidance on model selection based on your available VRAM.
AWS-native AI assistant for building, securing, and operating software across the development lifecycle.
Terminal-based AI coding agent that plans and executes large tasks spanning multiple files.