copilot-cli

Standalone local server that exposes GitHub Copilot as a REST API — like Ollama for Copilot.

Runs at http://localhost:18966 and is auto-detected by the app settings.

Quick Start

# Install
cd copilot-cli
npm install

# First time — authenticate with GitHub
npm run dev          # follow the device-code link printed in terminal

# Start the server (keep running)
npm run dev

Endpoints

Method	Path	Description
POST	`/v1/chat/completions`	OpenAI-compatible chat (stream & non-stream)
GET	`/status`	Health + auth check
GET	`/models`	List available models
POST	`/chat/completions`	Chat (poll-based, returns `runId`)
POST	`/chat/stream`	Chat (SSE streaming, legacy format)
GET	`/run/:runId`	Poll for run result

How It Works

Uses @github/copilot-sdk which reads the auth token from gh copilot (GitHub CLI)
Manages a persistent CopilotClient session (no 3-5s startup per request)
Session mutex ensures one LLM call at a time (SDK limitation)
Auto-retry on auth errors and stuck sessions

API Examples

Check status

curl http://localhost:18966/status
# {"status":"ok","authenticated":true}

List models

curl http://localhost:18966/models
# {"models":[{"id":"gpt-4o","name":"GPT-4o",...}, ...]}

OpenAI-compatible chat (non-streaming)

curl -X POST http://localhost:18966/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "claude-sonnet-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
# {"id":"chatcmpl-...","object":"chat.completion","choices":[{"message":{"role":"assistant","content":"Hi!"},"finish_reason":"stop"}],...}

OpenAI-compatible chat (streaming)

curl -N -X POST http://localhost:18966/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "claude-sonnet-4",
    "messages": [{"role": "user", "content": "Hello!"}],
    "stream": true
  }'
# data: {"id":"chatcmpl-...","choices":[{"delta":{"role":"assistant"},"finish_reason":null}]}
# data: {"id":"chatcmpl-...","choices":[{"delta":{"content":"Hi!"},"finish_reason":null}]}
# data: {"id":"chatcmpl-...","choices":[{"delta":{},"finish_reason":"stop"}]}
# data: [DONE]

Tool Calling (OpenAI-compatible)

The /v1/chat/completions endpoint supports the standard OpenAI/Anthropic tool calling pattern. Tools are executed client-side — no callback server needed.

Step 1: Send request with tool definitions

curl -X POST http://localhost:18966/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "claude-sonnet-4",
    "messages": [{"role": "user", "content": "What is the weather in London?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get weather for a city",
        "parameters": {
          "type": "object",
          "properties": { "city": { "type": "string" } },
          "required": ["city"]
        }
      }
    }]
  }'

Response (model wants to call a tool):

{
  "id": "chatcmpl-...",
  "object": "chat.completion",
  "choices": [{
    "message": {
      "role": "assistant",
      "content": null,
      "tool_calls": [{
        "id": "call_abc123",
        "type": "function",
        "function": { "name": "get_weather", "arguments": "{\"city\":\"London\"}" }
      }]
    },
    "finish_reason": "tool_calls"
  }]
}

Step 2: Execute tool locally, send results back

curl -X POST http://localhost:18966/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "claude-sonnet-4",
    "messages": [
      {"role": "user", "content": "What is the weather in London?"},
      {"role": "assistant", "content": null, "tool_calls": [{"id": "call_abc123", "type": "function", "function": {"name": "get_weather", "arguments": "{\"city\":\"London\"}"}}]},
      {"role": "tool", "tool_call_id": "call_abc123", "content": "14°C, overcast"}
    ],
    "tools": [...]
  }'

Response (final answer):

{
  "choices": [{
    "message": { "role": "assistant", "content": "The weather in London is 14°C and overcast." },
    "finish_reason": "stop"
  }]
}

Run the demo

npx tsx examples/openai-tool-calling-demo.ts

Legacy endpoints

The original /chat/stream and /chat/completions endpoints still work with the callback-based tool calling pattern. See tool-calling-demo.ts.

Chat (legacy streaming)

curl -N -X POST http://localhost:18966/chat/stream \
  -H 'Content-Type: application/json' \
  -d '{"model":"claude-sonnet-4.6","messages":[{"role":"user","content":"Hello!"}]}'

Chat (legacy poll-based)

# Submit
curl -X POST http://localhost:18966/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{"model":"claude-sonnet-4.6","messages":[{"role":"user","content":"Hello!"}]}'
# {"runId":"abc-123","status":"queued"}

# Poll
curl http://localhost:18966/run/abc-123
# {"runId":"abc-123","status":"completed","result":{"content":"Hi there!","model":"gpt-4o",...}}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
examples		examples
src		src
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

copilot-cli

Quick Start

Endpoints

How It Works

API Examples

Check status

List models

OpenAI-compatible chat (non-streaming)

OpenAI-compatible chat (streaming)

Tool Calling (OpenAI-compatible)

Step 1: Send request with tool definitions

Step 2: Execute tool locally, send results back

Run the demo

Legacy endpoints

Chat (legacy streaming)

Chat (legacy poll-based)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

copilot-cli

Quick Start

Endpoints

How It Works

API Examples

Check status

List models

OpenAI-compatible chat (non-streaming)

OpenAI-compatible chat (streaming)

Tool Calling (OpenAI-compatible)

Step 1: Send request with tool definitions

Step 2: Execute tool locally, send results back

Run the demo

Legacy endpoints

Chat (legacy streaming)

Chat (legacy poll-based)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages