Configuration

pwnkit is designed for zero-config usage, but every default can be overridden via CLI flags or environment variables.

Runtime modes

pwnkit is an agentic harness — bring your own AI. The --runtime flag controls which LLM backend powers the agents.

Runtime	Flag	Description
`api`	`--runtime api`	Uses your API key (OpenRouter, Anthropic, OpenAI). Best for CI and quick scans. Default.
`claude`	`--runtime claude`	Spawns the Claude Code CLI with your existing subscription. Best for deep analysis.
`codex`	`--runtime codex`	Spawns the Codex CLI. Best for source-level analysis.
`gemini`	`--runtime gemini`	Spawns the Gemini CLI. Best for large-context source analysis.
`auto`	`--runtime auto`	Auto-detects installed CLIs and picks the best one per pipeline stage.

API runtime

The default api runtime makes direct HTTP calls to an LLM provider. It requires one of these environment variables:

export OPENROUTER_API_KEY="sk-or-..."   # Recommended
export ANTHROPIC_API_KEY="sk-ant-..."
export AZURE_OPENAI_API_KEY="..."
export OPENAI_API_KEY="sk-..."

See API Keys for the full priority order and provider details.

CLI runtimes (claude, codex, gemini)

These runtimes spawn the respective CLI tool as a subprocess. You must have the CLI installed and authenticated:

# Claude Code CLI
npm i -g @anthropic-ai/claude-code

# Codex CLI
npm i -g @openai/codex

# Gemini CLI
npm i -g @anthropic-ai/gemini-cli   # or however it's distributed

Then use them:

npx pwnkit-cli scan --target https://api.example.com/chat --runtime claude
npx pwnkit-cli review ./my-repo --runtime codex --depth deep

Scan modes

The --mode flag controls what kind of target is being scanned.

Mode	Description
`llm`	Probe LLM/AI endpoints with prompt injection, jailbreaks, exfiltration, and tool poisoning attacks. Default for `scan`.
`web`	Deterministic checks for CORS misconfigurations, security headers, exposed files, and fingerprint leakage.
`mcp`	Scan MCP (Model Context Protocol) servers for tool poisoning and schema abuse. (Coming soon)

# LLM endpoint scan (default)
npx pwnkit-cli scan --target https://api.example.com/chat

# Web app scan
npx pwnkit-cli scan --target https://example.com --mode web

Depth settings

The --depth flag controls how thorough the scan is.

Depth	Test Cases	Typical Time	Best For
`quick`	~15	~1 min	CI pipelines, smoke tests
`default`	~50	~3 min	Day-to-day scanning
`deep`	~150	~10 min	Pre-launch audits, thorough review

npx pwnkit-cli scan --target https://api.example.com/chat --depth quick
npx pwnkit-cli audit express --depth deep
npx pwnkit-cli review ./my-repo --depth deep --runtime claude

Output formats

pwnkit supports multiple output formats:

Format	Description
`json`	Machine-readable JSON output for pipelines
`sarif`	SARIF format for the GitHub Security tab
`markdown`	Human-readable Markdown report

In CI (GitHub Action), set format: sarif to populate the Security tab:

- uses: peaktwilight/pwnkit@main
  with:
    mode: review
    path: .
    format: sarif

Diff-aware review

For PR workflows, review only changed files against a base branch:

npx pwnkit-cli review ./my-repo --diff-base origin/main --changed-only

This is particularly useful in CI to avoid scanning the entire codebase on every PR.

Verbose output

Use --verbose to see the animated attack replay and detailed agent reasoning:

npx pwnkit-cli scan --target https://api.example.com/chat --verbose