Virtual bash for AI agents

An awesomely fast virtual bash sandbox. Written in Rust.

Bashkit runs untrusted shell scripts from AI agents without spawning a single OS process. 160 reimplemented commands, substantial POSIX shell language coverage, a virtual filesystem, resource limits, and tool interfaces for agent frameworks — all in-memory, all sandboxed.

Quick starts Rust Python TypeScript

Start in Rust

Install the crate.

cargo add bashkit

Go deeper

Rust docs docs.rs reference Examples Rust, Python, TS

Built-in commands 160 Threats mitigated 250+ Haiku 4.5 eval 97%

Runtime surface

Browse the builtins that make the sandbox usable.

Text processing, files, archives, network, Python, TypeScript, and shell control all live in-process.

grepsedawkjqcurlfindxargstargitsshpythontypescript

See all 160 builtins

Proof

The core runtime, in context.

The core API is just `Bash`, plus virtual FS and limits. No sidecar process, no container bootstrap.

use bashkit::Bash;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let mut bash = Bash::new();
    let out = bash.exec("printf 'ready\n'").await?;
    print!("{}", out.stdout);
    Ok(())
}

No process spawning

Every command is reimplemented in Rust. No fork, no exec, no shell escape.

Virtual filesystem

InMemoryFs, OverlayFs, MountableFs. Host access only when you explicitly mount it.

Resource limits

Caps on commands, loops, output, input, and filesystem size. DoS-resistant by construction.

Product surface

What bashkit gives you

A single runtime you can embed in agents, CLIs, editors, and evaluation harnesses. No sidecar process, no container overhead, no external dependencies at runtime.

POSIX-compliant interpreter

Substantial IEEE 1003.1-2024 Shell Command Language coverage, plus bash extensions: arrays, [[ ]], brace expansion, extended globs, coprocesses, traps.

160 reimplemented commands

grep, sed, awk, jq, curl, tar, find, xargs, and 150+ more — pure Rust, no shelling out.

LLM tool contract

BashTool with discovery metadata, streaming output, and system prompts. Plug into any agent framework.

MCP server

bashkit mcp exposes the interpreter over Model Context Protocol for Claude, Cursor, and other clients.

Snapshotting

Serialize shell state and VFS contents to bytes. Checkpoint any workload, resume anywhere.

Scripted tool orchestration

Compose ToolDef + callback pairs into a ScriptedTool driven by a bash script.

Quick start

Three languages, one runtime

Start with the core Rust crate, or drop the same runtime into Python and TypeScript when you need it inside an existing stack.

Rust

The core crate

Install

cargo add bashkit

use bashkit::Bash;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let mut bash = Bash::new();
    let out = bash.exec("echo hello world").await?;
    println!("{}", out.stdout);
    Ok(())
}

Python

PyO3 wheel with direct Bash API

Install

pip install bashkit

from bashkit import Bash

bash = Bash()
result = bash.execute_sync("echo 'Hello, World!'")
print(result.stdout)

bash.execute_sync("export APP_ENV=dev")
print(bash.execute_sync("echo $APP_ENV").stdout)

TypeScript

NAPI-RS runtime for Node, Bun, Deno

Install

npm i @everruns/bashkit

import { Bash } from "@everruns/bashkit";

const bash = new Bash();
const result = bash.executeSync('echo "Hello, World!"');
console.log(result.stdout);

bash.executeSync("X=42");
console.log(bash.executeSync("echo $X").stdout);

use bashkit::Bash;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let mut bash = Bash::new();
    bash.exec("mkdir -p /tmp/data").await?;
    bash.exec("echo 'hello' > /tmp/data/out.txt").await?;

    let r = bash.exec("cat /tmp/data/out.txt | tr a-z A-Z").await?;
    print!("{}", r.stdout); // HELLO
    Ok(())
}

Security

Hostile input is the default assumption

Defense in depth across every layer — process, filesystem, network, parser, and runtime. See the full threat model for 250+ mitigations.

No process spawning

160 commands reimplemented in Rust — no fork, exec, or shell escape.

Virtual filesystem

Scripts see an in-memory FS by default. No host access unless mounted.

Network allowlist

HTTP is denied by default. Each domain must be explicitly allowed.

Resource limits

Caps on commands (10K), loops (100K), function depth, output (10MB), input (10MB).

Parser limits

Timeout, fuel budget, AST depth — pathological input can't hang the interpreter.

Panic recovery

Every builtin is wrapped in catch_unwind. A panic in one command can't crash the host.

LLM evals

How well do LLMs use bashkit?

Bashkit ships with a 58-task LLM eval harness across 15 agentic categories. Results below are from the 2026-02-28 run:

Model	Score	Tasks passed
Claude Haiku 4.5	97%	54/58
Claude Sonnet 4.6	93%	48/58
Claude Opus 4.6	91%	50/58
GPT-5.3-Codex	91%	51/58
GPT-5.2	77%	41/58

Open source

Pick a path and go deeper.

The crate is MIT-licensed. These are the links that actually help you evaluate, integrate, and operate it.

docs.rs

An awesomely fast virtual bash sandbox. Written in Rust.

Browse the builtins that make the sandbox usable.

What bashkit gives you

POSIX-compliant interpreter

160 reimplemented commands

LLM tool contract

MCP server

Snapshotting

Scripted tool orchestration

Three languages, one runtime

The core crate

PyO3 wheel with direct Bash API

NAPI-RS runtime for Node, Bun, Deno

Hostile input is the default assumption

No process spawning

Virtual filesystem

Network allowlist

Resource limits

Parser limits

Panic recovery

How well do LLMs use bashkit?

Pick a path and go deeper.

Rust API

Python

TypeScript

Threat model

MCP server

Examples