LAPRAS — AI Pokemon Red Player

Featured Pokemon

Architecture

graph LR
    CC["Claude Code
Screen Recognition + Decision"] -->|"stdio"| MCP["MCP Server
gameboy_mcp_server.py"]
    MCP -->|"TCP :9876"| EMU["emulator.py
TCP Client"]
    EMU -->|"TCP"| PB["pyboy_server.py
PyBoy 60fps + SDL2"]
    PB -->|"JSON"| EMU
    PB -->|"mmap
/tmp/pyboy_screen.shm"| API["game_state_api.py
SSE Streaming"]
    EMU -->|"Response"| MCP
    MCP -->|"stdio"| CC

    style CC fill:#1a0a0a,stroke:#f8d030,color:#f8d030
    style MCP fill:#3a1010,stroke:#e03030,color:#f0e8e0
    style EMU fill:#3a1010,stroke:#e03030,color:#f0e8e0
    style PB fill:#3a1010,stroke:#e03030,color:#f0e8e0
    style API fill:#3a1010,stroke:#e03030,color:#f0e8e0

Claude loops autonomously — streaming reads screen directly via shared memory (mmap)

Boot Sequence

Boot sequence from Claude Code startup to gameplay. The PyBoy server runs as a separate process and connects via TCP.

Start PyBoy Server (Separate Process)

Launch pyboy_server.py in a separate terminal. An SDL2 window opens and starts listening on TCP :9876.

Start Claude Code

Opening Claude Code in this directory auto-detects .mcp.json. The MCP server starts as a child process.

load_rom → TCP Connection

When Claude Code calls load_rom, emulator.py connects to pyboy_server.py via TCP. JP/EN is auto-detected from the ROM header.

Gameplay Begins

From here, operations flow through Claude Code → MCP (stdio) → TCP → PyBoy server. Real-time display via the SDL2 window.

Stream Overlay (Optional)

game_state_api.py reads the screen directly from shared memory (mmap), then streams to the browser via SSE. Pokemon Red Theme in 16:9.

graph TD
    subgraph parent["Claude Code (Parent Process)"]
        CC[Claude Code]
    end

    CC -->|"stdio (JSON-RPC)"| MCP
    MCP -->|"stdio Response"| CC

    subgraph child["Child Process: gameboy_mcp_server.py"]
        MCP["FastMCP
gameboy_mcp_server.py"]
        MCP -->|"Function Call"| EMU["emulator.py
(TCP Client)"]
        EMU -->|"Response (JSON)"| MCP
    end

    EMU -->|"TCP :9876 Send Command"| SRV
    SRV -->|"TCP Response (JSON)"| EMU

    subgraph server["Separate Process: pyboy_server.py"]
        SRV["PyBoy TCP Server
PNG Generation → mmap"]
        SRV --- PB["PyBoy Emulator
60fps + SDL2 Display"]
    end

    SRV -->|"mmap
/tmp/pyboy_screen.shm"| API

    subgraph optional["Optional: game_state_api.py"]
        API["FastAPI + SSE
Stream Overlay"]
    end

    API -->|"Imports emulator.py
Gets state via TCP :9876"| EMU2["emulator.py
(Shared TCP Connection)"]
    EMU2 -->|"TCP :9876"| SRV

    CC -->|"Auto-accumulates movement experience"| NAV["nav_memory.py
Self-Learning Nav"]
    CC -->|"Records play experience"| MEM["CLAUDE.md / Memory
Self-Improvement Rules"]

    style parent fill:#1a0a0a,stroke:#e03030,stroke-width:2px,color:#f0e8e0
    style child fill:#2a1010,stroke:#e03030,stroke-width:2px,color:#f0e8e0
    style server fill:#2a1010,stroke:#f8d030,stroke-width:2px,color:#f0e8e0
    style optional fill:#1a0a0a,stroke:#8a7a70,stroke-width:1px,stroke-dasharray:5,color:#f0e8e0
    style CC fill:#3a1010,stroke:#f8d030,color:#f8d030
    style MCP fill:#3a1010,stroke:#e03030,color:#f0e8e0
    style EMU fill:#3a1010,stroke:#e03030,color:#f0e8e0
    style SRV fill:#4a1515,stroke:#f8d030,color:#f8d030
    style PB fill:#4a1515,stroke:#f8d030,color:#f8d030
    style API fill:#2a1515,stroke:#8a7a70,color:#8a7a70
    style EMU2 fill:#2a1515,stroke:#8a7a70,color:#8a7a70
    style NAV fill:#1a2010,stroke:#48d848,color:#48d848
    style MEM fill:#1a2010,stroke:#48d848,color:#48d848

* Bidirectional arrows represent request/response flows / Green nodes represent self-improvement (map learning + rule accumulation)

How it Works

TCP Client/Server

PyBoy runs as a separate process with an SDL2 window. The MCP server controls it remotely via TCP.

JP/EN Auto-Detect

Auto-detects Japanese or English ROM from the header. Automatically switches character tables.

Auto Screen Return

After button presses, waits for the specified wait_frames then auto-returns the screen. Cuts tool call count in half.

Separate Process

PyBoy runs as a separate process at constant 60fps + SDL2 display. Loosely coupled with the MCP server via TCP.

Self-Improvement

Movement experience is auto-learned by nav_memory.py. Gameplay know-how is recorded in CLAUDE.md / Memory for use in future plays.

Director Direct

No LLM advisors needed. Director uses battle_calc + nav_memory + objective computation modules directly for all decisions.

Play Flow

Load ROM

Start the emulator with load_rom("/path/to/pokemon_red.gb")

Check State

Get structured JSON with get_game_state. Understand the scene, coordinates, and party

Execute Actions

Use press_button for step-by-step control, do_action for walking/text. Battles are handled one press at a time

Decide & Repeat

Analyze the returned screen/JSON to decide the next action. Autonomously loops steps 2-4

Self-Learning

Movement successes and failures are auto-recorded by nav_memory.py. Learns walls, dead ends, and map transitions for shortest routes next time

MCP Tools

Tool	Args	Description
`press_button`	`button`, `wait_frames`, `include_image`	Press a button. Returns JSON by default, include_image=true for screen
`press_buttons`	`buttons`, `interval_ms`, `wait_frames`	Sequential button input, returns screen after last press
`hold_button`	`button`, `frames`, `wait_frames`	Hold button then return screen
`wait`	`seconds`	Wait specified seconds then return screen
`get_game_state`	-	Structured JSON (scene / player / party / battle)
`get_collision_map`	-	Collision map (9x10) + player direction + NPC + door positions
`get_wide_map`	-	Full map walkability (tri-state: 2=confirmed walkable, 0=wall, -1=unknown) + grass grid
`press_button_fast`	`button`, `wait_frames`	Button press + JSON state (no image)
`press_buttons_fast`	`buttons`, `interval_ms`	Sequential input + JSON state (no image)
`do_action`	`action`, `count`, `direction`	Batch walk or text advance. Interrupts on encounter/map transition
`navigate_to`	`target_x`, `target_y`, `target_map_id`	Self-learning nav auto-moves to destination
`navigate_smart`	`target_type`, `direction`, `target_map_id`	Intent-based movement (exit/explore/transition/grass)
`load_rom`	`rom_path`, `headless`	Load ROM and start emulator
`quit_emulator`	-	Stop emulator (with save)
`say`	`text`	Display Claude's live commentary on the overlay
`get_emulator_info`	-	Check emulator status

Director Direct Architecture

No LLM advisors. The Director uses computation modules directly for all decisions.

graph TD
    DIR["Director
Claude Code Opus
MCP Control + All Decisions"] -->|"MCP stdio"| MCP["gameboy_mcp_server.py"]

    BC["battle_calc.py
Type Matchup + Damage Estimation"] -->|"Pre-computed Data"| DIR
    NM["nav_memory.py
Self-learning Map Knowledge"] -->|"Walk History + Grass Tiles"| DIR
    NA["nav_analyst.py
Real-time Analysis"] -->|"Auto-runs Every Step"| DIR
    OBJ["objective.py
Goal Management"] -->|"Progress Tracking"| DIR

    style DIR fill:#3a1010,stroke:#f8d030,color:#f8d030
    style MCP fill:#3a1010,stroke:#e03030,color:#f0e8e0
    style BC fill:#1a2010,stroke:#48d848,color:#48d848
    style NM fill:#1a2010,stroke:#48d848,color:#48d848
    style NA fill:#1a2010,stroke:#48d848,color:#48d848
    style OBJ fill:#1a2010,stroke:#48d848,color:#48d848

No LLM advisors needed. Director reads battle_calc + nav_memory + objective results directly.

🎮 Director

The Claude Code session controls the game via MCP. Uses battle_calc, nav_memory, and objective directly for all decisions.

📊 battle_calc.py

Gen1 type matchup table (15x15), all 165 moves, type data for 151 species. Pre-computes STAB and damage % for Director's move selection.

🗺️ nav_memory.py

Auto-accumulates walls, transitions, and grass tiles during play. Provides collision_cache for wide_map and encounter-based grass detection.

📡 nav_analyst.py

Runs real-time analysis automatically on every do_action. Detects frontiers, exits, and movement loops.

🎯 objective.py

Goal setting, progress tracking, and review triggers. Manages grind/gym/heal/explore/story goal types.

🔧 collision_map.py

Collision map generation, A* pathfinder, wide_map construction. Tri-state system (confirmed/wall/unknown) lets A* traverse unexplored areas.

Buttons

Start

Select

Down

Left

Right

wait_frames Guide

Scene	wait_frames	Time
Menu select	10 (default)	~0.17s
Text advance	20	~0.33s
Character move	15	~0.25s
Screen transition	30-60	~0.5-1s
Battle animation	60-180	~1-3s

Setup

# Python environment
python3 -m venv .venv
.venv/bin/pip install -r requirements.txt

# Start Claude Code in this directory
# .mcp.json auto-connects the gameboy MCP server

# Give Claude an instruction
# e.g. "Load Pokemon Red and play it"
      

Tech Stack

Python 3.10+Runtime

PyBoy 2.7.0GB/GBC Emulator

MCP SDK 1.26+Claude Code Protocol

Pillow 10+Image Processing

MCP Server Implementation

MCP Configuration

.mcp.json

When Claude Code starts in this directory, the MCP server auto-connects with the following configuration.

{ "mcpServers": { "gameboy": { "command": ".venv/bin/python3", "args": ["src/gameboy_mcp_server.py"], "env": { "PYTHONPATH": "src" } } } }

gameboy_mcp_server.py

src/gameboy_mcp_server.py

The MCP server core built with FastMCP. Exposes all 16 tools and communicates via stdio transport.

Button Tools (4)

press_button — Button press (JSON by default, image optional)
press_buttons — Sequential input, returns screen after last press
hold_button — Hold + return screen after release
wait — Wait specified seconds + return screen

State Tools (4)

get_game_state — Structured JSON (scene/party/battle)
get_collision_map — Collision map + NPC positions
press_button_fast — Button press + JSON state
press_buttons_fast — Sequential input + JSON state

Composite Tools (2)

do_action — Batch walk and text advance
navigate_to — Self-learning nav auto-move

Lifecycle & Utility (4)

load_rom — Load ROM + start emulator
quit_emulator — Stop with save
get_emulator_info — Runtime status and cartridge info
say — Display Claude's live commentary on overlay

collision_map.py

src/collision_map.py

Screen collision map (9x10) + full map wide_map construction + A* pathfinder.

Screen: 18x20 tile map downsampled to 9x10 grid
wide_map: tri-state (2=walkable, 0=wall, -1=unknown), A* traverses unknown at cost 3
collision_cache: persisted to nav_memory.json, survives server restarts
Grass grid: generated from nav_memory encounter history
A* algorithm finds shortest path avoiding walls and NPCs
Tile-pair collision table handles ledge (one-way drop) traversal
Dynamic collision updates that treat NPCs as obstacles
ASCII map display (█=wall, ·=path, S=NPC, ↓=player)

battle_calc.py

src/battle_calc.py

Gen1 battle calculation engine. Provides type matchups, move data, and damage estimation, passing pre-computed data to the Battle Advisor.

Gen1 type matchup table (15x15) — accurately calculates neutral/resist/immune/super effective
Database of all 165 moves (type, power, accuracy, physical/special/status)
Type data for all 151 species (single type and dual type)
enrich_battle_context() — batch computes matchup labels, damage %, STAB checks, and defensive matchups for switch candidates
Special handling for fixed-damage moves (Sonic Boom, Dragon Rage, etc.) and OHKO moves

agent_log.py

src/agent_log.py

Director state tracking. Displays real-time status on the overlay.

Director lifecycle management (IDLE → THINKING → DONE → IDLE)
Timeouts: auto-revert to IDLE after 60s THINKING / 10s DONE
SSE agents event pushes only on state changes

emulator.py

src/emulator.py

PyBoy TCP client. Connects to pyboy_server.py to control the emulator.

Singleton PyBoy instance with threading.Lock for mutual exclusion
Background thread runs _tick_loop at 60fps for frame advancement
get_screen_bytes — PIL Image → JPEG bytes conversion (for MCP). Streaming reads PNG directly via mmap
press_button — button press → sleep for wait_frames/60 seconds → return screen
press_button_hold — press → hold → release → return screen
ROM path validation: only .gb / .gbc allowed, resolved via os.path.realpath
Window mode: selectable between SDL2 (GUI) and null (headless)

Target Game

Pokemon Red / Blue (Gen1)

Both Red and Blue versions supported. Turn-based battle + map navigation pairs well with MCP control.

History

v1 - 2026/03/29

Initial implementation with screencapture + Anthropic API loop approach

v2 - 2026/03/31

Full rewrite to PyBoy + MCP server approach. Auto screen return via wait_frames

v2.1 - 2026/04/04

Removed dashboard, removed get_screen, added project intro HTML

v3 - 2026/04/04

Scene detection (8 scenes), composite tools (do_action), 165 move name table, stream overlay (Pokemon Red Theme, anime.js), action log, PokeAPI sprite integration

v3.1 - 2026/04/04

TCP client/server separation (SDL2 window display support), Japanese ROM auto-detection, hiragana character table added, fixed scene detection false positives at game start

v3.2 - 2026/04/05

Button input fix (button_press/release method), full _INTERNAL_TO_DEX fix, map navigation data (map_data.py), stream overlay JSON display, scene detection battle priority, do_action encounter interrupt detection, viewer UI redesign (Pokemon Red Theme), first test play completed (Pallet Town → Viridian City → Pokedex obtained)

v3.3 - 2026/04/06

Collision map & A* pathfinder implementation (auto wall detour), A* detour integrated into navigate_to (_walk_toward auto-finds detour on wall detection), Route 1 cleared → arrived at Viridian City, Charmander Lv.9 (learned Ember)

v3.4 - 2026/04/07

Stream overlay speedup — replaced screen transfer from TCP to shared memory (mmap), ~60x latency improvement. RGBA→RGB conversion fix, PNG format adoption eliminated artifacts. Scanline CSS disabled (H.264 moire countermeasure), player name display added. Audio stability confirmed

v3.5 - 2026/04/07

Token efficiency & TCP optimization — bulk read reduced TCP calls from 40-60 to 1-2, collision_map separation (saving 300-400 tokens per call), do_action(walk) per-step check removed (scene byte only), press_button image made optional, state cache layer added. ~95% token reduction for 20-step walks

v3.6 - 2026/04/07

Self-learning navigation (nav_memory.py), map transition detection added (auto-wait on building entry/exit in do_action/navigate_to), Pokemon Center exit coordinate fix

v3.7 - 2026/04/07

Project cleanup — CLAUDE.md reduced by 65% (removed duplicate MCP tool listings), rule files consolidated from 10 to 4 (69% reduction), memory files organized, map_data.py removed (fully migrated to nav_memory.py), dead code detection, README.md updated

v4.0 - 2026/04/07

Multi-agent gameplay — Implemented Director+Advisor architecture. battle_calc.py (Gen1 type matchup table 15x15, all 165 moves, 151 species types, damage estimation), 4 advisors (Battle/Nav/Strategist/Map Analyst), agent state visualization (agent_log.py → overlay AGENTS panel), Claude live commentary MCP tool (say), collision_map AI LOG display, session event log, 137 tests all passing

v4.1 - 2026/04/07

Wide map & overlay improvements — Added get_wide_map MCP tool (reads full-map collision data), compacted overlay right panel (reduced font, sprite, padding sizes), fully removed Bookmark feature, fixed Director state to immutable composition, verified multi-agent operation (Navigation/Strategist/Map Analyst parallel launch confirmed)

v4.2 - 2026/04/07

Map Analyst removal & DESIGN.md — Replaced Map Analyst agent with nav_memory.get_exploration_stats() (saves one Haiku call per cycle), exploration_stats fed directly to Navigation Advisor and Strategist, introduced DESIGN.md for unified design tokens, added English index_en.html

v4.3 - 2026/04/08

Grass detection fix & autonomous loop — Fixed cave tiles being misidentified as grass, separated JP/EN detection logic, door display (collision map D marker), autonomous game loop (objective.py for goal management & Strategist auto-trigger), semantic navigation (smart_nav.py for intent-based movement), CLAUDE.md compressed by 52%

v1.0.0 - 2026/04/09 🎉

First official release — Director-direct architecture (removed LLM advisors, battle_calc + nav_memory + objective handle all decisions within the Director). nav_analyst.py (real-time map analysis on every do_action), capture parameter fully removed from press_button. Demonstrated autonomous play: Viridian Forest → Pewter City, Pokemon Center healing, wild battle auto-handling.

v1.0.1 - 2026/04/13

Map data bug fix & code cleanup — Fixed JP/EN map data parsing. Introduced two-layer wide_map fallback (collision_cache + border block comparison). Grass grid now based on nav_memory encounter history. Dead code removal.

v2.0.0 - 2026/04/14 — LAPRAS 🌊

Navigation overhaul (P0-P6) & project renamed to "LAPRAS" — Tri-state wide_map (unknown≠wall, A* traverses unexplored at cost 3). Collision detection accuracy improved to 100%. collision_cache persisted to nav_memory.json (survives restarts). no_progress threshold 4→12 (persevere in mazes). Frontier limit removed + diversity scoring. HP safety check (faint prevention). Warp pre-recording on map transitions. PyBoy headless mode for parallel testing. 8 concurrent Claude Code Agent() investigations discovered and implemented all improvements. Refactored: unified JP/EN detection, removed redundant TCP calls, extracted tri-state constants.

v2.0.1 - 2026/04/16

Navigation coordinate fix & door avoidance — Unified warp/player coordinate systems, added //2 grid conversion to A* targets (navigate_to now reaches short-distance targets accurately). Added door avoidance to find_path (prevents navigate_to from entering buildings mid-route). Removed aggressive collision_cache mass-invalidation on stuck (preserves high-accuracy cache data). Adaptive waypoint selection on wall hits (references further wide_map waypoints as no_progress increases, enabling building detours). Dynamic tileset-aware collision detection for greater accuracy. Added mark_wall/set_speed commands to pyboy_server.

v2.0.2 - 2026/04/17

Ledge direction check & door avoidance unification — Added movement direction check to ledge (one-way tile) detection: south jumps allowed, north climbing blocked. Unified door avoidance goal exclusion across all A* calls in navigate_to (waypoint and screen-edge goals no longer blocked when coinciding with door positions). Extracted _DIR_BLK to module-level constant. Beat Pewter City Gym (Brock) with 2 HP remaining, earned Boulder Badge.