Claude et al. play the brilliant puzzle title "Baba is You"
https://github.com/user-attachments/assets/cb6d0d49-b583-46a2-b262-1cf38279a531
We evaluate language models' meta-level reasoning in an interactive setting, like the upcoming ARC-AGI-3, except we use the brilliant puzzle game "Baba Is You". In short, the player manipulates the game rules by pushing word blocks around to form or destroy rules. We provide an MCP server to interact with the game in text format. The project is currently an alpha and not stable. Contributions are welcome, and brave devs with model credits to spare are invited to give it a try.
Data folder:
macOS: /Users/[username]/Library/Application Support/Steam/steamapps/common/Baba Is You/Baba Is You.app/Contents/Resources/Data
Windows: C:\Program Files (x86)\Steam\steamapps\common\Baba Is You\Data
Install the Python dependencies (for example with uv):
pip install mcp fastmcp pyautogui configparser
Run the setup script (this involves io.lua, which in turn requires restarting the game):
cd baba_is_eval
chmod +x setup.sh
./setup.sh
Game executable on macOS:
/Users/[username]/Library/Application Support/Steam/steamapps/common/Baba Is You/Baba Is You.app/Contents/MacOS/Chowdren
Start the MCP server:
mcp dev baba_is_eval/game_mcp.py
This setup is MCP-client-agnostic. On top of it, you can use a client such as Claude Desktop to have a model interact with the server and play the game.
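As a rough illustration of hooking the server into a client, the sketch below builds an `mcpServers` entry of the kind Claude Desktop reads from its configuration file. The server name, the script path, and the launch command (`mcp run <script>`) are all assumptions made for this example; adjust them to however you start `game_mcp.py` on your machine.

```python
import json


def claude_desktop_entry(server_script: str, name: str = "baba-is-eval") -> dict:
    """Build an `mcpServers` entry for a stdio MCP server.

    Assumption: the server is launched with `mcp run <script>`.
    The name "baba-is-eval" is only an illustrative label.
    """
    return {
        "mcpServers": {
            name: {
                "command": "mcp",
                "args": ["run", server_script],
            }
        }
    }


if __name__ == "__main__":
    # Placeholder path: point this at game_mcp.py inside the game's Data folder.
    entry = claude_desktop_entry("/path/to/Data/baba_is_eval/game_mcp.py")
    print(json.dumps(entry, indent=2))
```

The printed JSON can be merged by hand into the client's existing configuration; the sketch deliberately does not write to any file.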
The server provides these tools for interacting with the game:
enter_level(level: str) - Enter a specific level (e.g., "1", "2", "3")
get_game_state() - Get the current game state as a matrix
execute_commands(commands: str) - Execute movement commands (e.g., "right,up,down")
undo_multiple(n: int) - Undo the last n moves
restart_level() - Restart the current level
leave_level() - Exit the current level
game_rules(topic: str) - Get help on game rules
The game state is returned as a matrix like this:
y/x |  1  |  2  |  3  |  4  |  5
----+-----+-----+-----+-----+-----
 1  |     |     |     |     |
 2  |     | baba|     |     |
 3  |     |     | flag|     |
 4  |     |     |     |     |
 5  |     |     |     |     |
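For a client that wants to reason over this output programmatically, a minimal sketch of a parser is given below, together with a helper that builds the comma-separated string `execute_commands` expects. It assumes the text has exactly the layout of the example above (one header row, one separator row, one object per cell); the real output may differ. The function names and the set of accepted directions are assumptions, since the source only shows "right", "up" and "down".

```python
SAMPLE = """\
y/x |  1  |  2  |  3  |  4  |  5
----+-----+-----+-----+-----+-----
 1  |     |     |     |     |
 2  |     | baba|     |     |
 3  |     |     | flag|     |
"""

# Assumption: "left" is accepted alongside the directions shown in the README.
DIRECTIONS = {"up", "down", "left", "right"}


def parse_game_state(text: str) -> dict[tuple[int, int], str]:
    """Map (x, y) to the cell content for every non-empty cell."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    cells: dict[tuple[int, int], str] = {}
    for line in lines[2:]:  # skip the header row and the ----+---- separator
        label, *row = line.split("|")
        y = int(label.strip())
        for x, cell in enumerate(row, start=1):
            if cell.strip():
                cells[(x, y)] = cell.strip()
    return cells


def to_command_string(moves: list[str]) -> str:
    """Join moves into the "right,up,down" form used by execute_commands."""
    unknown = [m for m in moves if m not in DIRECTIONS]
    if unknown:
        raise ValueError(f"unknown moves: {unknown}")
    return ",".join(moves)


if __name__ == "__main__":
    print(parse_game_state(SAMPLE))
    print(to_command_string(["right", "down"]))
```

Keying the result by coordinates rather than keeping the full grid keeps lookups such as "where is baba?" to a simple scan over a handful of entries.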
Contributions welcome! Goals are listed in order of urgency.
This works in the dumbest way possible: we reverse engineer the exposed Lua functions, use the mod functionality to read the game state, and write it to one of the game state config files using mod hooks; the MCP server then reads that file. For move and undo inputs, the MCP server writes Lua files to commands/, which the always mod hook reads in if it detects them. Perhaps this dooms the project to be brittle and slow forever, but perhaps there is some better way.
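The server-side half of that file handoff could look roughly like the sketch below. It is not the project's actual code: the file naming scheme (`0.lua`, `1.lua`, ...), the helper name `write_command_file`, and the Lua payload are all hypothetical, chosen only to show the idea of dropping Lua files into commands/ for the game's mod hook to find.

```python
import tempfile
from pathlib import Path


def write_command_file(commands_dir: Path, lua_source: str) -> Path:
    """Write one Lua snippet into commands/ under the next free index.

    Hypothetical naming scheme: files are numbered by how many .lua
    files already exist in the folder.
    """
    commands_dir.mkdir(parents=True, exist_ok=True)
    index = len(list(commands_dir.glob("*.lua")))
    target = commands_dir / f"{index}.lua"
    # Write to a temporary name first, then rename into place, so a reader
    # polling the folder does not pick up a partially written file.
    tmp = target.with_suffix(".tmp")
    tmp.write_text(lua_source, encoding="utf-8")
    tmp.replace(target)
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        # Hypothetical payload: a plain Lua table of moves.
        path = write_command_file(Path(root) / "commands", 'return {"right", "up"}')
        print(path.name, path.read_text(encoding="utf-8"))
```

Polling a folder like this is simple but adds latency on every move, which matches the "brittle and slow" concern raised above.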
Baba is You was created by Hempuli. It is a lovely game, go play it yourself!