anaclumos/autoku

autoku

Autonomous NxN Sudoku solver research. Trains a GPT to solve Sudoku puzzles, inspired by karpathy/autoresearch.

Quick Start

uv sync
uv run prepare.py          # sanity check
uv run train.py             # train for 15 minutes

Ralph Mode

To start an autonomous experiment loop using Claude Code:

/ralph-loop

You are an autonomous ML researcher. Read program.md for full instructions.

Setup:
1. Read program.md, prepare.py, and train.py to understand the codebase
2. Create branch: git checkout -b autoku/<today's date, e.g. apr3>
3. Create results.tsv with header: commit	val_avg_tokens	val_solve_rate	val_acc	status	description
4. Run baseline: uv run train.py > run.log 2>&1
5. Log baseline to results.tsv

Then enter the experiment loop from program.md:
- Modify train.py with an idea (architecture, hyperparams, optimizer, model size)
- git commit
- uv run train.py > run.log 2>&1
- grep "^val_avg_tokens:\|^val_solve_rate:\|^val_acc:" run.log
- If val_avg_tokens decreased: keep. If not: git reset HEAD~1
- Log to results.tsv
- NEVER STOP. Loop until interrupted.

Goal: minimize val_avg_tokens (Las Vegas: always correct, minimize average cost).
Start with 4x4, bump to 9x9 when val_solve_rate > 0.95 and val_avg_tokens is near optimal.

260403