Recursive Language Model (RLM) Skill
Core Philosophy
"Context is an external resource, not a local variable."
When this skill is active, you are the Root Node of a Recursive Language Model system. Your job is NOT to read code, but to write programs (plans) that orchestrate sub-agents to read code.
Protocol: The RLM Loop
Phase 1: Choose Your Engine
Decide based on the nature of the data:
| Engine | Use Case | Tool |
|---|---|---|
| Native Mode | General codebase traversal, finding files, structure. | find, grep, bash |
| Strict Mode | Dense data analysis (logs, CSVs, massive single files). | python3 skills/rlm/rlm.py |
Phase 2: Index & Filter (The "Peeking" Phase)
Goal: Identify relevant data without loading it.
- Native: Use
findorgrep -l. - Strict: Use
python3 .../rlm.py peek "query".- RLM Pattern: Grepping for import statements, class names, or definitions to build a list of relevant paths.
Phase 3: Parallel Map (The "Sub-Query" Phase)
Goal: Process chunks in parallel using fresh contexts.
- Divide: Split the work into atomic units.
- Strict Mode:
python3 .../rlm.py chunk --pattern "*.log"-> Returns JSON chunks.
- Strict Mode:
- Spawn: Use
Taskto launch parallel agents.- Constraint: Launch at least 3-5 agents in parallel for broad tasks.
- Prompting: Give each background agent ONE specific chunk or file path.
- Format:
Task(agent="explore", prompt="Analyze chunk #5 of big.log: {content}...")
Phase 4: Reduce & Synthesize (The "Aggregation" Phase)
Goal: Combine results into a coherent answer.
- Collect: Read the outputs from
Task(viaTask output). - Synthesize: Look for patterns, consensus, or specific answers in the aggregated data.
- Refine: If the answer is incomplete, perform a second RLM recursion on the specific missing pieces.
Critical Instructions
- NEVER use
cat *or read more than 3-5 files into your main context at once. - ALWAYS prefer
Taskfor reading/analyzing file contents when the file count > 1. - Use
rlm.pyfor programmatic slicing of large files thatgrepcan't handle well. - Python is your Memory: If you need to track state across 50 files, write a Python script (or use
rlm.py) to scan them and output a summary.
Example Workflow: "Find all API endpoints and check for Auth"
Wrong Way (Monolithic):
read src/api/routes.tsread src/api/users.ts- ... (Context fills up, reasoning degrades)
RLM Way (Recursive):
- Filter:
grep -l "@Controller" src/**/*.ts-> Returns 20 files. - Map:
Task(prompt="Read src/api/routes.ts. Extract all endpoints and their @Auth decorators.")Task(prompt="Read src/api/users.ts. Extract all endpoints and their @Auth decorators.")- ... (Launch all 20)
- Reduce:
- Collect all 20 outputs.
- Compile into a single table.
- Identify missing auth.
Recovery Mode
If Task is unavailable or fails:
- Fall back to Iterative Python Scripting.
- Write a Python script that loads each file, runs a regex/AST check, and prints the result to stdout.
- Read the script's stdout.
Checklist
- Chose appropriate engine (Native or Strict) for the data type.
- Indexed and filtered before loading any file content.
- Spawned parallel Task agents (3-5 minimum for broad tasks).
- Each agent received ONE specific chunk or file path.
- Collected and synthesized all agent outputs.
- Refined with a second RLM pass if answer was incomplete.
- Never loaded more than 3-5 files directly into main context.
Cross-Skill Integration
| Situation | Skill to invoke | How |
|---|---|---|
| RLM discovers architectural issues | architect skill | Read skills/architect/SKILL.md |
| RLM finds security concerns across files | security-reviewer skill | Read skills/security-reviewer/SKILL.md |
| Cross-file refactoring needed | refactoring skill | Read skills/refactoring/SKILL.md |
| Large-scale dependency analysis | dependencies skill | Read skills/dependencies/SKILL.md |
| Results need implementation plan | para skill | Read skills/para/SKILL.md, use /plan |
| Performance analysis across codebase | performance skill | Read skills/performance/SKILL.md |
