askill
context-compression

context-compressionSafety 100Repository

This skill should be used when the user asks to "compress context", "summarize conversation history", "implement compaction", "reduce token usage", or mentions context compression, structured summarization, or long-running agent sessions exceeding context limits.

1 stars
1.2k downloads
Updated 2/5/2026

Package Files

Loading files...
SKILL.md

Context Compression Strategies

When agent sessions generate millions of tokens, compression becomes mandatory. The naive approach is aggressive compression to minimize tokens per request. The correct optimization target is tokens per task: total tokens consumed to complete a task, including re-fetching costs when compression loses critical information.

When to Activate

Activate this skill when:

  • Agent sessions exceed context window limits
  • Codebases exceed context windows (5M+ token systems)
  • Designing conversation summarization strategies
  • Debugging cases where agents "forget" what files they modified

Core Approaches

1. Anchored Iterative Summarization (Recommended)

Maintain structured, persistent summaries with explicit sections. When compression triggers, summarize only newly-truncated span and merge with existing summary.

Key insight: Structure forces preservation. Dedicated sections act as checklists.

2. Opaque Compression

Compressed representations optimized for reconstruction fidelity. Achieves 99%+ compression but sacrifices interpretability.

3. Regenerative Full Summary

Generate detailed summaries on each compression. Readable but may lose details across repeated compression cycles.

The Artifact Trail Problem

Artifact trail integrity is universally weak (2.2-2.5 out of 5.0). Coding agents need to know:

  • Which files were created/modified
  • What changed in each file
  • Function names, variable names, error messages

Solution: Separate artifact index or explicit file-state tracking in agent scaffolding.

Structured Summary Sections

## Session Intent
[What the user is trying to accomplish]

## Files Modified
- auth.controller.ts: Fixed JWT token generation
- config/redis.ts: Updated connection pooling

## Decisions Made
- Using Redis connection pool instead of per-request
- Retry logic with exponential backoff

## Current State
- 14 tests passing, 2 failing

## Next Steps
1. Fix remaining test failures
2. Run full test suite

Compression Triggers

StrategyTrigger PointTrade-off
Fixed threshold70-80% utilizationSimple but may compress too early
Sliding windowKeep last N turns + summaryPredictable context size
Importance-basedCompress low-relevance firstComplex but preserves signal
Task-boundaryCompress at logical completionsClean summaries

Compression Performance

MethodCompression RatioQuality Score
Anchored Iterative98.6%3.70
Regenerative98.7%3.44
Opaque99.3%3.35

The 0.7% additional tokens retained by structured summarization buys 0.35 quality points—worth it when re-fetching costs matter.

Guidelines

  1. Optimize for tokens-per-task, not tokens-per-request
  2. Use structured summaries with explicit file tracking sections
  3. Trigger compression at 70-80% context utilization
  4. Implement incremental merging rather than full regeneration
  5. Track artifact trail separately if file tracking is critical
  6. Monitor re-fetching frequency as a compression quality signal

Created: 2025-12-22 | Version: 1.1.0

Install

Download ZIP
Requires askill CLI v1.0+

AI Quality Score

85/100Analyzed 2/25/2026

High-quality technical skill on context compression with clear activation triggers, multiple implementation strategies with performance data, structured summary templates, and actionable guidelines. Well-organized with tables and examples. The .github/skills/ path suggests some GitHub-specific framing but content is broadly applicable to LLM/agent systems.

100
85
75
85
80

Metadata

Licenseunknown
Version-
Updated2/5/2026
Publisherbthillerup

Tags

securitytesting