askill
crawl4ai

crawl4aiSafety 90Repository

Use when crawling web pages, extracting markdown content, or scraping website data with intelligent chunking and skeleton planning.

8 stars
1.2k downloads
Updated 2/14/2026

Package Files

Loading files...
SKILL.md

crawl4ai

High-performance web crawler with intelligent chunking. Crawls web pages and extracts content as markdown using LLM-based skeleton planning.

Commands

crawl_url (alias: webCrawl)

Crawl a web page with LangGraph workflow and LLM-based intelligent chunking.

Parameters:

ParameterTypeDefaultDescription
urlstr-Target URL to crawl (required)
actionstr"smart"Action mode: "smart", "skeleton", "crawl"
fit_markdownbooltrueClean and simplify markdown output
max_depthint0Maximum crawling depth (0=single page)
return_skeletonboolfalseAlso return document skeleton (TOC)
chunk_indiceslist[int]-List of section indices to extract

Action Modes:

ModeDescriptionUse Case
smart (default)LLM generates chunk plan, then extracts relevant sectionsLarge docs where you need specific info
skeletonExtract lightweight TOC without full contentQuick overview, decide what to read
crawlReturn full markdown contentSmall pages, complete content needed

Examples:

# Smart crawl with LLM chunking (default)
@omni("crawl4ai.CrawlUrl", {"url": "https://example.com"})

# Skeleton only - get TOC quickly
@omni("crawl4ai.CrawlUrl", {"url": "https://example.com", "action": "skeleton"})

# Full content crawl
@omni("crawl4ai.CrawlUrl", {"url": "https://example.com", "action": "crawl"})

# Extract specific sections
@omni("crawl4ai.CrawlUrl", {"url": "https://example.com", "chunk_indices": [0, 1, 2]})

# Deep crawl (follow links up to depth N)
@omni("crawl4ai.CrawlUrl", {"url": "https://example.com", "max_depth": 2})

# Get skeleton with full content
@omni("crawl4ai.CrawlUrl", {"url": "https://example.com", "return_skeleton": true})

Core Concepts

TopicDescriptionReference
Skeleton PlanningLLM sees TOC (~500 tokens) not full content (~10k+)smart-chunking.md
Chunk ExtractionToken-aware section extractionchunking.md
Deep CrawlingMulti-page crawling with BFS strategydeep-crawl.md

Best Practices

  • Use skeleton mode first for large documents to understand structure
  • Use chunk_indices to extract specific sections instead of full content
  • Set max_depth > 0 carefully - limits pages crawled to prevent runaway crawling
  • Keep fit_markdown=true for cleaner output, false for raw content

Advanced

  • Batch multiple URLs with separate calls
  • Combine with knowledge tools for RAG pipelines
  • Use skeleton + LLM to auto-generate chunk plans for custom extraction

Install

Download ZIP
Requires askill CLI v1.0+

AI Quality Score

95/100Analyzed 2/13/2026

An exemplary skill definition for a web crawler. It features comprehensive metadata for agent routing, detailed parameter tables, clear action modes, and practical code examples, making it highly effective for agentic use.

90
95
85
95
95

Metadata

Licenseunknown
Version-
Updated2/14/2026
Publishertao3k

Tags

github-actionsllm