arXiv Research Skill

Overview

This skill enables systematic academic research through three core capabilities that form the minimal complete loop of knowledge building:

connect -> understand -> evidence
  Find  ->  Comprehend -> Cite

Setup

Install dependencies: uv sync

The Three Pillars

1. Connect (Knowledge Navigation)

Purpose: Find relevant existing knowledge

When to use:

Starting research on a new topic
Finding related work for a paper
Discovering what exists in a field

Capabilities:

Semantic search across arXiv
Filter by category, author, date
Rank by citation impact (via Semantic Scholar)
Find similar papers to a known paper

Usage:

uv run python scripts/connect.py search "transformer attention mechanism" --category cs.LG --limit 20
uv run python scripts/connect.py search "LLM agents" --since 2023-01 --until 2024-06  # Date filtering
uv run python scripts/connect.py similar "2301.00001" --limit 10
uv run python scripts/connect.py recent cs.AI --days 7
uv run python scripts/connect.py by-author "Yann LeCun"
uv run python scripts/connect.py cited-by "2301.00001" --limit 20  # Forward citations
uv run python scripts/connect.py coauthors "Yann LeCun" --limit 20  # Collaboration network

2. Understand (Meaning Extraction)

Purpose: Comprehend what the knowledge contains

When to use:

Need to quickly grasp a paper's contribution
Extracting methodology details
Comparing multiple papers
Writing literature review sections

Capabilities:

Structured paper analysis (problem, method, contribution, limitations)
Key findings extraction
Methodology breakdown
Multi-paper comparison

Usage:

# Get paper content for analysis (single or batch)
uv run python scripts/connect.py content "2301.00001"
uv run python scripts/connect.py content "2301.00001,2302.00002,2303.00003"

# Pipe content into analysis prompts
uv run python scripts/connect.py content "2301.00001" | uv run python scripts/understand.py analyze quick

Available prompts: uv run python scripts/understand.py list

Prompts: quick, methodology, contribution, critical, compare, literature, implementation, evidence

3. Evidence (Source Attribution)

Purpose: Create verifiable links to sources

When to use:

Writing academic papers
Need proper citations
Building bibliography
Ensuring traceability of claims

Capabilities:

BibTeX generation
Multiple citation formats (APA, IEEE, ACM, Chicago, RIS)
Batch citation export
RIS export for Zotero/Mendeley/EndNote

Usage:

uv run python scripts/evidence.py bibtex "2301.00001"
uv run python scripts/evidence.py apa "2301.00001"
uv run python scripts/evidence.py ris "2301.00001"  # For Zotero/Mendeley
uv run python scripts/evidence.py batch "2301.00001,2302.00002,2303.00003" --format bibtex
uv run python scripts/evidence.py batch "2301.00001,2302.00002" --format ris > refs.ris

4. TikZ (Figure Extraction)

Purpose: Extract TikZ source code from arXiv paper LaTeX sources

When to use:

Reusing or adapting figures from papers
Analyzing visualization techniques
Understanding diagram construction

Capabilities:

Extracts tikzpicture, tikzcd, circuitikz, pgfplots environments
Captures captions, labels, and library dependencies
Outputs as pure TikZ, compilable LaTeX, JSON, or brief summary

Usage:

uv run python scripts/tikz.py extract "2301.00001"
uv run python scripts/tikz.py extract "2301.00001" --format latex > figures.tex
uv run python scripts/tikz.py extract "2301.00001,2302.00002" --format json
uv run python scripts/tikz.py list "2301.00001"
uv run python scripts/tikz.py extract "2301.00001" --format tikz | uv run python scripts/understand.py analyze quick

Workflow Examples

Literature Review Workflow

Progress:
- [ ] Step 1: Find seed papers
- [ ] Step 2: Expand with similar papers
- [ ] Step 3: Analyze each paper
- [ ] Step 4: Generate bibliography

# Step 1: Find seed papers (ranked by citation impact)
uv run python scripts/connect.py search "your topic" --limit 50 --with-citations --sort citations

# Step 2: Expand with similar papers from top results
uv run python scripts/connect.py similar "top_paper_id"

# Step 3: Analyze each paper
uv run python scripts/connect.py content "paper_id" | uv run python scripts/understand.py analyze literature

# Step 4: Generate bibliography
uv run python scripts/evidence.py batch "id1,id2,id3" --format bibtex > refs.bib

Finding Evidence for a Claim

# 1. Search for supporting research
uv run python scripts/connect.py search "your claim keywords" --with-citations

# 2. Verify the paper supports your claim
uv run python scripts/connect.py content "paper_id" | uv run python scripts/understand.py analyze evidence

# 3. Generate proper citation
uv run python scripts/evidence.py apa "paper_id"

API Dependencies

Service	Purpose	Rate Limit	API Key Required
arXiv	Paper search, content	1 req/3s	No
Semantic Scholar	Citations, similar papers	100 req/5min	No (optional for higher limits)
Jina Reader	Full text extraction	Generous	No

File Structure

arxiv-research-skill/
├── SKILL.md              # This file - usage guide
└── scripts/
    ├── connect.py        # Knowledge navigation
    ├── understand.py     # Analysis prompts
    ├── evidence.py       # Citation generation
    ├── tikz.py           # TikZ figure extraction
    ├── cache.py          # SQLite caching (~/.cache/arxiv-research/papers.db)
    └── utils.py          # Shared utilities (extractPaperId, cleanText)

Common Patterns

Finding Foundational Papers

uv run python scripts/connect.py search "topic" --sort citations --limit 10

Tracking Recent Developments

uv run python scripts/connect.py recent cs.AI --days 30

Building a Reading List

uv run python scripts/connect.py search "topic" > papers.json
# Review and filter
uv run python scripts/evidence.py batch "selected_ids" --format bibtex

Error Handling

Rate limited: Wait and retry, scripts have built-in backoff
Paper not found: Verify arXiv ID format (YYMM.NNNNN)
No citations: Paper may be too new for Semantic Scholar

arxiv-researchSafety 95Repository

Package Files

arXiv Research Skill

Overview

Setup

The Three Pillars

1. Connect (Knowledge Navigation)

2. Understand (Meaning Extraction)

3. Evidence (Source Attribution)

4. TikZ (Figure Extraction)

Workflow Examples

Literature Review Workflow

Finding Evidence for a Claim

API Dependencies

File Structure

Common Patterns

Finding Foundational Papers

Tracking Recent Developments

Building a Reading List

Error Handling

Install

AI Quality Score

Metadata

Tags

arxiv-researchSafety 95Repository ShareFavorite skill

Package Files

arXiv Research Skill

Overview

Setup

The Three Pillars

1. Connect (Knowledge Navigation)

2. Understand (Meaning Extraction)

3. Evidence (Source Attribution)

4. TikZ (Figure Extraction)

Workflow Examples

Literature Review Workflow

Finding Evidence for a Claim

API Dependencies

File Structure

Common Patterns

Finding Foundational Papers

Tracking Recent Developments

Building a Reading List

Error Handling

Install

AI Quality Score

Metadata

Tags

arxiv-researchSafety 95Repository