askill
haiku-prompting

haiku-promptingSafety 95Repository

Claude Haiku 4.5 behavioral flaw catalog with prompt-level mitigations, guard calibrations, and cost-optimization techniques. Use when writing prompts, constraints, or methodology for Haiku-powered subagents — ensures output quality by addressing reasoning limits, instruction fragility, hallucination, and other documented failure modes.

1 stars
1.2k downloads
Updated 2/13/2026

Package Files

Loading files...
SKILL.md

Haiku 4.5 Prompting Guide

Prompt engineering patterns that exploit Haiku 4.5's strengths (speed, cost, routine tool use) while guarding against its documented behavioral flaws. Complements generic prompting skills with model-specific mitigations.

Scope boundary: This skill covers how to prompt Haiku effectively. For model selection (when to use Haiku vs Sonnet vs Opus), apply model-selection. For generic prompt structure patterns, apply prompting-guide. For reasoning directive quality (phrasing, timing, chain length), apply reasoning-calibration. For prompt file mechanics (.prompt.md format), apply prompt-file-design. For Sonnet-specific guards, apply sonnet-prompting.


When to Use This Skill

  • Writing or reviewing <constraints> for a Haiku-powered subagent
  • Designing methodology sections where Haiku will execute routine tasks
  • Diagnosing poor output quality from a Haiku-powered subagent (hallucination, missed instructions, truncation)
  • Optimizing prompt structure for Haiku's processing characteristics (speed, cost, capacity tradeoffs)
  • Deciding which guard sections to include for a given Haiku task type
  • Calibrating existing Sonnet guard patterns for Haiku's tighter operating envelope

Haiku 4.5 Flaw Catalog

Nine documented behavioral flaws from benchmark data, practitioner reports, and observed workspace behavior. Each is assigned a severity and primary mitigation. Flaws are categorized by their relationship to the Sonnet 4.5 flaw catalog (F1-F10).

Amplified Sonnet Flaws (worse in Haiku)

IDFlawSeveritySonnet Equiv.DeltaDescriptionPrimary Guard
H1Reasoning depth wallCRITICALF6 (3-4 hops)50-66% worseReliable at 1-2 hops only. Beyond 2, produces wrong conclusions, skipped steps, or premature "done". Any "if X then analyze Y then decide Z" chain exceeds capacity.Task decomposition to ≤2 hops
H2Instruction fragilityHIGHF4 (drift)Faster onsetDrops IMPORTANT-tier constraints after 3-5 tool calls (vs Sonnet's ~10). CRITICAL constraints hold longer but the safe window is narrower.Constraint compression — ≤3 CRITICAL
H3Synthesis incapacityHIGHF1+F10New failure classCannot connect disparate findings. Extracts data point A and B individually but fails at "A implies B because C." Tends to list findings without drawing conclusions.Restrict to extraction, not analysis
H4Tool chain breakdownHIGHF3 (premature)WorseWhen task requires multi-tool sequences (grep → read → analyze), degrades at step 3+. Produces premature conclusions from step 1-2 results, skipping deeper investigation.Single-tool-per-invocation design
H5Verification rubber-stampHIGHF1+F8Significantly worseNear-zero capacity for critical evaluation. If caller says "verify X works," Haiku confirms even with contradictory evidence. Cannot balance expectations vs evidence.Never assign verification tasks
H6Output truncationMEDIUMF2 (lazy)Moderately worseSummarizes prematurely or drops tail content on long outputs. Cuts off earlier than Sonnet. Silent truncation without acknowledgment.Explicit truncation markers
H7Hallucination under ambiguityMEDIUMF7 (tool halluc.)WorseWhen target not found, invents plausible file paths, function names, or parameter values rather than reporting absence. Less capacity to "know it doesn't know."Mandatory NOT FOUND protocol

New Haiku-Specific Flaws (not present in Sonnet)

IDFlawSeverityDescriptionPrimary Guard
H8Format compliance erosionMEDIUMFollows structured output formats well on short tasks, but compliance degrades faster than Sonnet as task complexity grows. JSON schemas, tables, and markdown break under multi-step pressure.Format re-anchor before output
H9Overly literal interpretationLOW-MEDInterprets instructions literally, missing intent. "Find the test file for this module" returns only exact filename matches rather than understanding project conventions. Lacks contextual inference.Explicit search strategy instructions

Haiku Operating Envelope

Understanding Haiku's safe operating parameters prevents assigning tasks that exceed its capacity.

ParameterHaiku 4.5Sonnet 4.5Implication
Reliable reasoning hops1-23-4T0 tasks only (reasoning-strategy)
CRITICAL constraint cap≤3≤5Tighter constraint budget
Tool calls before drift3-5~10Shorter agent sessions
Structured output reliabilityShort tasks onlySustainedRe-anchor format on multi-step
Synthesis / connectionNoneBasicExtract only, no analysis
Self-correctionNoneLimitedParent must verify all output
Speed4-5x SonnetBaselineReal advantage for high-volume work
Cost0.33x SonnetBaseline3 Haiku calls ≈ 1 Sonnet call

Key insight: Haiku's operating envelope is T0 (Direct) on the reasoning-strategy tier model. Any task requiring T1+ reasoning should use Sonnet minimum.


Methodology

Phase 1: Verify Task Fitness

Before writing a prompt for a Haiku-powered subagent, confirm the task fits Haiku's envelope:

Task CharacteristicHaiku FitIf No →
Requires 0-1 reasoning hops?✅ ProceedUse Sonnet
Single tool type per invocation?✅ ProceedUse Sonnet
No evaluation/judgment needed?✅ ProceedUse Sonnet
Output is data, not analysis?✅ ProceedUse Sonnet
Session ≤5 tool calls?✅ ProceedAdd constraint anchor or use Sonnet
No synthesis across findings?✅ ProceedUse Sonnet

If ANY answer is "No" → either redesign the task to fit, or upgrade to Sonnet. No prompt trick compensates for a capability gap.

Phase 2: Assess Risk Profile

If the task fits, identify which Haiku flaws it's exposed to:

Task CharacteristicExposed FlawsRequired Guards
File reading / extractionH7, H9NOT FOUND protocol, search strategy
Command executionH6, H7Truncation markers, no-guess directive
Search / grep operationsH7, H9Literal interpretation guard, widening strategy
Multi-file parallel readsH2, H8Constraint compression, format re-anchor
Doc navigationH9, H3Explicit path instructions, no synthesis
Data transformationH6, H8Output scope lock, format anchor

Phase 3: Apply Guard Patterns

Inject guards matching the risk profile. Haiku has a tighter constraint budget than Sonnet — every guard must earn its place.

Priority 1 — Always include for Haiku subagents:

  1. Anti-hallucination (guards H7): The single most important Haiku guard.

    If target not found, respond "NOT FOUND: {what was searched}".
    NEVER guess, infer, or fabricate file paths, function names, or values.
    
  2. Constraint compression: Keep CRITICAL tier to ≤3 items. Haiku's adherence drops sharply beyond 3. Move overflow to IMPORTANT.

Priority 2 — Include based on risk profile:

  1. Output scope lock (guards H6): Required when output may be long.

    If output would exceed {N} lines, summarize with "[TRUNCATED: {count} additional items]".
    Never silently drop content.
    
  2. Search strategy (guards H9): Required for search/discovery tasks.

    Search exact match FIRST, then broaden with pattern/glob.
    If zero results, try {alternative strategies} before reporting NOT FOUND.
    
  3. Format re-anchor (guards H8): Required for multi-step tasks with structured output.

    Re-read the output_format section before writing your final response.
    

Priority 3 — Include for edge cases:

  1. Constraint anchor (guards H2): If session may reach 4-5 tool calls.
    ⚠️ Re-read CRITICAL constraints before proceeding.
    

Phase 4: Calibrate from Sonnet Guards

When adapting a Sonnet prompt for Haiku, apply these calibration rules:

Sonnet GuardHaiku Calibration
<anti-sycophancy>Remove — assign evaluation tasks to Sonnet instead (H5)
<completeness>Replace with output scope lock — Haiku truncates rather than placeholders
<scope-fence>Simplify — Haiku rarely makes bold changes (insufficient capacity for F5)
<constraint-anchor>Lower threshold — trigger at 3 tool calls, not 10
<reasoning_guidance>Remove — if task needs reasoning, upgrade to Sonnet (H1)
CRITICAL ≤5 capTighten to ≤3 (H2)
Convention anchoringUnnecessary — Haiku doesn't generate code (H1 prevents it)

Phase 5: Validate Prompt

CheckWhat to VerifyFix If Failing
CRITICAL cap≤3 items in CRITICAL tier?Move overflow to IMPORTANT
Task fitnessTask fits Haiku operating envelope (Phase 1)?Redesign or upgrade model
Guard coverageAll high-risk flaws from Phase 2 have guards?Add missing guards
No evaluationNo judgment/verification/analysis expected?Remove or upgrade to Sonnet
Output formatStructured format with concrete example?Add format with example
NOT FOUND protocolAnti-hallucination guard present?Always add — non-negotiable

Quick Reference: Guard Injection Patterns

Minimal guard blocks for Haiku subagent prompts. Each targets a specific flaw.

Anti-Hallucination (H7) — MANDATORY, 2 lines

If target not found, respond "NOT FOUND: {what was searched}".
NEVER guess, infer, or fabricate file paths, function names, or values.

Output Scope Lock (H6) — 2 lines

If output would exceed {N} lines, summarize with "[TRUNCATED: {count} items omitted]".
Never silently drop content — acknowledge what was omitted.

Constraint Compression (H2) — design rule

Keep CRITICAL constraints to ≤3 items.
Merge related constraints. Move all overflow to IMPORTANT tier.

Search Strategy (H9) — 3 lines

Search exact match FIRST, then broaden with glob/regex patterns.
If zero results after broadening, try alternative names or paths.
Report all search attempts before concluding NOT FOUND.

Format Re-Anchor (H8) — 1 line

Re-read the <output_format> section before writing your final response.

Constraint Anchor (H2) — 1 line, place at tool call 3

⚠️ Re-read CRITICAL constraints before proceeding.

Synthesis Fence (H3) — 2 lines

Report findings as extracted data only.
Do NOT draw conclusions, infer causality, or recommend actions.

Haiku vs Sonnet: Guard Selection Matrix

Side-by-side comparison for deciding which guards apply to which model:

Guard SectionSonnetHaikuWhy Different
<anti-sycophancy>Required for eval tasksNever — don't assign eval to HaikuH5 makes it pointless
<completeness>Required for code genReplace with output scope lockHaiku truncates, doesn't placeholder
<scope-fence>Required for write tasksSimplify or removeHaiku lacks capacity for bold changes
<constraint-anchor>At 10+ tool callsAt 3+ tool callsH2 faster drift onset
<reasoning_guidance>For structured thinkingNever — upgrade model insteadH1 makes reasoning guards futile
<anti-hallucination>Optional (verify-first)Always mandatoryH7 is Haiku's #1 risk
<output-scope-lock>Rarely neededOften neededH6 earlier truncation
<search-strategy>Rarely neededFor search tasksH9 literal interpretation
<synthesis-fence>Not applicableFor multi-finding tasksH3 prevents analysis
<format-re-anchor>Not neededFor multi-step tasksH8 format erosion

Anti-Patterns

  • Assigning evaluation tasks — "Review this code" or "verify the fix" on Haiku. H5 (rubber-stamp) means it will confirm anything. Use Sonnet for all evaluation.
  • Expecting synthesis — "Analyze these 5 files and explain the pattern." Haiku can extract from each file individually but cannot connect findings (H3). Use Sonnet for synthesis.
  • Long tool chains — Tasks requiring 4+ sequential tool calls where each depends on the previous. Haiku degrades at step 3+ (H4). Redesign as independent parallel reads or upgrade to Sonnet.
  • Overloading CRITICAL — More than 3 CRITICAL items. Haiku's constraint adherence drops sharply beyond 3, faster than Sonnet's 5-item cliff. Keep CRITICAL tight.
  • Reasoning guards without model upgrade — Adding <reasoning_guidance> to a Haiku prompt. No prompt trick fixes a 1-2 hop ceiling (H1). If the task needs reasoning, upgrade to Sonnet.
  • Trusting Haiku output without parent verification — Parent agents (Sonnet/Opus) must always verify Haiku subagent outputs. Haiku has near-zero self-correction capacity.
  • Silent failure assumption — Expecting Haiku to report "I can't do this." Haiku will attempt any task and produce plausible-looking but potentially wrong output (H7). Design for failure at the architecture level.
  • Using Sonnet guard patterns unmodified — Sonnet guards assume 3-4 hop reasoning, 5-item CRITICAL cap, and 10+ call sessions. All three are wrong for Haiku. Always calibrate per Phase 4.

Resources

  • template.md — Haiku-specific calibration rules and flaw-to-section mapping
  • examples.md — Concrete guard examples per flaw with before/after comparisons

Install

Download ZIP
Requires askill CLI v1.0+

AI Quality Score

90/100Analyzed 2/18/2026

Excellent technical reference skill documenting Haiku 4.5 behavioral flaws and prompt-level mitigations. Comprehensive flaw catalog (H1-H9) with severity ratings, operating envelope parameters, 5-phase methodology, and reusable guard injection patterns. Highly actionable with concrete thresholds and templates. Well-structured with tables and clear sections. Ends abruptly but core content is complete. Bonus rules applied: has When to Use section, structured steps, tags, skills folder location, and high-density technical reference content.

95
90
80
85
95

Metadata

Licenseunknown
Version-
Updated2/13/2026
PublisherfaroukBakari

Tags

promptingtesting