Partition-first log analysis methodology. Use for log searches, error analysis, pattern finding across Datadog, CloudWatch, or Kubernetes logs.
353
AI 88
observability
incidentfox2/16/2026
Log, metric, and trace analysis methodology. Use when analyzing logs, investigating errors, querying metrics, or correlating signals across observability backends (Coralogix, Datadog, CloudWatch).
293
aws-troubleshoot
incidentfox2/15/2026
AWS service troubleshooting patterns. Use for EC2, ECS, Lambda, CloudWatch, RDS issues.
291
AI 78
remediation
incidentfox2/15/2026
Safe remediation actions for Kubernetes. Use when proposing or executing pod restarts, deployment scaling, or rollbacks. Always use dry-run first.
291
infrastructure
incidentfox2/13/2026
Infrastructure debugging for Kubernetes and AWS. Use when investigating pod crashes, deployment issues, resource problems, container failures, or cloud infrastructure issues.
287
AI 52
deployment-correlation
incidentfox2/7/2026
Correlate incidents with recent deployments and code changes. Use when investigating if a deployment caused an issue, finding what changed, or identifying the commit that introduced a bug.
263
AI 95
k8s-debug
incidentfox2/7/2026
Kubernetes debugging patterns. Use for pod crashes, CrashLoopBackOff, OOMKilled, ImagePullBackOff, scheduling failures, deployment issues.
263
AI 92
knowledge-base
incidentfox2/7/2026
Search runbooks, documentation, and knowledge base articles from Confluence. Use when looking for incident response procedures, service documentation, post-mortems, or troubleshooting guides.
263
AI 92
investigate
incidentfox2/7/2026
Systematic incident investigation methodology. Use when investigating production issues, service degradation, errors, latency spikes, or outages.
263
metrics-analysis
incidentfox2/7/2026
Prometheus/Grafana metrics analysis and PromQL queries. Use when investigating latency, error rates, resource usage, or any time-series metrics.