askill
score-analyzer

score-analyzerSafety 95Repository

Analyze multi-round evaluation result scoring data, calculate various metrics, calculate rating grades. Suitable for analyzing scoring trends, calculating S/A/B ratings

25 stars
1.2k downloads
Updated 2/15/2026

Package Files

Loading files...
SKILL.md

Score Analysis Agent

Functionality

Analyze scoring data from multiple evaluation rounds, calculate various scoring metrics, and calculate rating grades (S Strong Focus/A Suggested Focus/B Ordinary).

Usage Scenarios

  • Quickly grasp overall performance and trends of multi-round evaluation results
  • Provide decision support for project approval and IP adaptation based on quantitative data
  • Identify high-potential works for S/A/B grading
  • Assist evaluators in analyzing scoring deviations to optimize evaluation processes

Rating Grade Definitions

  • S Grade (Strong Focus): At least one 8.5 score or at least eight 8.0 scores accumulated
  • A Grade (Suggested Focus): At least five 8.0 scores accumulated
  • B Grade (Ordinary): Does not meet A Grade standards

Statistical Metrics

  • Evaluation Count: Total evaluation rounds
  • Valid Score Count: Evaluation rounds with valid scoring data
  • First Score: Record first evaluation score
  • Highest Score: Record highest score among all evaluations
  • Lowest Score: Record lowest score among all evaluations
  • Average Score: Calculate average score of all evaluations
  • Trimmed Mean Score: Average score after removing highest and lowest scores
  • High Score Statistics: Count occurrences in each score range (8.5 and above, 8.0-8.4, etc.)

Core Steps

Receive multi-round evaluation results
    ↓
Extract all scoring data
    ↓
Calculate various scoring metrics
    ↓
Calculate rating grades
    ↓
Generate comprehensive evaluation report
    ↓
Output structured results

Input Requirements

  • Evaluation Results: Structured text containing multi-round evaluation scoring data (recommend at least 10 evaluation results)
  • Scoring Dimensions: Clear dimensions and standards for scoring
  • Special Requirements (optional): Any specific statistical or analysis requirements

Output Format

[Score Analysis Report]

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
I. Evaluation Overview
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
- Evaluation Count: [count]
- Valid Scores: [count]
- Rating Grade: [S/A/B]

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
II. Score Statistics
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
- First Score: [score]
- Highest Score: [score]
- Lowest Score: [score]
- Average Score: [score]
- Trimmed Mean: [score]

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
III. Score Sequence
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
1. [score]
2. [score]
...

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
IV. High Score Statistics
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
- 8.5 and above: [count]
- 8.0-8.4: [count]
- 7.5-7.9: [count]
- 7.4 and below: [count]

Constraints

  • Input evaluation data must contain clear scores for statistical analysis
  • Report content must be objective and fair, based on data generation without subjective judgment
  • Ensure calculation results are accurate

Examples

See {baseDir}/references/examples.md for more detailed examples:

  • examples.md - Contains detailed analysis report examples for different evaluation results (multiple high scores, stable average scores, high score fluctuation, etc.)

Detailed Documentation

See {baseDir}/references/examples.md for detailed guidance and cases on score analysis.


Version History

VersionDateChanges
2.1.02026-01-11Optimized description field; changed model to opus; optimized descriptions for functionality, usage scenarios, rating grade definitions, statistical metrics, core steps, input requirements, and output format; added constraints, examples, and detailed documentation sections
2.0.02026-01-11Restructured according to official specifications
1.0.02026-01-10Initial version

Install

Download ZIP
Requires askill CLI v1.0+

AI Quality Score

65/100Analyzed 2/24/2026

Well-structured technical reference specification for score analysis with comprehensive metrics definitions, rating grade criteria (S/A/B), and detailed output templates. However, it lacks actual implementation code or executable logic - it's a specification document rather than an actionable skill. The content is highly detailed and reusable as a reference but requires the user to build the actual analysis functionality. Scores well on clarity, completeness, and safety but low on actionability due to no implementation provided.

95
80
60
75
35

Metadata

Licenseunknown
Version2.1.0
Updated2/15/2026
PublisherGongLingRui

Tags

observability