Task Scoring

Individual Task Evaluation · Data Science Team

Task Scoring
Standard

A clear, fair, consistent way to recognize impact. Every task is evaluated on 4 weighted dimensions, then mapped to one or more performance pillars with contribution weights.

4·Dimensions

3·Work Types

100·Points total

Evaluation Framework

01 · Dimensions

4 Weighted Dimensions

Every task is scored 1–5 on each dimension. The weighted total (out of 100) determines the star rating.

40%

Quality / Correctness

How accurate, reliable, and complete the output is. Meets requirements and holds up to validation.

30%

Impact / Value

The value created for users, stakeholders, or the business. Adoption, decision influence, efficiency, cost.

20%

Timeliness

How well work was delivered against the agreed timeline. On-time, ahead, or behind schedule.

10%

Engineering Maturity

How well work is designed, documented, automated, and enables future use or maintenance.

Scope Change Fairness Rule: If a task is deprioritized, paused, or blocked by external dependencies, evaluation is based on progress made vs. plan, quality of work completed, and clarity of findings/handover — not on final impact or adoption.

02 · Star Guide

Star Rating Thresholds

After scoring all 4 dimensions and computing the weighted total (out of 100), the result maps to a star rating.

3 Star

70 – 79

out of 100

4 Star

80 – 89

out of 100

5 Star

90 – 100

out of 100

Scores 1–2 (below 70) indicate significant gaps requiring targeted improvement.

03 · Rubric

Scoring Guidelines by Work Type

Select the work type that best describes the task, then apply the rubric to score each dimension.

Production / Delivery

Dashboards, pipelines, models in production, APIs, scheduled jobs

Score	Quality / Correctness (40%)	Timeliness (20%)	Impact / Value (30%)	Engineering Maturity (10%)
5	Exceeds requirements. Highly accurate, stable, no rework needed.	Significantly ahead of schedule with buffer.	High adoption (≥90%) and measurable positive business impact.	Introduced automation, reusable components, or scalable solution.
4	Meets requirements with high quality. Minor improvements only.	Ahead of schedule or optimized delivery time well.	Meaningful value added — improved efficiency, usability, or data quality.	Well-documented, clean and maintainable implementation.
3	Requirements fully met. Correct and stable output. No major issues.	Delivered on time (within agreed deadline).	Value delivered as expected; useful to intended users.	Basic documentation and maintainable implementation.

04 · Formula

Aggregation Formula & Example

Pillar scores are weighted averages of task scores. Simple averaging would assume equal task relevance — the weighted approach reflects actual contribution.

Pillar Score Formula

Pillar Score (1–5) =

Σ (Task Score × Pillar Weight)

Σ (Pillar Weights)

where weights are each task's % contribution to this specific pillar

Worked Example — Calculating the Delivery Score

Task	Task Score	Delivery Weight	Score × Weight
ML Pipeline Automation	4.5	40%	4.5 × 0.40 = 1.80
Incident Root Cause Resolution	5.0	20%	5.0 × 0.20 = 1.00
Stakeholder Dashboard Project	4.0	50%	4.0 × 0.50 = 2.00

Sum of Weighted Scores

1.80 + 1.00 + 2.00 = 4.80

Sum of Weights

0.40 + 0.20 + 0.50 = 1.10

Delivery Score

4.80 ÷ 1.10 = 4.36 / 5

Why not a simple average? (4.5 + 5.0 + 4.0) ÷ 3 = 4.50. Assumes every task contributed equally to Delivery. The weighted result (4.36) reflects the actual relevance of each task to this specific pillar — more accurate, more fair.

05 · How to Use

Step-by-Step Guide

Classify the Work Type

Identify whether the task is Production/Delivery, POC/Research, or Operational/Support. This determines which rubric applies.

Score Each Dimension 1–5

Using the rubric for your work type, score Quality, Timeliness, Impact/Value, and Engineering Maturity independently.

Multiply by Weight

Quality × 40, Impact × 30, Timeliness × 20, Engineering × 10. Sum those four to get your total out of 100.

Map to Star Rating

70–79 = 3★, 80–89 = 4★, 90–100 = 5★. This task score becomes evidence that feeds into the 7-pillar Performance Evaluation.

Map Task to Performance Pillars

Assign each task to the relevant performance pillars with contribution weights (must total 100% across all pillars for that task).

Task scores are evidence. Pillar scores are judgment-backed.

Together, they create fair, balanced, and meaningful performance evaluations.

Task ScoringStandard

4 Weighted Dimensions

Quality / Correctness

Impact / Value

Timeliness

Engineering Maturity

Star Rating Thresholds

Scoring Guidelines by Work Type

Aggregation Formula & Example

Step-by-Step Guide

Task Scoring
Standard