Performance Evaluation
Framework
A structured, evidence-backed approach to fair and transparent 6-month performance reviews across the Data Science Team. Seven pillars. One consistent standard. No single category can carry the rest.
How This Framework Fits Together
Two complementary systems create a complete, evidence-backed performance picture.
Step 1 · Input
Task Scoring Standard
Rates individual tasks 1–5 stars across 4 weighted dimensions. Produces evidence-backed task scores that feed into pillar calculations.
- 4 weighted dimensions (Quality, Impact, Timeliness, Engineering)
- Work type guidelines (Production, POC/Research, Operational)
- Star rating output (1–5)
- Evidence & artifacts
Step 2 · This
Performance Evaluation
Evaluates overall contribution across 7 pillars over 6 months. Task scores are evidence — but not the only factor.
- Broader behaviors
- Sustained impact over the period
- Growth & influence
- Balanced contribution across all 7 pillars
Core Principle
No single category can compensate for another. We value balanced, sustainable, and well-rounded contribution.
The 7 Performance Pillars
Each pillar has a defined evidence type. Task-driven pillars flow from work output; independent pillars require separate evidence collection.
Delivery
Consistently delivers end-to-end solutions with clear SLAs and measurable business impact.
Evidence Sources
Technical Excellence
Advances MLOps, Agentic AI practices, automation, scalability and reduces operational toil.
Evidence Sources
Reliability
Ensures model & data quality, monitoring, stability and upholds privacy/ethics-by-design.
Evidence Sources
Collaboration
Translates business goals, communicates clearly, aligns stakeholders and influences decisions.
Evidence Sources
Leadership / Team Contribution
Raises team standards, creates assets, mentors others and improves team practices.
Evidence Sources
Development
Continuously improves skills, builds portfolio, learns, experiments and shares knowledge.
Evidence Sources
Company Culture
Demonstrates growth mindset, respectful behavior, and supports a blameless, psychologically safe culture.
Evidence Sources
How Performance Is Rated
All 7 pillars use the same 1–5 scale. Scores are assigned through calibrated manager judgment, supported by evidence.
Significant Concerns
Performance far below expectations.
Partially Meets
Inconsistent. Needs improvement.
Meets Expectations
Consistent, solid, dependable.
Exceeds Expectations
Clearly above expectations. Strong impact.
Exceptional Impact
Outstanding, sustained. Role model.
Threshold Rule
To receive an overall rating of 4 or 5, every critical pillar must score at least 3 (Meets Expectations). Company Culture is an explicit critical pillar — a score below 3 caps the overall rating at 3, regardless of all other scores.
Framework Guardrails
Built-in protections that prevent distortion, gaming, and accidental over-rewarding or under-recognition.
Category Isolation
Strong performance in one pillar cannot offset weak performance in another. Each pillar stands entirely on its own.
Minimum Threshold Rule
An overall rating of 4 or 5 requires every critical pillar to be at least 3 (Meets Expectations). No pillar can be critically low.
Sustained Performance
One-off big projects don't define performance. Consistency, repeatability, and long-term impact across the full period are what matter.
Framework Self-Audit — Two Dimensions to Always Distinguish
Behavioral Integrity Override
2+ formal manager observations or 2+ independent peer behavioral reports automatically cap the overall rating at 3, regardless of all other scores. Requires written documentation to prevent misuse.
Company Culture — Explicit Hard Gate
Company Culture is a critical pillar. A score below 3 prevents any overall rating above 3, regardless of all other pillar scores.
Collaboration: Engagement Quality Sub-Dimension (~20%)
Measures whether the person engages respectfully, gives credit, and handles disagreement constructively — not just what they delivered, but how they showed up.
Structured Anonymous Pulse Survey
For Company Culture, a 3–4 question anonymous survey at mid-period and end-period supplements manager observation to reduce popularity bias and create consistency.
Manager Override Clause (±1)
Managers may adjust the final calculated rating by ±1 with required written justification, handling edge cases the formula cannot capture (e.g., high scores but a serious behavioral incident).
The Review Process
A structured 5-step cycle to ensure consistent, fair, and actionable performance reviews.
Self-Reflection
Employee prepares evidence and self-assessment across all 7 pillars. Collect impact metrics, task scores, artifacts, and examples.
Manager Review
Manager evaluates using this framework, submitted evidence, peer feedback, and behaviors observed over the 6-month period.
Calibration
Managers align scores across all team members to ensure consistency and fairness. Outlier scores in either direction require justification.
Feedback Conversation
Discuss strengths, growth areas, and the development plan together. Frame opportunities against the career progression path.
Action Plan
Set concrete goals and focus areas for the next 6 months. Tie development directly to Career Paths and team strategic roadmap.
Employee Best Practices
Build evidence continuously throughout the period — not just at review time.
Maintain an Accomplishment Log
Document completed tasks, decisions, and outcomes in real time — not just at review time.
Collect Impact Metrics & Artifacts
Attach dashboards, code links, feedback emails, and adoption numbers to your task records.
Document Learning & Growth
Track certifications, talks, courses, experiments, and any knowledge-sharing activities.
Share Early, Communicate Clearly
Timely status updates and stakeholder alignment are evidence for Collaboration and Delivery.
Our Goal
Grow people. Deliver impact. Build the future together.