I was recently asked to be part of a focus group for a new performance management framework. One of the things I noticed is that the framework fell into the same trap as what I’ve seen at other companies I’ve worked that had the same framework. I’m not sure where these HR teams get the ideas for these frameworks, but they all suffer from the same flaws. Regardless, I wanted to highlight an issue I saw with their 9-Box model.

The 9-Box was being measured on two axes: impact and growth velocity. The impact axis I had no issue with. The growth velocity, however, has the innate problem in that it is not measurable as defined. At a previous company, they called it growth potential. I raised the same objections then but was dismissed. Now, I’m at a more open-minded company and I was asked for what would “good” look like. It gave me some pause and then I had to admit that I needed more time, that this topic would be good to discuss off-line. The reason is that fixing the measurability of a person’s growth is backed by complexity, so first understanding that complexity makes it then simple to apply.

This page is my attempt to bring measurement to the growth debate. It will never be 100% accurate but it will be evidence-based, which helps to mitigate biases and subjectivity. The end goal is to move the measurement of someone’s growth out of the realm of vibes, and into the realm of science.

It is borne out of how I’ve always wanted my growth to be measured.

This is now an interactive tool: Growth Velocity Rubric or check the Tools link at the top of this page.

Growth Velocity Rubric Specification

Purpose

The Growth Velocity Rubric is a structured assessment tool for evaluating how quickly and effectively an employee is becoming more capable over time.

It is designed to support performance management, talent reviews, calibration discussions, and 9-box-style assessments where “growth,” “potential,” or “trajectory” may otherwise be evaluated subjectively.

The core problem this rubric solves is that growth velocity is often discussed as if it is obvious, but without a rubric it can easily become based on vague impressions, charisma, self-promotion, manager preference, or recency bias.

This rubric provides:

  • A consistent scoring model.
  • Role-normalized expectations.
  • Observable behavioural criteria.
  • Evidence capture.
  • A low / medium / high growth velocity rating.
  • Confidence scoring.
  • A generated summary for calibration or documentation.

The rubric is not intended to replace manager judgment. It is intended to discipline manager judgment so that ratings are based on observable evidence rather than unsupported impressions.

Definition of Growth Velocity

We need a definition to intentionally focus on visible change over time. Growth velocity is defined as:

The rate at which someone converts feedback, new challenges, and increasing ambiguity into demonstrably better performance.

Growth velocity is not the same as:

  • Effort.
  • Ambition.
  • Confidence.
  • Busyness.
  • Working long hours.
  • Taking training courses.
  • Being well-liked.
  • Having high impact today.
  • Being early-career and therefore learning many visible basics.

Growth velocity asks:

Is this person becoming more capable, and is that increased capability showing up in their work?

Relationship Between Impact and Growth Velocity

Impact and growth velocity are related but distinct. The first is backwards-looking. The second is forward-looking. Anything based in the future is naturally prone to a lack of measurement.

Impact asks:

What did the person deliver?

Growth velocity asks:

How much more capable is the person becoming, and how quickly is that change appearing in their work?

For example, a person may have high impact but medium growth velocity if they continue delivering well but are not materially expanding capability, judgment, independence, or scope.

Another person may have medium current impact but high growth velocity if they are rapidly improving, taking feedback well, expanding scope, and becoming effective in more ambiguous work.

The rubric is meant to make this distinction clearer during performance calibration.

Assessment Dimensions

The rubric evaluates seven dimensions:

  1. Feedback uptake
  2. Scope expansion
  3. Skill acquisition
  4. Judgment improvement
  5. Independence
  6. Learning transfer
  7. Time compression

Each dimension is scored from 0 to 3.

The maximum total score is 21.

Limitations

The rubric has limitations.

  • It does not fully remove subjectivity. It structures subjectivity.
  • It does not prove growth velocity completely scientifically. It creates a more consistent evidence-based assessment.
  • It may still reflect bias if managers provide biased evidence.
  • It may be harder to use when managers have weak observation quality.
  • It may require calibration to avoid score inflation.
  • It may need job-family examples (see Appendix) to become operationally useful across a whole company.
  • It should not be used as the only input into compensation, promotion, or succession decisions.

Scoring Scale

Each dimension uses the following general scoring pattern:

ScoreMeaning
0No clear evidence of growth, or evidence is below current role-level expectations.
1Some evidence of growth, but inconsistent, narrow, prompted, or highly dependent on support.
2Clear evidence of healthy growth appropriate for the person’s current role level.
3Strong evidence of unusually fast, repeatable, or leverage-building growth beyond current role-level expectations.

A score of 2 should represent good, expected growth for the selected role level.

A score of 3 should be reserved for growth that is faster, broader, more repeatable, or more leveraged than expected.

The scoring model deliberately uses a small 0–3 scale because growth velocity is not precise enough to justify fine-grained scoring. A larger scale would create false precision. But scoring is just one facet and it can cause more experienced people to look like they have less velocity than less experienced people, simply because less experienced people have more headroom.

Role Normalization

Role normalization is central to the rubric.

The same growth dimension means different things at different levels. For example, “independence” for an early-career employee may mean completing familiar tasks with less guidance. For a senior/staff employee, independence may mean turning ambiguous organizational problems into clear direction for others.

Role normalization is applied inside each dimension score.

The tool does not calculate a raw score and then adjust it after the fact. Instead, the selected role level determines which behavioural expectations should be used when assigning each score.

The supported role levels are:

  1. Junior / Early Career
  2. Intermediate
  3. Senior / Staff

The selected role level is highlighted in the rubric table so the evaluator can compare the employee against the appropriate expectation set.

The final score therefore means:

Total growth velocity score against the expected growth path for the selected role level.

Role-Level Philosophy

A 3-level philosophy is used. It is possible to add more, but realize that more stratification doesn’t always lend itself to more accuracy.

Junior / Early Career

At this level, growth is often visible through:

  • Learning foundational skills.
  • Becoming reliable on familiar tasks.
  • Needing less direct guidance.
  • Applying feedback to day-to-day work.
  • Avoiding repeat mistakes.
  • Moving from task execution toward small workstream ownership.

High growth velocity at this level usually means the person is becoming productive and reliable faster than expected.

Intermediate

At this level, growth is often visible through:

  • Owning normal role-scope work independently.
  • Expanding from tasks to projects, workflows, or features.
  • Applying judgment to tradeoffs.
  • Transferring learning across projects.
  • Becoming effective in new responsibilities within expected time.
  • Reducing the need for manager intervention.

High growth velocity at this level usually means the person is expanding ownership and decision quality faster than expected.

Senior / Staff

At this level, growth is often visible through:

  • Improving judgment in ambiguous, high-impact, or cross-functional situations.
  • Creating clarity where there is ambiguity.
  • Expanding influence across teams or systems.
  • Translating learning into standards, patterns, practices, coaching, or reusable frameworks.
  • Improving the work of others, not only their own output.
  • Building organizational leverage.

High growth velocity at this level is less about learning more visible basics and more about increasing leverage, influence, judgment, and system-level contribution.

Dimension Criteria

These are a set of attributes to gauge someone’s growth. Because growth is not just a simple metric, and we want to disincentivize burnout, we focus these criteria across different dimensions.

Feedback Uptake

Feedback uptake measures whether the employee receives feedback and converts it into changed behaviour or improved work.

ScoreJunior / Early CareerIntermediateSenior / Staff
0Receives feedback but does not visibly apply it; same issues repeat.Receives feedback but behaviour or output does not materially change.Resists or rationalizes feedback; repeated issues affect trust, influence, or execution.
1Applies feedback when reminded or closely guided.Applies feedback in familiar situations, but inconsistently or only after manager prompting.Applies feedback selectively; improvement is visible but narrow or dependent on direct prompting.
2Takes feedback seriously and shows clear improvement in day-to-day work.Integrates feedback into normal work and reduces repeat issues over time.Adjusts communication, judgment, leadership style, or stakeholder approach based on feedback.
3Rapidly converts feedback into changed behaviour and improved reliability.Proactively seeks feedback, applies it across situations, and shows fast improvement.Actively seeks difficult feedback, changes approach quickly, and improves trust, leverage, or cross-team effectiveness.

Guidance

Feedback uptake should be based on observable change, not whether the person seemed receptive in the conversation. Someone can appear gracious when receiving feedback but not change behaviour. Conversely, someone may need time to process feedback but later apply it well.

Useful evidence includes:

  • Reduced repeat issues.
  • Better stakeholder communication.
  • Improved work quality after feedback.
  • Peer or stakeholder confirmation.
  • Clear before/after examples.

Scope Expansion

Scope expansion measures whether the employee is growing into broader, more complex, or more ambiguous ownership.

ScoreJunior / Early CareerIntermediateSenior / Staff
0No clear expansion beyond assigned tasks.No clear expansion beyond normal ownership.No growth in strategic, systemic, or cross-team scope.
1Takes on slightly larger tasks when assigned.Handles adjacent work with support.Contributes to broader initiatives but mostly as a participant.
2Reliably handles larger tasks in the same area.Owns a broader feature, project, process, or workflow.Leads cross-functional or ambiguous work expected at level.
3Quickly expands from tasks to small workstreams.Moves into broader ownership faster than expected.Expands influence across teams, systems, standards, or organizational practices.

Guidance

Scope expansion should not mean simply giving someone more work. It should mean the person is successfully operating at a broader level of ownership or complexity.

For senior/staff employees, scope expansion should often involve influence beyond a single task or project. This might include cross-team standards, reusable practices, architectural direction, operational improvements, or system-level change.

Skill Acquisition

Skill acquisition measures whether the employee is developing new capabilities and applying them in meaningful work.

ScoreJunior / Early CareerIntermediateSenior / Staff
0Does not build required foundational skills; needs repeated support on basics.Does not develop skills needed for current role expectations.Does not close meaningful skill gaps that limit effectiveness at level.
1Learns required skills when given structured training or direct coaching.Builds new skills in familiar areas, but application remains narrow.Builds skills in adjacent areas but applies them inconsistently or tactically.
2Develops core skills and applies them in normal assigned work.Acquires new skills and applies them independently to improve delivery quality.Builds capability in adjacent domains and uses it to make better decisions or improve outcomes.
3Learns foundational skills quickly and becomes productive faster than expected.Rapidly develops new capabilities and applies them across multiple projects or contexts.Quickly develops leverage-building expertise and uses it to raise team, system, or organizational capability.

Guidance

Skill acquisition should not be scored based only on completed training, certifications, or courses. The important question is whether the new skill appears in delivered work, better decisions, better execution, or increased team capability.

For senior/staff employees, learning a tool may not be sufficient evidence of high growth velocity. Stronger evidence would be using an adjacent skill to improve team practices, architecture, security, delivery quality, customer outcomes, or decision-making.

Judgment Improvement

Judgment improvement measures whether the employee is making better decisions, identifying risks earlier, and improving tradeoff quality.

ScoreJunior / Early CareerIntermediateSenior / Staff
0Repeats avoidable mistakes; does not recognize obvious tradeoffs or risks.Continues to miss normal role-scope risks, dependencies, or tradeoffs.Misses systemic risks, second-order effects, or organizational tradeoffs expected at level.
1Recognizes issues after they are pointed out.Identifies common risks and tradeoffs with some guidance.Identifies broader risks but still needs frequent steering on prioritization or stakeholder impact.
2Shows better judgment on familiar tasks; asks better questions before acting.Makes sound tradeoffs in normal work and escalates issues appropriately.Makes better tradeoffs earlier, with fewer reversals, escalations, or preventable surprises.
3Rapidly improves decision quality and begins anticipating simple risks independently.Anticipates risks across projects and improves decision quality faster than expected.Improves the quality of decisions around ambiguous, high-impact, or cross-functional problems; helps others make better tradeoffs too.

Guidance

Judgment improvement is one of the most important dimensions for senior employees. At higher levels, growth may be less visible as task execution and more visible as improved prioritization, risk detection, tradeoff quality, and decision framing.

Useful evidence includes:

  • Fewer preventable surprises.
  • Earlier escalation of meaningful risks.
  • Better prioritization.
  • Stronger recommendations.
  • Improved architecture, design, operational, or stakeholder decisions.
  • Helping others make better decisions.

Independence

Independence measures whether the employee needs less direction over time and can operate effectively with appropriate autonomy for their level.

ScoreJunior / Early CareerIntermediateSenior / Staff
0Needs repeated direction on familiar tasks.Needs repeated direction on normal role-scope work.Needs heavy direction on ambiguous or cross-functional work expected at level.
1Can complete familiar tasks with guidance.Can complete normal tasks with guidance.Can handle some ambiguous work but needs frequent steering.
2Can complete familiar tasks independently.Can own normal role-scope work independently.Can structure ambiguous work and drive it with limited guidance.
3Rapidly moves from guided to independent work.Takes on broader or less-defined work faster than expected.Turns ambiguity into clear plans, decisions, frameworks, or direction for others.

Guidance

Independence does not mean never asking questions or never escalating issues. Healthy independence includes knowing when to ask, when to decide, when to escalate, and when to create clarity for others.

For senior/staff employees, the strongest evidence is not merely that they can work alone. It is that they can take ambiguous situations and turn them into direction, structure, decisions, or reusable approaches.

Learning Transfer

Learning transfer measures whether the employee applies lessons from one situation to other contexts.

ScoreJunior / Early CareerIntermediateSenior / Staff
0Learns only in the exact situation; repeats the same mistakes in similar contexts.Does not apply lessons from one project, incident, or feedback cycle to another.Does not convert experience into reusable judgment, standards, coaching, or system improvement.
1Applies lessons when the situation is very similar or when reminded.Applies lessons across similar work, but inconsistently.Applies lessons to adjacent situations, but mostly for their own work.
2Applies lessons from one task to similar future tasks.Transfers learning across projects, stakeholders, or problem types.Converts lessons into reusable practices, standards, patterns, decision records, coaching, or team improvements.
3Quickly generalizes lessons and avoids repeat mistakes across related tasks.Reuses learning across meaningfully different contexts and improves outcomes faster than expected.Turns individual learning into organizational leverage: better systems, better defaults, better team judgment, or reduced future risk.

Guidance

Learning transfer is especially important because growth velocity should not only measure isolated improvement. It should measure whether the person can generalize what they learn.

For senior/staff employees, learning transfer should often produce leverage. Examples include:

  • Standards.
  • Playbooks.
  • Design patterns.
  • Decision records.
  • Coaching.
  • Improved defaults.
  • Reusable frameworks.
  • Process improvements.
  • Reduced future risk.

Time Compression

Time compression measures how quickly the employee reaches effective capability after exposure to new work, new context, or new ambiguity.

ScoreJunior / Early CareerIntermediateSenior / Staff
0Takes longer than expected to reach basic competence, with little visible progress.Takes longer than expected to become effective in normal role-scope work.Takes longer than expected to become effective in new domains, ambiguous problems, or broader influence areas.
1Progresses with significant structure, repetition, or coaching.Becomes effective with normal support, but not faster than expected.Becomes effective in new areas with support, but requires more context-setting than expected.
2Reaches competence within expected time for level and context.Becomes effective in new work within expected time.Becomes effective in new domains, teams, or problem spaces within expected time for level.
3Reaches competence faster than expected after exposure to new work.Ramps quickly into new responsibilities, domains, or ownership areas.Rapidly becomes effective in unfamiliar, ambiguous, or cross-functional spaces and creates clarity for others.

Guidance

Time compression should be used carefully. It should not reward overwork, unsustainable intensity, or constant urgency.

The dimension measures speed to effective capability, not speed to exhaustion.

Evidence should show that the person became genuinely effective faster than expected, not merely that they worked longer hours or appeared busy.

Evidence Model

Each dimension includes three evidence fields:

  1. Before
  2. After
  3. Evidence

Before

This captures the prior state.

Examples:

  • Needed repeated reminders to update stakeholders.
  • Required detailed task breakdowns.
  • Missed dependencies until late in delivery.
  • Had limited understanding of a domain.
  • Struggled to apply feedback across contexts.

After

This captures the observable change.

Examples:

  • Now sends concise weekly stakeholder updates without prompting.
  • Breaks down ambiguous work independently.
  • Identifies cross-team dependencies earlier.
  • Applies new domain knowledge to improve design choices.
  • Uses feedback from one situation in another project.

Evidence

This captures the artifact, observation, or validation that supports the score.

Examples:

  • Delivered project.
  • Design document.
  • Decision record.
  • Peer feedback.
  • Stakeholder feedback.
  • Manager observation.
  • Incident review.
  • Sprint output.
  • Architecture review.
  • Customer outcome.
  • Reduced repeat issues.
  • New standard, playbook, or reusable pattern.

The evidence model exists because a numeric score without evidence becomes subjective. The rubric should require evidence for any non-zero score, especially scores of 2 or 3.

Total Score and Rating Bands

Each of the seven dimensions is scored from 0 to 3.

Maximum score:

7 dimensions × 3 points = 21 points

The rating bands are:

Total ScoreGrowth Velocity Rating
0–7Low
8–14Medium
15–21High

Low Growth Velocity

A low rating means evidence is mostly absent, inconsistent, or below current role-level expectations.

This does not necessarily mean the person is low-performing overall. It means the evidence does not show meaningful growth velocity during the review period.

Medium Growth Velocity

A medium rating means evidence shows normal, healthy growth for the selected role level.

This should be considered a positive and acceptable rating. Most people should likely be in this band most of the time if they are growing appropriately.

High Growth Velocity

A high rating means evidence shows growth beyond current role-level expectations across multiple dimensions.

A high rating should be uncommon and evidence-heavy.

High-Rating Gate

To avoid inflated high ratings, the rubric includes a high-rating gate.

For a defensible High rating, the assessment should meet both of the following conditions:

  1. At least 4 of the 7 dimensions are scored 2 or higher.
  2. At least 2 dimensions are scored 3.

This prevents a person from being rated High based on one exceptional area or a single recent success. If the total score reaches the High band but the gate is not satisfied, a warning should be displayed so that there is a chance to understand the improvement path. This does not prevent the evaluator from seeing the numeric score, but it signals that the High rating may not be well-supported.

Confidence Rating

The rubric also needs a separate confidence rating:

  • Low
  • Medium
  • High

Confidence is not part of the numeric score. It describes how strong the evidence base is.

Low Confidence

Use Low confidence when:

  • Evidence is mostly anecdotal.
  • There are too few examples.
  • The before/after state is unclear.
  • The rating is based on a narrow interaction.
  • There is little stakeholder or artifact support.

Medium Confidence

Use Medium confidence when:

  • There are some concrete examples.
  • Evidence is real but limited.
  • Evidence may be concentrated in one project.
  • More time or broader validation would improve confidence.

High Confidence

Use High confidence when:

  • There are multiple examples across time.
  • Evidence comes from more than one project, stakeholder, or artifact.
  • The before/after pattern is clear.
  • The growth appears repeatable.
  • The rating would likely survive calibration.

The confidence rating is important because someone may appear to have high growth velocity based on a single project, but the evidence may not yet be broad enough to support high confidence.

Intended Usage

The rubric can be used by:

  • Managers preparing performance reviews.
  • P&C teams designing performance frameworks.
  • Calibration groups.
  • Talent review committees.
  • Employees preparing self-assessments.
  • Mentors or career coaches helping someone understand growth areas.

Recommended usage:

  1. Select the employee’s current role level.
  2. Review each dimension.
  3. Score against the selected role-level criteria.
  4. Capture before/after/evidence notes.
  5. Review the total score.
  6. Check whether the high-rating gate passed, if applicable.
  7. Set evidence confidence.
  8. Use the generated summary for calibration or documentation.

Calibration Guidance

During calibration, reviewers should ask:

  • What changed over the review period?
  • Is the change observable?
  • Is the evidence tied to work output, behaviour, or stakeholder impact?
  • Is the person being scored against the correct role level?
  • Are we rewarding effort, or actual capability growth?
  • Are we rewarding confidence or communication polish instead of evidence?
  • Is the rating based on one recent event?
  • Would another manager reach the same rating from the evidence?
  • Are experienced employees being penalized because their growth is subtler?
  • Are junior employees being over-rewarded simply because their learning is more visible?

Calibration should focus on the evidence, not on defending the initial score.

Bias Risks and Safeguards

Growth velocity is vulnerable to bias if left undefined.

Common risks include:

RiskExpanationSafeguard
Recency BiasA recent success or failure may distort the whole rating.Require evidence across the review period where possible.
Charisma BiasPolished communicators may appear to be growing faster.Require artifacts, before/after examples, and stakeholder evidence.
Ambition TheatrePeople who talk about growth may be rated higher than people who quietly improve.Score visible change in work, not stated ambition.
Manager PreferenceA manager may favour employees whose style resembles their own.Use role-normalized criteria and calibration discussion.
Junior Visibility BiasJunior employees may appear to grow faster because their growth is more obvious.Normalize expectations by role level.
Senior Invisibility BiasSenior employees may be underrated because their growth appears as better judgment, influence, or leverage rather than visible task learning.Use senior/staff criteria focused on ambiguity, systems, standards, judgment, and organizational leverage.

Appendix: Job-Family Customization

It is important to be honest that the current rubric is generic. It can apply broadly across functions, but some organizations may want job-family examples. Job-family customization means keeping the scoring model the same, but changing the examples, evidence expectations, and language so the rubric makes sense for a specific type of work. Do not customize the whole rubric from scratch for each job family. That creates inconsistency. I’d customize only the parts that help managers recognize evidence.

For example, this is what growth can look like for each job family:

Engineering

  • Better design documents.
  • More reliable delivery.
  • Stronger architecture tradeoffs.
  • Reduced operational risk.
  • Better code review judgment.
  • Reusable technical patterns.

Security

  • Earlier risk detection.
  • Better threat modelling.
  • Stronger control design.
  • Improved stakeholder influence.
  • Better incident learning transfer.
  • More scalable security patterns.

Product

  • Better prioritization.
  • Stronger customer insight.
  • Improved discovery quality.
  • Better stakeholder alignment.
  • Clearer tradeoff decisions.

Operations

  • Improved process reliability.
  • Better escalation judgment.
  • Reduced repeat incidents.
  • Stronger documentation.
  • More scalable workflows.

The core scoring model can stay the same while examples are adapted to each function.

Example: same dimension, different job families

Take Judgment Improvement.

The generic senior/staff criterion is:

Makes better tradeoffs earlier, with fewer reversals, escalations, or preventable surprises.

That is useful, but still abstract. Job-family customization makes it concrete.

Job familyWhat improved judgment might look like
EngineeringMakes better architecture tradeoffs; catches scalability, reliability, or maintainability risks earlier; avoids overengineering.
SecurityIdentifies meaningful risk earlier; distinguishes theoretical risk from practical risk; designs controls that balance security, cost, usability, and operational burden.
ProductMakes better priority calls; separates customer signal from noise; balances customer value, delivery cost, and business strategy.
Customer SupportEscalates the right issues earlier; distinguishes one-off complaints from systemic issues; improves resolution quality.
FinanceSpots anomalies earlier; improves forecast assumptions; identifies material business risks or cost drivers sooner.
People & CultureAnticipates fairness, consistency, adoption, and manager-behaviour risks before rolling out programs.

Example: customized evidence artifacts

For Learning Transfer, the generic senior/staff criterion is:

Converts lessons into reusable practices, standards, patterns, decision records, coaching, or team improvements.

That could become:

Job familyEvidence artifacts
EngineeringADRs, reusable libraries, coding standards, post-incident fixes, design review improvements.
SecurityThreat model templates, control patterns, risk decision records, secure design guides, incident lessons applied to controls.
ProductDiscovery templates, prioritization framework, customer insight synthesis, roadmap decision records.
SalesImproved discovery playbook, objection handling guide, better qualification criteria, reusable account strategy.
P&CManager enablement guides, calibration examples, policy clarification, performance-review training materials.
OperationsSOPs, runbooks, escalation paths, quality checks, process controls.

Again, the scoring model stays the same. The examples become function-specific.