Skip to content

How Managers Should Hire: A Practitioner's Playbook for Building a Team That Sticks

Aastha Bensla
Aastha Bensla 13 min read
How Managers Should Hire: A Practitioner's Playbook for Building a Team That Sticks

Most hiring failures are not interview failures. They are role-definition failures dressed up as interview failures.

A manager loses a senior IC. The team is hurting. A requisition opens within the week. The job description is copied from the last similar role. Sourcing starts.

Four weeks later, three finalists sit in front of a panel that has not aligned on what good looks like. Someone gets hired on instinct. Six months later, that person is on a performance plan, and everyone says the interview missed something.

The interview did not miss anything. The role definition did.

This is a playbook for managers who want to stop running that loop. It collapses the hiring process into three decisions that actually predict who succeeds: role clarity before sourcing, structured interviews over gut, and a 90-day ramp plan written before the offer.

The Eleanor problem

Eleanor manages a platform engineering team of seven. Her senior IC just resigned. She has been told to backfill quickly because two roadmap commitments depend on that role.

Here is what Eleanor will be tempted to do: open the old job description, change the date, send it to the recruiter, and start scheduling interviews next week.

Here is what Eleanor should do instead: spend two days defining what this role needs to deliver at 30, 60, and 90 days. Then work backward to the rubric, the panel, and the offer. Those two days are the most valuable hours in the entire hiring cycle. Skip them and the next 90 days of work get harder, not the next 90 hours.

The three decisions below are what those two days look like, and what comes after.

Decision 1: Role clarity before sourcing

Role clarity in hiring means a written, measurable definition of what success looks like in the first 30 days, the behaviors that predict it, and the gap on the team this role will close.

Most job descriptions are wish lists. They list responsibilities, not outcomes. They list skills, not behaviors. They give a candidate no way to self-select, and they give a panel no way to calibrate.

Role clarity has three artifacts:

The 30-day outcome. One or two measurable things this person will own by day 30. For Eleanor’s senior IC, that might be: “Owns the auth service on-call rotation independently and ships a documented runbook update by end of week four.” Not “contributes to platform reliability.”

The good-better-best rubric. Three to five behaviors the role depends on, with three levels each. For a senior IC, behaviors might include systems design judgment, async written communication, mentoring junior engineers, incident leadership, and stakeholder alignment. For each behavior, write what a 3 (target), 4 (above bar), and 5 (exceptional) example sounds like. This rubric is the spine of every interview that follows.

The team gap. What is missing on the team today that this hire fills? Eleanor’s team is strong on greenfield builds but weak on production hardening. That gap should be on page one of the JD, not buried in “nice to haves.” Candidates who solve the gap will recognize themselves in the description. Candidates who do not will skip it. Both outcomes are wins.

Once these three artifacts exist, setting expectations with the recruiter, the panel, and the candidate becomes mechanical. Without them, every conversation is improvised.

A note on headcount and employee type: these are decisions that belong upstream of the rubric, not parallel to it. Decide whether you need full-time, contract, or fractional support before you write the JD. The behaviors you screen for change based on the answer.

Decision 2: Structured interviews over gut

Structured interviews are interview loops where every candidate is asked the same calibrated questions, scored against the same rubric, by panelists who have aligned in advance on what each rubric level sounds like.

Unstructured interviews predict job performance about as well as a coin flip. The data on this has been settled for thirty years. Managers keep ignoring it because gut feels like signal. It is not. Gut is a confidence score on the candidate’s similarity to people you already trust, which is exactly the bias you are trying to remove.

Structured interviews have four components:

The same questions for every candidate. If you ask candidate A about a time they shipped under ambiguous requirements and candidate B about a time they handled a difficult stakeholder, you cannot compare them. Pick the questions, write them down, and use them for everyone.

Behavioral over hypothetical. “Tell me about a specific time you had to push back on a product decision” outperforms “How would you handle a disagreement with PM?” Past behavior is concrete and probeable. Hypothetical answers are rehearsed.

Rubric-anchored scoring. Panelists score each behavior on the rubric you wrote in Decision 1, not on overall impression. A 4 on systems design and a 2 on written communication is more useful than “great candidate, good vibes.”

Calibration before the loop, debrief after. The panel should meet once before interviews start to align on what a 3 versus a 4 sounds like for each behavior. They should meet again after every candidate finishes the loop to compare scores before anyone shares a hire-no-hire opinion. Sharing opinions before scores anchors the room and destroys the signal.

A few specific question patterns that consistently produce useful data:

  • “Walk me through the most technically interesting decision you made in the last six months. What were the alternatives?”
  • “Tell me about a time you changed your mind about a strongly held opinion at work. What made you change it?”
  • “Describe a piece of feedback you received that was hard to hear. What did you do with it?”

For more on the discipline of writing these down, see 10 examples of interview feedback.

Assessments and personality fit

Skill assessments belong in the loop when the role has a measurable craft component: code, design, writing, analysis. They should be timed, scoped, and graded against the same rubric every candidate sees.

Personality assessments are different. They are useful as conversation starters and as a way to surface working-style mismatches before they become Monday-morning mismatches. Risely’s DISC for hiring guide covers how to use DISC results in the interview loop without letting them become a filter.

A warning: do not use any assessment as a gate. Use it as a data point, weighted alongside rubric scores from the structured interviews. A candidate who scores low on a written test but high on every behavioral panel is usually telling you something the test is not measuring.

Values and working-style fit, not “cultural fit”

The phrase “cultural fit” has done more damage to hiring than almost any other phrase in the manager vocabulary. It collapses two different things into one, and the second one is usually bias.

The two things:

Values fit. Does this candidate share the principles the team operates on? Honesty about mistakes, written-first communication, evidence over opinion, whatever your team’s principles are. This is real and worth screening for.

Working-style fit. Does this candidate’s preferred way of operating mesh with how the team works today? Async-heavy versus meeting-heavy, deep work versus high-collaboration, autonomous versus structured. Also real, also worth a conversation.

Cultural fit, as commonly used, is neither of these. It is “do I want to grab a beer with this person.” That question is a filter for sameness. It removes the people who would have made the team stronger by being different.

Replace it with two specific questions: do they share our values, and does their working style work here. Score those separately. Be willing to hire someone whose working style stretches the team a bit, because that is often where growth comes from.

Decision 3: The 90-day ramp plan, agreed before the offer

A 90-day ramp plan is a written set of week 1, day 30, day 60, and day 90 milestones that the manager and the candidate agree to before the offer letter is signed.

Most managers write the ramp plan in week one of the new hire’s first month. By then, it is too late. The new hire has already absorbed a week of ambiguity. The manager has missed the first opportunity to set the tone. The team has started forming opinions based on first impressions instead of agreed expectations.

Write it earlier. Write it as part of the final round of interviews, not as part of onboarding.

The plan has four checkpoints:

Week 1. The new hire has read the team’s operating doc, met every direct collaborator one-on-one, set up their dev environment or equivalent, and shadowed one piece of live work. By Friday of week one, they should be able to draw the team’s main systems on a whiteboard and name three people they will work with closely.

Day 30. The 30-day outcome from Decision 1 is delivered. For Eleanor’s senior IC, that meant owning the on-call rotation and shipping a runbook update. The outcome is owned, not assisted.

Day 60. The new hire has shipped one piece of independent work that exposed them to the team’s full delivery cycle, from spec to review to ship to retro. They have given written feedback on at least one teammate’s work. They have surfaced one thing they would change about how the team operates.

Day 90. The new hire is fully load-bearing on the responsibilities the role was hired for. The hiring manager and the new hire run a structured 90-day check-in: what is working, what is not, what changes for quarter two.

Sharing this plan during the final interview round does two things. First, it filters. Candidates who push back hard on the milestones are telling you something useful before you make the offer.

Second, it commits both sides. The manager has now promised the candidate a structured first quarter. The candidate has agreed to specific outcomes. Day one starts with a contract, not a blank page.

Where Merlin fits in

The week the new hire starts, the harder job is actually the manager’s. Eleanor has to deliver feedback in the first week without sounding harsh. She has to course-correct on small things before they become patterns, and make the new hire feel safe enough to ask questions that reveal gaps.

This is exactly the kind of preparation an AI coach helps with. Merlin can help Eleanor draft her week-one feedback message, rehearse the day-30 check-in conversation, and pressure-test her ramp plan against scenarios Eleanor has not thought through yet. The 26% skill improvement that Risely users see in 12 weeks shows up most clearly in moments like these. A manager has 10 minutes between meetings to prepare for a conversation that will shape the next quarter.

This is also where the hiring loop closes. A great hire becomes a great teammate when the manager invests in their own coaching practice alongside the new hire’s ramp.

Putting it together

Eleanor’s two days of role clarity work, plus the structured loop, plus the agreed ramp plan, take roughly three weeks of calendar time before the offer is signed. That is not slower than the copy-paste-and-hope approach. It is faster, because it does not produce a six-month performance plan on the back end.

The three decisions are sequential. Skip role clarity and your interviews have nothing to calibrate against. Skip structured interviews and your rubric is decoration. Skip the ramp plan and your great hire walks into ambiguity in week one.

Run all three and the next 90 days of team work get easier. The new hire knows what success looks like. The manager has an honest answer to “is this working” by day 30 instead of month six.

Try Merlin’s manager coaching free at risely.me/try-merlin. Bring your next hiring decision and we will work through the rubric, the panel, and the ramp plan together.


FAQs

What is the single biggest mistake managers make when hiring?

They start sourcing before they have written down what good looks like at day 30. Without a measurable 30-day outcome and a rubric for the role, every interview becomes a vibe check. The fix takes 45 minutes: define the outcome, list 3-5 must-have behaviors, and write the rubric before posting the role.

How long should a structured interview loop take?

Four to five conversations across two weeks is enough for most IC roles. A recruiter screen, a hiring manager conversation, two skill-based panels with calibrated rubrics, and a values plus working-style discussion. More rounds rarely improve signal. They usually just exhaust the candidate and the panel.

When should the 90-day ramp plan be written?

Before the offer letter goes out. The plan should be agreed with the candidate during the final round, not after they accept. This does two things: it filters out candidates who disagree with the ramp expectations, and it gives the new hire a contract for week one instead of a blank page.

Talk to Merlin

Get personalized coaching on the skills covered in this article — powered by AI that understands your context.

Try Merlin Free
Aastha Bensla

Written by

Aastha Bensla

MA Applied Psychology, Manav Rachna International. Industrial-organizational psychologist and clinical counselor.

Aastha has sat across from people in two very different settings: as a clinical counselor helping individuals work through personal challenges, and as an I/O psychologist at Risely helping managers work through professional ones. Her MA in Applied Psychology from Manav Rachna gave her the frameworks; the counseling gave her the instinct for what people actually need to hear versus what sounds good on paper.

Take Assessment Try Merlin Free