Knowledge CenterVerify track record and case studies

Territory Hub

agency vettingB2B marketingpipeline attributionrevenue accountabilitydue diligenceCRM proofdemand generation

B2B Agency Vetting Frameworks

Last updated: May 26, 2026

Six structured frameworks for vetting B2B marketing agencies on CRM-backed pipeline proof, industry fit, and revenue attribution across complex buying cycles.

B2B Marketing Agency Vetting Frameworks for Pipeline and Revenue Proof

The B2B Agency Vetting Framework Catalog is a structured methodology for de-risking marketing agency selection in complex, industry-specific buying cycles where generic credentials are worthless and only CRM-backed, deal-size-relevant proof counts. It organizes the entire due diligence process into six named frameworks across three purposes: Diagnostic (does this agency fit my industry and motion?), Evaluation (can they prove what they claim?), and Verification (will the proof hold up under scrutiny?). You don't get fired for a bad agency deck. You get fired for unprovable pipeline.

Most agency vetting fails the same way. A marketing leader collects case studies, runs reference calls, scores chemistry, and signs. Six months later, pipeline hasn't moved, attribution is murky, and the agency blames sales enablement. The problem was never the conversation. It was the absence of a structured proof standard going in. If your agency can't prove pipeline in your CRM, you're not hiring a partner. You're buying a story.

This catalog prevents three failures: wrong vertical fit, unprovable attribution, and contracts that reward activity instead of revenue. The sequencing matters: diagnosis prevents false positives, evaluation scores the claims, verification stops the cherry-picking. Yes, chemistry matters. No, chemistry doesn't create pipeline. Hiring an agency without CRM proof is like buying a car based on a photo of the speedometer.

We've watched this movie for 25 years. The ending is always the same when proof is optional. Most agency-selection advice, including widely cited guidance from BCG, MarTech.org, and MarketVeep, presents single-lens advice. None of it sequences the diagnosis-evaluation-verification arc. None of it specifies recommended proof bars that must be calibrated to your economics. None of it differentiates vetting for SaaS, fintech, or manufacturing, where sales cycles, compliance burden, and deal sizes diverge dramatically. The Starr Conspiracy built this catalog to force agencies to show the system behind the results, not just the results. Learn more about how this connects to broader B2B marketing operations and the Ten Demand States framework that anchors several of the diagnostics below.

A note on thresholds throughout: treat them as minimum bars we recommend, not universal truths. Calibrate CAC payback, ACV bands, committee size, and cycle length to your model and vertical. When CRM data is messy, require source definitions, sample reports, and signed third-party validation. When references must be anonymous, demand a signed reference call and redacted screenshots, not just a logo grid.

The Six Frameworks at a Glance

Diagnostic (fit before evaluation):

Industry-Fit Diagnostic. Does the agency know your vertical's economics and committee?
Demand-State Coverage Audit. Can they operate across the states your buyers actually move through?

Evaluation (score the proof):

Case Study Scoring Rubric. Rate every case study on eight dimensions, not vibes.
CRM-Backed Pipeline Proof Standard. The "CRM-or-it-didn't-happen" bar for revenue claims.

Verification (does the proof survive contact with reality):

Retention and Attribution Verification. What reference calls reveal when you structure them.
Revenue Accountability Contract Review. Do they sign for what they pitched?

Read the summaries, pick the framework that matches your risk, then run the components like a checklist.

The Three Categories

Diagnostic frameworks answer a fit question before you evaluate anything else. If an agency fails diagnosis, do not advance them. Period.

Evaluation frameworks score the proof an agency presents. This is where marketing theater dies and revenue accountability starts. If they lead with impressions, you're funding content, not revenue.

Verification frameworks test whether the proof survives contact with reality. Reference calls without structure are vibes. Vibes don't move revenue.

This order matters: diagnosis prevents false positives, evaluation scores the claims, verification kills the cherry-picked anecdote.

The Six Frameworks

1. Industry-Fit Diagnostic

The Industry-Fit Diagnostic is a screening framework developed by The Starr Conspiracy for filtering agencies before any evaluation begins. It covers five components: vertical sales cycle length match, deal size band match, buying committee composition fluency, regulatory and compliance literacy, and channel mix relevance. Use it when evaluating agencies for verticals with sales cycles exceeding 6 months or buying committees with 5 or more stakeholders.

Check these five fit signals:

Vertical execution within the past 24 months with documented pipeline outcomes, not logos.
Reference deals within your ACV (annual contract value) band, recommended within 50% above or below your median.
Fluency in the 4 to 7 roles on your typical buying committee, named without prompting.
SOC 2, GDPR, PCI literacy and constraints-aware execution where the vertical demands it.
Channel mix that matches where your buyers actually consume, not where the agency likes to work.

Vertical calibration: manufacturing buying committees include distributors and channel partners whose influence rarely shows in CRM. Fail any two of five and do not advance them.

When to use: Run this first on every longlist agency in SaaS, fintech, or manufacturing before reviewing a single case study.

2. Demand-State Coverage Audit

The Demand-State Coverage Audit tests whether an agency can operate across the demand states your buyers move through, not just the narrow band where they happen to be strong. It covers four components: demand state range, motion fluency, creative-to-conversion linkage, and vertical pattern recognition. Use it when your gap is unclear or when you suspect an incumbent is over-indexed on top-funnel awareness.

Demand state range across the full Ten Demand States, with named work in each they claim.
Motion fluency across inbound, outbound, ABM, and partner, with proof of which they've actually run.
Creative-to-conversion linkage showing how assets ladder to pipeline mechanics, not just engagement.
Vertical pattern recognition. Can they describe how demand states behave differently in your industry?

Vertical calibration: SaaS demand states are dominated by category education and competitive displacement; fintech is dominated by trust and compliance signals.

When to use: Run after the Industry-Fit Diagnostic to confirm capability range before requesting case studies.

3. Case Study Scoring Rubric

The Case Study Scoring Rubric scores agency case studies on eight dimensions instead of accepting narrative claims at face value. Use it on every case study an agency presents in a finalist round.

Score these eight dimensions, 0 to 3 each:

Client industry match. Same vertical, same motion, same buyer economics.
Deal size relevance within your ACV band, not a small-business win dressed up for enterprise.
Baseline disclosure of the pre-intervention number, not just the post-intervention claim.
Intervention specificity. What they actually did, not what they "led."
Time-to-impact transparency tied to a realistic sales cycle for that ACV.
Attribution methodology that names the model and its known limits.
Named CRM or analytics source: HubSpot, Salesforce, or the client's system of record.
Post-engagement retention data showing the client kept buying.

A case study scoring below 16 of 24 is marketing theater. Below 12, it's fiction. If they can't name the report, they didn't pull the number.

When to use: Apply to every finalist case study before reference calls begin.

4. CRM-Backed Pipeline Proof Standard

The CRM-Backed Pipeline Proof Standard (internally: the "CRM-or-it-didn't-happen" bar) separates revenue-accountable agencies from impression-accountable ones. Use it as the minimum bar for any agency claiming revenue impact.

Six proof components:

Sourced pipeline in dollars, not MQLs and not "engaged accounts."
Influenced revenue with the attribution model named, governed, and defensible.
CAC payback under a recommended bar of 18 months for SaaS, 24 months for fintech and manufacturing, calibrated to your model.
Win rate lift on agency-sourced opportunities compared to baseline.
ACV trend on sourced deals over the engagement window.
NRR contribution for SaaS with NRR reporting in place.

Non-negotiable CRM proof artifacts to demand on screen-share:

Report name and where it lives in the CRM.
Date range and opportunity stages included.
Source and influence definitions written down, not improvised.
The person who pulled the report, live in the meeting.

A real agency can pull this report in 2 minutes and explain every filter. Watch for attribution gaming: self-reported influence, last-touch cherry-picking, and "engaged account" pseudo-metrics. We've sat through finalist presentations where a vendor claimed $4M in sourced pipeline, then couldn't name the Salesforce report, the attribution model, or the date range. The "proof" was a slide. Make them screen-share. If they can't, the claim does not exist. Deck proof is not system proof.

Vertical calibration: fintech requires compliance-aware reporting; manufacturing requires distributor influence visibility; SaaS requires NRR linkage.

When to use: At finalist presentation and on every incumbent review. If you use one thing from this page, make it this framework.

5. Retention and Attribution Verification

The Retention and Attribution Verification framework is a reference-call structure that tests whether case study claims survive after the agency relationship ends or matures. Use it on every finalist's references, including the references they didn't volunteer.

Client retention length. Under 18 months is a flag; over 36 months is a signal.
Scope expansion pattern. Did the client buy more, buy less, or rebid?
Attribution method audit. Did the client's RevOps team independently validate the reporting?
Offboarding context. Why did the relationship end, and what did the client do next?

Stop rewarding stories. Require structured answers. Anonymous reference? Require a signed call and redacted screenshots, not a logo.

When to use: On every finalist before LOI, and on every incumbent at renewal.

6. Revenue Accountability Contract Review

The Revenue Accountability Contract Review tests whether an agency will sign for the outcomes they claimed in the pitch. Use it at the LOI or MSA stage, before signature.

Pipeline-tied compensation component with a defensible measurement floor.
Sourced-versus-influenced split defined in writing, not assumed.
QBR cadence with revenue data, not activity slides.
Exit clause tied to performance thresholds you can actually invoke.
Joint attribution governance shared with your RevOps team.

Have counsel review the terms. An agency that talks revenue in the pitch and refuses revenue terms in the contract told you everything you need to know.

When to use: At LOI or MSA stage, before signature, for every finalist.

How to Sequence the Catalog

For a full procurement cycle, run the frameworks in order: Industry-Fit Diagnostic to screen the longlist, Demand-State Coverage Audit to confirm capability range, Case Study Scoring Rubric on finalist materials, CRM-Backed Pipeline Proof Standard during finalist presentations, Retention and Attribution Verification on references, and Revenue Accountability Contract Review at commercial close.

For incumbent review, start with the CRM-Backed Pipeline Proof Standard. If the proof isn't there, the rest of the catalog is academic.

A real agency can explain your category narrative and your demand states, then prove pipeline impact in your CRM. If they can't do both, they're not strategic. They're decorative.

Run Framework 4 on your current agency this week. If they can't screen-share the proof, stop debating and start replacing. Every quarter you wait is another quarter you can't explain to the CFO. If you want us to run this vetting with you before your next renewal or agency search kickoff, talk to The Starr Conspiracy. We'll help you force proof before you sign, with CRM-backed standards and a vetting sequence that holds up in the boardroom. No more vibes-based procurement. Make them prove it, or move on.

Steps

Run the Industry-Fit Diagnostic

Screen every agency on the longlist against five vertical-fit dimensions before evaluating any creative work or case studies. This is a pass-fail gate, not a scored comparison. Agencies that fail two or more dimensions do not advance.

•Confirm vertical execution within the last 24 months
•Verify reference deals fall within 50% of your ACV band
•Test buying committee role fluency without prompting
•Check compliance literacy for regulated verticals
•Map their channel mix to your buyer behavior

Audit Demand-State Coverage

Map each surviving agency against the demand states your buyers move through. Test whether they can support more than the narrow band where they happen to be strong, and whether their creative work ladders to pipeline mechanics rather than awareness vanity metrics.

•List the demand states your pipeline depends on
•Ask the agency to map their work to those states
•Test fluency across inbound, outbound, ABM, and partner motions
•Inspect creative-to-conversion linkage in their portfolio

Score Case Studies with the Rubric

Apply the eight-dimension scoring rubric to every case study presented in the finalist round. Score each dimension 0 to 3. Reject case studies that score below 16 of 24 and treat anything below 12 as fiction.

•Require named clients or defensible anonymity
•Demand pre-intervention baselines
•Identify the CRM or analytics source of every number
•Validate that time windows match realistic sales cycles
•Check for post-engagement retention data

Apply the CRM-Backed Pipeline Proof Standard

Require finalists to screen-share live CRM reports for any pipeline or revenue claim. Score against the six pipeline proof components, including CAC payback period calibrated to your vertical. Claims that cannot be shown in a CRM do not exist.

•Request a live HubSpot or Salesforce report walkthrough
•Verify sourced pipeline dollars, not MQL counts
•Confirm CAC payback meets your vertical threshold
•Inspect win rate lift on agency-sourced opportunities
•Check ACV trend on sourced deals over time

Verify with Retention and Attribution References

Structure reference calls around the four verification components. Talk to references the agency did not provide, including former clients. Test whether the case study data was independently validated by the client's RevOps team or accepted on faith.

•Map client retention length and scope expansion pattern
•Ask whether RevOps validated agency attribution
•Source one to two references outside the agency's provided list
•Probe offboarding context for ended relationships

Review the Contract for Revenue Accountability

Before signature, test whether the agency will sign for the outcomes they claimed in the pitch. Negotiate the five contract accountability components into the MSA or LOI. An agency that refuses revenue terms after pitching revenue impact has answered the most important question for you.

•Negotiate a pipeline-tied compensation component
•Define the sourced-versus-influenced attribution split
•Set quarterly business review cadence with revenue data
•Add an exit clause tied to performance thresholds
•Establish joint attribution governance with your RevOps team

When to Use This Framework

Use the B2B Agency Vetting Framework Catalog when you are hiring, renewing, or replacing a marketing agency in a complex B2B buying cycle where pipeline accountability matters more than creative chemistry. The catalog is calibrated for SaaS, fintech, manufacturing, and industrial technology buying cycles with sales cycles exceeding six months, buying committees of five or more stakeholders, and ACVs above 25,000 dollars. It is appropriate when your CFO or CEO has asked you to justify agency spend against pipeline contribution, when a previous agency relationship failed on attribution disputes, or when you are entering a new vertical and need to verify that an agency's claimed expertise translates to your specific motion. Prerequisites include access to your own CRM data so you can pressure-test agency claims against your baseline, a defined ACV band and sales cycle length, a documented view of your buying committee composition, and clarity on which demand states your pipeline depends on most. The catalog is not appropriate for transactional or commodity creative work where pipeline attribution is not the success metric, for sub-25,000 dollar ACV motions where the vetting overhead exceeds the agency spend, or for agencies hired purely for production execution against a strategy you own. If you only need to pressure-test one dimension of an incumbent partner, pull the single applicable framework rather than running the full sequence. The most common partial application is the CRM-Backed Pipeline Proof Standard applied to an existing agency at renewal, which surfaces accountability gaps faster than any other intervention.

Explore this territory

Every published piece in this topical cluster, grouped by format.

B2B Marketing Automation Frameworks

Six named frameworks for selecting, adopting, and operationalizing a B2B marketing automation stack that actually drives pipeline.

Benchmark

B2B Campaign ROI Benchmarks: 2024

Only 23% of B2B marketers can accurately measure campaign ROI across channels, according to Demand Gen Report's 2024 survey of 350+ marketing leaders. This comp

Guide

Demand Generation vs. Creation: B2B Guide

Demand generation vs. demand creation: key differences and how to build a B2B plan that drives real pipeline.

Guide

AI Lead Generation for B2B Teams

AI lead generation uses machine learning to find, score, and engage prospects automatically. Learn how it works, what it replaces, and when to use it.

Framework

B2B Agency Pricing Frameworks

Six B2B agency pricing frameworks for marketing leaders defending spend to the board. Benchmarks, components, and ROI structure from The Starr Conspiracy.

Framework

B2B Agency Vetting Frameworks

Six sequenced frameworks for vetting and onboarding a B2B marketing agency that can build AI-augmented pipeline without breaking the fundamentals.

About The Starr Conspiracy

Bret StarrFounder & CEO

25+ years in B2B marketing. Built and led agencies, launched products, and helped hundreds of companies find their market position.

LinkedIn YouTube

Racheal BatesChief Experience Officer

Leads client delivery and experience design. Ensures every engagement delivers measurable strategic outcomes.

JJ La PataChief Strategy Officer

Drives go-to-market strategy and demand generation for TSC clients. Expert in building B2B growth engines.

Ready to talk strategy?

Book a 30-minute call to discuss how we can help your team.

Loading calendar...

Prefer email? Contact us

See what AI-native GTM looks like

Explore our AI solutions built for B2B marketers who want fundamentals and transformation in one place.

Explore solutions

B2B Agency Vetting Frameworks

B2B Marketing Agency Vetting Frameworks for Pipeline and Revenue Proof

The Six Frameworks at a Glance

The Three Categories

The Six Frameworks

1. Industry-Fit Diagnostic

2. Demand-State Coverage Audit

3. Case Study Scoring Rubric

4. CRM-Backed Pipeline Proof Standard

5. Retention and Attribution Verification

6. Revenue Accountability Contract Review

How to Sequence the Catalog

Steps

Run the Industry-Fit Diagnostic

Audit Demand-State Coverage

Score Case Studies with the Rubric

Apply the CRM-Backed Pipeline Proof Standard

Verify with Retention and Attribution References

Review the Contract for Revenue Accountability

When to Use This Framework

Explore this territory

FAQs

Benchmarks

Trends

Glossary

Related Insights

B2B Marketing Automation Frameworks

B2B Campaign ROI Benchmarks: 2024

Demand Generation vs. Creation: B2B Guide

AI Lead Generation for B2B Teams

B2B Agency Pricing Frameworks

B2B Agency Vetting Frameworks

About The Starr Conspiracy

Ready to talk strategy?

See what AI-native GTM looks like