Advanced Content

Advanced Content

Data Quality Metrics: How to Track What Matters

Data Quality Metrics: How to Track What Matters

Benjamin Douablin

CEO & Co-founder

edit

Updated on

You know your CRM data isn't perfect. But how bad is it, exactly? And is it getting better or worse?

That's what data quality metrics answer. They turn vague complaints like "our data is a mess" into specific numbers: "email completeness is at 74%" or "we have a 9% duplicate rate." Numbers you can track, fix, and report to leadership without hand-waving.

If you've already wrapped your head around data quality dimensions, metrics are the next step. Dimensions tell you what to measure. Metrics tell you how much and how bad. This guide covers the core metrics every B2B team should track, how to calculate each one, and how to build a scorecard that people actually use.

Data Quality Metrics vs. Dimensions vs. KPIs

Before diving in, let's clear up the vocabulary. These three terms get used interchangeably, and they shouldn't be.

  • Dimensions are the categories of quality you care about — accuracy, completeness, timeliness, etc. They describe what to measure. (We've covered these in depth in our data quality dimensions guide.)

  • Metrics are the specific, quantifiable measurements within each dimension. "Email completeness rate = 87%" is a metric. It's a number you can calculate right now from your database.

  • KPIs are metrics tied to a target. "Email completeness rate ≥ 95%" is a KPI — it combines the metric with a threshold that defines "good enough."

Think of it this way: dimensions are the exam subjects, metrics are your scores, and KPIs are the passing grades. If your team only talks about dimensions ("we need better accuracy"), nobody knows what to do. Talk in metrics with targets ("accuracy dropped to 91% — target is 98%"), and you have a clear problem and a clear goal.

The Core Data Quality Metrics to Track

You don't need twenty metrics. You need six to eight that cover the dimensions that matter most for your business. Here are the ones that B2B sales and marketing teams should start with.

1. Accuracy Rate

What it measures: The percentage of records where the data reflects reality. Does the job title match what the person actually does? Does the email actually belong to them? Is the company information current?

How to calculate it:

Accuracy Rate = (Number of Correct Records / Total Records Checked) × 100

Accuracy is the hardest metric to measure because you need a source of truth to compare against. In practice, B2B teams use proxies: email bounce rate (hard bounces indicate inaccurate addresses), phone connection rate (consistent low rates signal bad data), and spot-check sampling (pull 100 random records, verify key fields against LinkedIn).

Benchmark: For B2B contact data, aim for ≥ 95% email accuracy (measured by bounce rate ≤ 5%, ideally ≤ 2%) and ≥ 90% accuracy on job titles and company names based on spot-check sampling.

2. Completeness Rate

What it measures: The percentage of records that have all required fields filled in. Not every field matters equally — focus on the ones your team needs to take action.

How to calculate it:

Completeness Rate = (Records with All Required Fields Populated / Total Records) × 100

You can also measure completeness per field:

Field Completeness = (Non-Null Values for Field / Total Records) × 100

For B2B sales, "required fields" typically means: first name, last name, company, job title, verified email, and direct phone number. If your CRM has 10,000 leads but 4,000 are missing a phone number, your phone completeness is 60%.

Benchmark: Email completeness ≥ 95%, phone ≥ 60-70% (phones are harder to find), job title ≥ 90%, company name ≥ 98%.

3. Validity Rate

What it measures: The percentage of records where values conform to the expected format and rules. An email without an "@" is invalid. A phone number with six digits is invalid. A country field containing "United States" when your system expects "US" is a validity issue.

How to calculate it:

Validity Rate = (Records Passing Format Validation / Total Records) × 100

Validity is usually the easiest metric to automate. You define rules — email must match a regex pattern, phone must include country code, dates must be in ISO format — and run every record through them.

Common B2B validity checks: email contains "@" and a valid domain, phone includes country code with correct digit count, LinkedIn URLs resolve to active profiles, and country fields match ISO codes.

Benchmark: Aim for ≥ 98% validity on structured fields like email and phone. For free-text fields (job title, company name), validity is harder to enforce — focus on the fields that feed automations and filters.

4. Consistency Rate

What it measures: The percentage of records where the same data matches across different systems or follows standardized formats. If your CRM says a lead's company is "HubSpot" but your marketing automation tool says "Hubspot, Inc.", that's an inconsistency.

How to calculate it:

Consistency Rate = (Records Matching Across Sources / Total Records Compared) × 100

Measuring consistency requires picking a source of truth — usually your CRM — and comparing records against the same entity in other systems. You can also measure format consistency within a single system: how many different ways does "United States" appear in your country field? Five variants of the same value will break every filter and segment.

Benchmark: ≥ 95% consistency between CRM and marketing automation on shared fields. For controlled-vocabulary fields (country, industry, seniority), target 100% using standardized picklists.

5. Timeliness (Freshness) Rate

What it measures: The percentage of records that have been updated or verified within an acceptable time window. In B2B, contact data decays fast — people change jobs, companies rebrand, phone numbers get reassigned.

How to calculate it:

Freshness Rate = (Records Updated Within [X] Days / Total Records) × 100

The time window depends on your use case. For active outbound prospecting, you might want records verified within the last 90 days. For a general marketing database, 180 days might be acceptable.

Practical shortcut: Run a CRM report grouped by "last modified date" buckets — 0-30 days, 31-90, 91-180, and 180+ days. The 180+ bucket is your danger zone.

Benchmark: ≥ 80% of records updated within the last 90 days, and ≤ 10% older than 180 days without re-verification.

6. Uniqueness (Deduplication) Rate

What it measures: The percentage of records that represent distinct, non-duplicate entities. If the same person appears three times in your CRM — once from a web form, once from a Sales Navigator import, once from a partner list — that's a uniqueness problem.

How to calculate it:

Uniqueness Rate = (Distinct Records / Total Records) × 100

Or, expressed as a duplicate rate:

Duplicate Rate = (Total Records − Distinct Records) / Total Records × 100

Exact-match deduplication catches "john@acme.com" appearing twice, but fuzzy matching is needed to catch "John Smith at Acme Corp" vs. "J. Smith at Acme Inc." Most CRM platforms offer deduplication tools that handle fuzzy matching on name, email, and domain combinations.

Benchmark: Aim for a duplicate rate below 5%. High-performing teams keep it under 2%. If your duplicate rate is above 10%, you likely have a systemic issue with how data enters your CRM — multiple import sources without deduplication rules, web forms that don't check for existing records, or manual entry without validation.

7. Enrichment Coverage Rate

What it measures: Of all records that could be enriched (missing key fields), how many were successfully enriched? This is especially relevant for B2B teams that use third-party data providers to fill gaps in contact information.

How to calculate it:

Enrichment Coverage = (Records Successfully Enriched / Records Submitted for Enrichment) × 100

If you submit 1,000 records for email enrichment and get emails back for 650, your enrichment coverage is 65%. From a single provider, that might be expected — but it means 350 records are still unreachable.

Benchmark: Single-vendor enrichment typically yields 40-60% coverage. Multi-source enrichment can push this to 80%+. If coverage drops over time, your provider's data may be going stale.

How to Set Benchmarks That Actually Work

A metric without a benchmark is just a number. Here's how to set targets that are realistic and useful.

First, measure your baseline. Run the formulas above against your CRM data and document the results. Most B2B databases score lower than teams expect on completeness and uniqueness.

Then, set targets by impact — not perfectionism. Don't aim for 100% across the board. The cost of going from 95% to 100% is usually higher than going from 70% to 95%. Instead, set thresholds based on what breaks: email accuracy below 95% damages sender reputation. Phone completeness below 50% means SDRs can only reach half their list. Duplicate rates above 10% mean your pipeline reports are fiction.

Use a tiered approach. Not all data is equally important:

  • Tier 1 (revenue-critical): Active pipeline contacts, customer records. Highest thresholds — accuracy ≥ 98%, completeness ≥ 95%.

  • Tier 2 (operations): Marketing lists, lead databases. Moderate thresholds — accuracy ≥ 95%, completeness ≥ 85%.

  • Tier 3 (nice-to-have): Historical records, archived leads. Monitor trends, but don't invest heavily in cleanup.

Building a Data Quality Scorecard

A scorecard is a single dashboard that rolls up your metrics, shows where you stand against benchmarks, and makes it obvious what needs attention. If it sounds simple, that's the point. The best scorecards are the ones people actually look at every week.

For each metric, show the current value, target, gap, status (green/yellow/red), owner, and trend direction. Here's a template:

Metric

Current

Target

Gap

Status

Owner

Trend

Email Accuracy (bounce rate)

96.5%

≥ 98%

-1.5%

Yellow

RevOps Lead

Email Completeness

87%

≥ 95%

-8%

Red

RevOps Lead

Phone Completeness

52%

≥ 65%

-13%

Red

RevOps Lead

Email Validity

99.1%

≥ 98%

+1.1%

Green

Marketing Ops

Duplicate Rate

7.2%

≤ 5%

-2.2%

Yellow

CRM Admin

Record Freshness (≤ 90 days)

68%

≥ 80%

-12%

Red

RevOps Lead

Enrichment Coverage

61%

≥ 75%

-14%

Red

Data Ops

A scorecard like this takes five minutes to read and tells you exactly where to focus. The red items are your priorities. The yellow items are worth watching. The green items are working.

Where to put it: A scorecard that lives in a forgotten spreadsheet is useless. Pin it in your CRM dashboard, share it in weekly standups, or post a summary in Slack each Monday. If the scorecard isn't visible, it doesn't exist.

Reporting Data Quality Metrics to Leadership

Leadership doesn't care about email validity rates. They care about pipeline, revenue, and efficiency. Your job is to translate data quality metrics into business language.

Instead of saying "our completeness rate improved from 74% to 89%," say: "We found verified phone numbers for 1,500 additional leads this quarter, giving SDRs 1,500 more prospects to call." Or: "Deduplication removed 3,200 duplicates, so pipeline reporting is now based on real accounts — not inflated counts."

Leadership thinks in pipeline created, deals closed, and dollars saved. Map every metric to those outcomes.

Show trends, not snapshots. A single reading is a data point. A three-month trend is a story. An improving trend proves your investment is working. A declining trend is an early warning.

Quantify the cost of inaction. If 20% of dials hit wrong numbers and an SDR makes 60 calls a day, that's 12 wasted dials daily — multiply by team size and hourly cost. If 8% of your pipeline is duplicated, your $5M forecast is really $4.6M. Those aren't rounding errors — they're misallocated resources.

Common Pitfalls When Measuring Data Quality

Teams that start measuring data quality often fall into the same traps. Here's what to watch for.

Measuring Everything, Fixing Nothing

If you track 30 metrics but nobody acts on them, you don't have a data quality program — you have a reporting hobby. Start with five to seven metrics tied to your biggest business pain points. Add more only after you've proven you can improve the first batch.

Setting Unrealistic Targets

Aiming for 100% accuracy on every field sounds principled but it's not practical. Some fields are inherently harder to verify (job titles change without notice), and the last few percentage points cost disproportionately more to fix. Set targets based on business impact: what breaks at each threshold?

Treating Measurement as a One-Time Audit

"We cleaned the CRM last quarter" is not a strategy. Data decays continuously — metrics need to run on a cadence, not as a one-off project. (See our data quality assessment guide for setting up ongoing assessments.)

No Clear Ownership

Every metric needs an owner — a real person, not a committee. In most B2B organizations, this falls to RevOps or a CRM administrator. Without a named owner, quality degrades by default.

Ignoring the Source of Bad Data

Cleaning records after they enter your system is treating symptoms. If bad data keeps arriving — from bulk imports without validation, web forms without required fields, or stale vendor feeds — you'll be cleaning forever. Fix the input, not just the output. Add validation rules at every entry point.

Connecting Data Quality Metrics to Revenue

Data quality metrics aren't operational busywork. For B2B teams, they directly connect to pipeline and revenue. Here's how.

  • Email accuracy → Deliverability → Pipeline. High bounce rates damage sender reputation, pushing even valid emails into spam. Fewer opens, fewer replies, fewer meetings booked.

  • Completeness → Reachability → Conversations. If 40% of leads are missing phone numbers, your SDRs can only phone-prospect 60% of their list. Phone is often the highest-converting B2B channel — every missing number is a conversation that never happens.

  • Freshness → Relevance → Reply rates. Stale data means reps reference outdated titles, email dead addresses, and call people who've moved on. Fresh data means personalized, relevant outreach that converts.

  • Uniqueness → Forecast accuracy → Resource allocation. Duplicates inflate pipeline counts and lead to reps working the same account without knowing it. Clean data gives you a real pipeline view.

These metrics don't operate in isolation. Poor data quality compounds — an inaccurate, incomplete, stale record fails at every stage of the funnel, not just one. Improving quality across multiple dimensions has a multiplying effect on outbound performance and revenue predictability.

Putting Metrics Into Practice

Data quality metrics only matter if they drive action. Here's a quick-start plan:

  1. Pick your top 5 metrics. Start with accuracy (bounce rate), completeness (email + phone), validity (format checks), duplicate rate, and freshness.

  2. Measure your baseline. Run each formula against your CRM data today. Document the results — they're your starting point, not your report card.

  3. Set realistic targets. Use the benchmarks in this guide, then adjust for your specific business.

  4. Assign owners. Every metric needs a name next to it. No name, no accountability, no improvement.

  5. Build the scorecard. A single spreadsheet or CRM dashboard with the seven columns from the template above.

  6. Set a cadence. Check bounce rates and connection rates weekly. Run the full scorecard monthly. Do a deeper spot-check audit quarterly. (Our data quality assessment guide walks through the full quarterly process.)

This isn't a six-month governance project. You can have a working scorecard by end of week. The teams that improve data quality are the ones that start measuring it consistently — even if the first measurements aren't pretty.

If completeness is your biggest gap — especially for phone numbers and emails — it's worth evaluating enrichment tools that aggregate multiple data sources. Platforms that cross-reference 15 or 20 providers in a single lookup tend to find significantly more contact data than any individual source. FullEnrich takes this approach with waterfall enrichment, typically achieving 80%+ find rates compared to single-vendor alternatives.

Find

Emails

and

Phone

Numbers

of Your Prospects

Company & Contact Enrichment

20+ providers

20+

Verified Phones & Emails

GDPR & CCPA Aligned

50 Free Leads

Reach

prospects

you couldn't reach before

Find emails & phone numbers of your prospects using 15+ data sources.

Don't choose a B2B data vendor. Choose them all.

Direct Phone numbers

Work Emails

Trusted by thousands of the fastest-growing agencies and B2B companies:

Reach

prospects

you couldn't reach before

Find emails & phone numbers of your prospects using 15+ data sources. Don't choose a B2B data vendor. Choose them all.

Direct Phone numbers

Work Emails

Trusted by thousands of the fastest-growing agencies and B2B companies: