Cross-list deduplication means finding duplicate records across two or more lists.

It is useful when you have several CSV files, CRM exports, campaign lists, or scraped datasets and need to know which records overlap.

Common use cases

Cross-list deduplication helps when you need to:

  • Remove contacts from a new campaign list if they already exist in your CRM
  • Compare a new scraped list with an old scraped list
  • Deduplicate Pipedrive People and Organizations after export
  • Keep only records that appear in a master list
  • Remove existing customers from a prospecting list

🔍 Example

If Q1 leads and Q2 leads share the same company domain, cross-list deduplication can remove Q1 accounts from the Q2 import.

How cross-list deduplication works

Each list can have different column names. Before matching, map the columns that describe the same value.

Example:

  • List A has Company Website
  • List B has Domain

Both can be compared as company domains.

Then choose a matching method. Use exact matching for stable identifiers such as email, domain, and LinkedIn URL. Use smart or fuzzy matching for names.

Datablist workflow

Datablist can check duplicates across several collections and then clean one collection based on another.

You can:

  • Remove duplicates from a selected collection
  • Keep duplicates only in a selected collection
  • Match different column names across collections
  • Use URL, email, text, and company-name processors

See the deduplicate multiple files guide, the CRM cleanup guide, and the Pipedrive duplicate merge guide.