Cross-list deduplication means finding duplicate records across two or more lists.
It is useful when you have several CSV files, CRM exports, campaign lists, or scraped datasets and need to know which records overlap.
Common use cases
Cross-list deduplication helps when you need to:
- Remove contacts from a new campaign list if they already exist in your CRM
- Compare a new scraped list with an old scraped list
- Deduplicate Pipedrive People and Organizations after export
- Keep only records that appear in a master list
- Remove existing customers from a prospecting list
🔍 Example
If Q1 leads and Q2 leads share the same company domain, cross-list deduplication can remove Q1 accounts from the Q2 import.
How cross-list deduplication works
Each list can have different column names. Before matching, map the columns that describe the same value.
Example:
- List A has
Company Website - List B has
Domain
Both can be compared as company domains.
Then choose a matching method. Use exact matching for stable identifiers such as email, domain, and LinkedIn URL. Use smart or fuzzy matching for names.
Datablist workflow
Datablist can check duplicates across several collections and then clean one collection based on another.
You can:
- Remove duplicates from a selected collection
- Keep duplicates only in a selected collection
- Match different column names across collections
- Use URL, email, text, and company-name processors
See the deduplicate multiple files guide, the CRM cleanup guide, and the Pipedrive duplicate merge guide.