Cleaning and enriching CRM data is one of the highest-reward tasks you can do, and here's why:

  • When you enrich a CRM, sales reps work more efficiently and book more meetings.
  • When you clean up a CRM, the marketing team has a better overview to run better campaigns.
  • When you structure CRM data, RevOps & GTM teams see better what's working and what's not.

Obviously, there are 99+ more reasons why everyone should have a clean CRM, but it would take too long to name them all, so I'll just tell you how to clean and enrich a CRM.

Let’s begin!

Outline of this CRM Data Cleaning guide

This guide’s broken down into four main sections, each one super important for keeping your CRM data clean and up-to-date. Before we dive in, here’s a quick look at what we’ll cover and the CRM cleanup workflows I’ve included:

First Section: Deduplication

  1. Deduplicating contacts and accounts if all attributes are identical
  2. Deduplicating CRM records with different attributes
  3. How to deduplicate CRM records if they are in separate lists

Second Section: Data Validation

  1. Validating domain and email status
  2. Verifying if an email is deliverable or not
  3. How to verify if a person still works at their company

Third Section: Data Structuring

  1. Splitting first, middle and last names
  2. Extracting domains from emails
  3. Formatting phone numbers based on country in bulk

Fourth Section: CRM Enrichment

  1. Find websites from company names
  2. Enriching and updating account details
  3. How to find company information that doesn't appear in databases
  4. Updating contact information with fresh LinkedIn data
  5. How to find verified email addresses
  6. How to get verified phone numbers (no landlines)

In other words, we will deduplicate all data, then verify if the remaining data is still valid, then structure it and extract and after that we enrich with new data.

Feel free to jump to the part that's relevant for you, or follow along with me.

❗️ Keep Your Record ID’s

I haven't shown the Record IDs in this example (such as contact ID, note ID, account ID, etc.), but you should keep them; otherwise, you won't be able to import and map your data back into the CRM.

How to Deduplicate CRM Contacts and Accounts

When you want to deduplicate the contacts and accounts in your CRM, you'll have 3 methods to do it in Datablist:

A list can be an account list, a contacts list, or a deals list — regardless of the content, the process stays the same.

Method 1: Deduplicating One List Using All Columns

Sometimes when you scrape contacts from LinkedIn or other sources multiple times, the same contacts get scraped again. In this case, the best and quickest way to clean this up is to deduplicate your contacts using all columns. Here’s how to do it:

Step 1 to Deduplicate a List Using All Columns

First, sign up for Datablist.com

Datablist home page
Datablist home page

Second, import you list as a CSV or Excel.

Datablist start page
Datablist start page

Step 2 to Deduplicate a List Using All Columns

Click on Clean and select the Duplicates Finder.

Datablist collection for contact deduplication
Datablist collection for contact deduplication

Then, click on the switch next to All Properties.

Datablist’s property configuration for deduplication
Datablist’s property configuration for deduplication

Now, click on Next.

Datablist’s settings review for deduplication
Datablist’s settings review for deduplication

Click on Run duplicates check.

Datablist’s algorithm settings for deduplication
Datablist’s algorithm settings for deduplication

Now, you can review the results and click on Auto-merge duplicates when possible.

Datablist’s merging preview for deduplication
Datablist’s merging preview for deduplication

That's how easy deduplicating CRM contacts can be!

Datablist’s success screen after deduplication
Datablist’s success screen after deduplication

Now, let's explore the second method for deduplicating CRM contacts.

Method 2: Deduplicating a List Using Specific Columns

This is the best method when you want to deduplicate a list where you have the same contact with slight variations.

For example, you have the same contact twice with the same emails and names, but the notes are not the same — then you'll have to deduplicate only on two or three specific columns.

We call this a unique identifier, and you can have multiple of them.

Step 1 to Deduplicating a List Using Specific Columns

First, sign up for Datablist.com.

Datablist home page
Datablist home page

Second, import you list as a CSV or Excel.

Datablist start page
Datablist start page

Step 2 to Deduplicate a List Using Specific Columns

Click on Clean and select the Duplicates Finder.

Datablist collection for column-based deduplication
Datablist collection for column-based deduplication

Then, check the columns you want to deduplicate on. In my case, I want to compare the contacts based on the Company Domain and First Name columns.

Column selection for column-based deduplication
Column selection for column-based deduplication

When you select your columns, make sure you select the right processor as well.

Leave the Algorithm option on the default setting unless you have IDs, URLs, or other properties that need exact matches.

For domain, select URL, for company names, select Company Name, and so on.

Most importantly, always align the processor with the content of your column that you want to deduplicate on.

Once you've done that, click on Run duplicates check.

Processor configuration for column-based deduplication
Processor configuration for column-based deduplication

Step 3 to Deduplicate a List Using Specific Columns

Since the notes aren't identical, you'll have a conflict that you can resolve by either:

  • Dropping the values of the conflicting items
  • Combining conflicting properties
Datablist’s duplicates preview and settings
Datablist’s duplicates preview and settings

Now, click on Combine conflicting properties select the column with the conflicts, use Line break as a separator, then click on Refresh Merging Preview

Conflict handling settings of column-based deduplication
Conflict handling settings of column-based deduplication

Now, you can preview how Datablist will combine the two notes before deleting the duplicates. Once you've done that, click on Auto-merge when possible.

Merging preview of Datablist’s column-based deduplication
Merging preview of Datablist’s column-based deduplication

This is how you duplicate-free list will look.

Results of Datablist’s column-based deduplication
Results of Datablist’s column-based deduplication

👉 Read our articles about Mac/iOS Contacts deduplication, Multi-Values Column deduplication, matching similar Company Names in a list, Merging Pipedrive duplicates to learn more.

Now, let's move on to the third method of how to deduplicate CRM records across multiple lists.

Method 3: Deduplicating Across Multiple Lists

Here’s a practical example of deduplicating across lists:

Goal: Run an ABM campaign targeting only new accounts (not engaged in Q1)

Process:

  • Compare two lists: Q1 accounts and new accounts
  • Remove Q1-engaged accounts from the new list

Result: You'll have a clean list containing only truly new accounts for your campaign.

Let’s start!

Step 1 to Deduplicate Across Multiple Lists

First, Sign up for Datablist.com

Datablist home page
Datablist home page

Second, create a folder by clicking on the folder icon in the sidebar.

Datablist start page, folder creation
Datablist start page, folder creation

Now, create a collection within this folder by clicking on the three dots and selecting New collection.

Datablist start page, file creation inside folder
Datablist start page, file creation inside folder

This is what it will look like when you create a new collection. Now, upload your first list to Datablist and repeat this process a second time to upload your second list.

Datablist collection, file upload
Datablist collection, file upload

Step 2 to Deduplicating Across Multiple Lists

Once you've uploaded both lists, make sure to go to your Q2 list (or whatever the newer version is for you).

Now click on Clean and select the Duplicates finder.

Datablist collection for multi-collection deduplication
Datablist collection for multi-collection deduplication

Now, click on the switch to the left of Check deduplicates across several collections?.

Datablist’s deduplication suite
Datablist’s deduplication suite

Select the collection you want to compare to — in my case, it is "Q1 ABM".

File selection of multi-collection deduplication
File selection of multi-collection deduplication

Now, select the property you want to compare on and click on Next. I would suggest to always take domains when you match accounts, and emails or LinkedIn profiles when you deduplicate contacts.

Identifier selection of multi-collection deduplication
Identifier selection of multi-collection deduplication

If you chose the company domain or any other link as a unique identifier, then select URL as the processor and click on Run duplicates check.

Processor selection of multi-collection deduplication
Processor selection of multi-collection deduplication

Now you'll get a preview of the duplicates that both collections share.

Click on the field below Auto cleaning rule and select one of the options — currently you have only one option, which is Remove duplicate items from collection X.

With three or more collections in comparison, you would also have a second option, which is Keep duplicate items only in collection X.

Cleaning rule configuration of multi-collection deduplication
Cleaning rule configuration of multi-collection deduplication

Now you'll have the option to choose from which collection to delete the duplicates (make sure you always delete the old data from the new data).

Then click on Click here to process duplicated items.

Deletion file selection of multi-collection deduplication
Deletion file selection of multi-collection deduplication

📘 How to Get The Best Results

Delete Q1 contacts from Q2 list (NOT the other way around). For example:

✅ Q1 list: 100 contacts → Keep all

✅ Q2 list: 200 contacts → Remove duplicates from Q1 → You end up with 100 unique contacts

❌ Q1 list: 100 contacts → Remove duplicates from Q2

❌ Q2 list: 150 contacts → Keep all

This ensures you:

  • Keep your historical data intact in Q1
  • Remove already-contacted accounts from your Q2 campaign
  • Avoid accidentally targeting the same accounts twice

Now I have deleted 5236 duplicated accounts from my Q2 list and ended up with 3152 unique accounts

Results of multi-collection deduplication in Datablist
Results of multi-collection deduplication in Datablist

👉 To learn more on deduping across datasets, read our "How to Deduplicate Across Several Excel Files" guide.

That’s it with deduplication now let’s talk about validating CRM data.

How to Check if Your CRM Data is Up-To-Date

Checking if CRM data is still up-to-date is a process that most people forget about when cleaning CRMs because they assume that the data gathered by their marketing or sales team will always remain valid once it's marked as valid, but this is a massive mistake.

In this section of this guide I’ll show you:

Let’s start!

How to Check if an Email is Valid or Not for Free

If you've ever collected email addresses through a form, lead magnet, or free offer, you know many people register with non-business email addresses.

If that's you, then use this feature as a first screening step before investing in more comprehensive deliverability checks when you want to send an email campaign.

Understand: Valid Email vs. Deliverable Email

This feature doesn't tell you if the email is deliverable; it tells you only if the domain behind the email is a domain that can receive emails by checking its MX records.

Quick Example:

The email habibi@datablist.com doesn't exist, but the domain has valid MX records, making it valid even without an actual inbox.

The email habib@datablist.ai doesn't exist and the domain lacks valid MX records, making it invalid without an inbox.

The email habib@datablist.com exists and has valid MX records, making it both valid and deliverable.

TL;DR

Email validation = Domain level

Email deliverability = Inbox level

Validated email = Domain can receive messages but email address can still be wrong

Deliverable email = Has an inbox that receives messages

Not every valid email can receive messages.

Step 1 to Check if an Email is Valid or Not for Free

First, sign up for Datablist.com

Datablist home page
Datablist home page

Once you sign up, import your list with your CRM accounts/contacts.

Datablist start page
Datablist start page

Step 2 to Check if an Email is Valid or Not for Free

Now we will use a free feature of Datablist which allows us to get the MX provider responsible for accepting email messages. If a domain doesn't have valid MX Records, it cannot receive emails, which makes it automatically undeliverable.

Click on Enrich.

Datablist collection for email validation
Datablist collection for email validation

Go to People and select Free Email Address Validation.

Datablist people enrichment listing, free email validation
Datablist people enrichment listing, free email validation

Map the field with your emails as input property, and click on Continue to output configuration.

Input configuration of Datablist’s free validation
Input configuration of Datablist’s free validation

Click on the plus icons to create new output properties, then click on Instant Run.

Output column configuration of Datablist’s free email validation
Output column configuration of Datablist’s free email validation

This are the results you’ll get after running Datablist’s Free Email Validator

Results of Datablist’s free email validation
Results of Datablist’s free email validation

What the results will tell you:

  • If the email is from a business or not
  • If the email can receive messages or not (regardless if it's a business or personal email)
  • Which provider handles the MX service for this domain. Here's why this is important: When sending cold emails, you don't want to send to emails with certain providers — especially Microsoft — since your email accounts will be harmed

👉 Check our Free Email List Validation Guide to learn more.

Now that we've checked if those emails are valid, let's check if they're can receive messages or not.

How to Check if an Email Can Receive Messages or Not

If you followed the previous workflow: Continue by filtering the valid emails from your results — invalid emails are automatically not deliverable — I'll show you how.

If you're just starting here you have 2 options:

  • Go back and run the previous workflow.
  • Sign up for Datablist, import your list, and skip to the second step.

Step 1 to Check if an Email Can Receive Messages or Not

Click on Valid Email and select Filter on property.

Datablist collection, opened column header
Datablist collection, opened column header

Make sure this checkbox is filler and click on Apply.

Datablist collection, filter pop-up
Datablist collection, filter pop-up

💡 Tip For B2B Companies

If you work only in B2B, then filter on the column "Business Email" instead of "Valid Email." We are showing a broader example since we have a lot of B2C businesses also using Datablist

Step 2 to Check if an Email Can Receive Messages or Not

Click on Enrich.

Datablist collection for verifying email addresses
Datablist collection for verifying email addresses

Go to People and select Waterfall Advanced Email Address Verification.

Datablist’s people enrichment listing, Email verifier
Datablist’s people enrichment listing, Email verifier

Map your email column to the input property and click on Continue to output configuration.

Input configuration of Datablist’s email verification
Input configuration of Datablist’s email verification

Click on the plus icon to create a new property for "Email Status" and “Role Account” only and click on Instant Run.

Why only these properties: The other properties will only create more complexity, and you don't need them if you're only cleaning up your CRM.

Output column configuration of Datablist’s email verification
Output column configuration of Datablist’s email verification

These are the results you'll get:

Results of Datablist’s email verification
Results of Datablist’s email verification

Valid: There's an inbox behind these emails that can receive messages.

Risky emails fall into two categories:

  • We couldn't verify the email address at the time of the verification.
  • It's a catch-all email, which means the email server is set to accept all mail even if the email account doesn't exist.

Invalid: This email account doesn't exist (don't use them).

How To Check if a Person Still Works in Their Company (In Bulk)

Maybe you've already noticed that people tend to change their jobs more often nowadays, and maybe you're getting more and more notifications that say "This person doesn't work at XYZ anymore" when you send someone an email.

If this is the case, don't worry, I'll show you how to check if a person still works at their company.

But first, let's clarify what you need:

  • The LinkedIn profile of the person (non-negotiable)
  • The email or domain of the company where person is supposed to work

What we are going to do

  1. Scrape the person’s LinkedIn profile to find out in which company they are working in
  2. Find the domain of the company they work in
  3. Letting AI compare the 2 domains

Note: This works also with the company LinkedIn page of the person.

Step 1 to Check if a Person Still Works in Their Company

Sign up for Datablist.com.

Datablist home page
Datablist home page

Import your list into Datablist.

Datablist start page
Datablist start page

Step 2 to Check if a Person Still Works in Their Company

In this step, we are going to scrape the person's LinkedIn profile.

Click on Enrich.

Datablist collection for scraping LinkedIn profiles
Datablist collection for scraping LinkedIn profiles

Go to People and select LinkedIn People Profile Scraper.

Datablist’s people enrichment listing, LinkedIn profile scraper
Datablist’s people enrichment listing, LinkedIn profile scraper

Map the column containing your LinkedIn profile URLs as input property, and click on Continue to output configuration.

Input configuration of Datablist’s LinkedIn profile scraper
Input configuration of Datablist’s LinkedIn profile scraper

Create new columns by clicking on the plus icons for the following outputs: Company Name, Company page URL, and Company website and click on Instant Run.

You can create new columns for the other outputs as well, but you don't need them for this workflow — only if you need other LinkedIn data to so you don't want to pay twice.

Output columns configuration of Datablist’s LinkedIn profile scraper
Output columns configuration of Datablist’s LinkedIn profile scraper

These are the results we got from scraping the LinkedIn profiles. They are good, but we need to find the domains for the remaining records — that's what the next step is about.

Results of Datablist’s LinkedIn profile scraper
Results of Datablist’s LinkedIn profile scraper

Step 3 to Check if a Person Still Works in Their Company

Click on Enrich.

Datablist collection for finding domains
Datablist collection for finding domains

Go to Companies and select Company Domain/Website and LinkedIn Company Page Matcher.

Datablist’s enrichment listing, LinkedIn page to domain
Datablist’s enrichment listing, LinkedIn page to domain

Select Get the company website from the LinkedIn page URL as Matching Type.

Then, map the LinkedIn page URL as input property.

Click on Continue to output configuration.

Settings of Datablist’s LinkedIn page to domain enrichment
Settings of Datablist’s LinkedIn page to domain enrichment

Map the “Company website” output to the new created company website column.

Please note: Don't map it to the column where the old domains are stored!!.

Click on Instant Run.

Output column configuration of Datablist’s LinkedIn page to domain enrichment
Output column configuration of Datablist’s LinkedIn page to domain enrichment

Now you'll have the following options:

  • Run in Async: With this box checked, you'll be running the enrichment in the cloud, which allows you to do other tasks in the meantime if you have a large list.
  • Test on the first 10 items: Do this if you want to get a feel for the enrichments.

Select number of items to process: This allows you to run your enrichment only on the first 10, 100, or a custom amount of items.

Existing data rule: This tells Datablist how to deal with the existing data in your column — select the second option, ****which will Update only the empty cells.

Once you’ve done this, click on Run enrichment on all items

Run settings of LinkedIn page to domain enrichment
Run settings of LinkedIn page to domain enrichment

Step 4 to Check if a Person Still Works in Their Company

Now that we've got the old and new website domains, we're going to compare them to see if they're still the same to verify if the person has changed companies or not.

Click on Enrich.

Datablist collection for employment status check
Datablist collection for employment status check

Go to AI and select Ask ChatGPT/OpenAI.

Datablist’s AI enrichment listing, Ask ChatGPT
Datablist’s AI enrichment listing, Ask ChatGPT

Check the box to use Datablist credits for your enrichment or provide your OpenAI API key, then click on Use template.

Settings of Datablist’s ChatGPT enrichment
Settings of Datablist’s ChatGPT enrichment

Scroll down and select Check if prospect is still working at a company.

Datablist’s employment status check template
Datablist’s employment status check template

Now map the columns in your collection to the template. Here's how:

Use "/" to see a list of the columns in your collection and map the old domain to the first field.

Do the same with the new domain for the second field.

Then, click on Continue to output configuration.

Input column mapping for employment status check AI prompt
Input column mapping for employment status check AI prompt

Click on the plus icon to create a new column for the output, and click on Instant Run.

Output column configuration of employment status check
Output column configuration of employment status check

This is how it looks when you’ve checked if a person still works in their company:

Results of employment status check
Results of employment status check

That’s how you verify if your CRM data is still up-to-date!

You could also score your accounts with AI.

How to Structure and Format Your CRM Data

Structuring and formatting you data is crucial when cleaning your CRM since it will determine later how the sales and marketing teams work with it, to bring all your CRM data in a clean format I’ll show you:

Bringing your CRM data into a unified format is only half the battle — the key to a well-maintained CRM lies in keeping the data clean and structured. Here are some techniques that will help you to do so:

  • Defining standardized input formats—for example, creating a consistent framework for taking call notes
  • Limiting certain columns to specific input types (e.g., making the phone number column a "numbers only" attribute)
  • Creating mandatory field requirements - Example: Making it required for sales reps to fill in specific fields like "Last Contact Date" to ensure data completeness

How to Split First and Last Names

One of the biggest problems when cleaning a CRM is that often first and last names are written in one column, which is a big problem as this leads to a lead or prospect receiving an email with their full name which can be weird sometimes.

Step 1 to Split First and Last Names

Sign up for Datablist.com.

Datablist home page
Datablist home page

Import a CSV or Excel file containing the names of your CRM contacts.

Datablist start page
Datablist start page

Step 2 to Split First and Last Names

Click on Enrich.

Datablist collection for splitting first, middle and last names
Datablist collection for splitting first, middle and last names

Go to People and select Name Parser.

Datablist’s people enrichment listing, Name Parser
Datablist’s people enrichment listing, Name Parser

Now, map the column containing the name as input property and click Continue to outputs configuration.

Input configuration of Datablist’s Name Parser
Input configuration of Datablist’s Name Parser

Here's a list of the outputs you can get:

  • First names
  • Middle names
  • Last names
  • Gender
  • Title
  • Origin country of the name

Click on the plus (+) icons to create a new column for each output you need and click Instant Run.

Output column configuration of Datablist’s Name Parser
Output column configuration of Datablist’s Name Parser

This is how it looks when you split first and last names in Datablist.

Results of Datablist’s Name Parser
Results of Datablist’s Name Parser

How to Extract Domains from Emails

Step 1 to Extract Domains from Emails

Sign up for Datablist.

Datablist home page
Datablist home page

Import a CSV or Excel file.

Datablist start page
Datablist start page

Click on Extract and select Extract domains for email addresses or URLs.

Datablist’s extractor listing, domain extractor
Datablist’s extractor listing, domain extractor

Map the column with your emails as input property and click on Preview extraction.

Input and output configuration of Datablist’s domain extractor
Input and output configuration of Datablist’s domain extractor

Now, you’ll get a preview of the 10 first rows. Click on Extract data once you approved it.

Preview of results of Datablist’s domain extractor
Preview of results of Datablist’s domain extractor

This is how it looks when you extract domains from email addresses with Datablist:

Results of Datablist’s domain extractor
Results of Datablist’s domain extractor

How to Format Phone Numbers for CRM Cleaning

Phone number formatting is one of my favorite CRM cleaning workflows. Here's why Datablist is particularly valuable for this task:

  • Handles phone numbers from any country
  • Works with both:
    • International format (+XX)
    • Local formats
  • Can process multiple countries' phone numbers in a single file

For this example I have a file containing:

  • US phone numbers
  • Indonesian phone numbers
  • German phone numbers
  • Algerian phone numbers

And I’ll format them all in one go.

Important to Know

To format phone numbers from different countries in a single file, you must include a "Country" column that tells Datablist where each phone number is from

Let’s start!

Step 1 to Format Phone Numbers for CRM Cleaning

Sign up for Datablist.com.

Datablist home page
Datablist home page

Import a CSV or Excel.

Datablist start page
Datablist start page

Click on Enrich.

Datablist collection for formatting phone numbers
Datablist collection for formatting phone numbers

Go to AI and select Phone Number Extractor

Datablist AI enrichment listing, phone number formatter
Datablist AI enrichment listing, phone number formatter

Select the origin countries of the phone numbers in your collection and check the box labeled Advanced Settings

Settings of Datablist’s phone number formatter
Settings of Datablist’s phone number formatter

Now, check the box to the left of Define country per Item to enable the setting that lets you add a country input.

Advanced settings of Datablist’s phone number formatter
Advanced settings of Datablist’s phone number formatter

💡 Quick Tip

Enable the "Add phone number type" option only when you process phone numbers of a single country, since it's less accurate when you do it on multiple countries

Now, map your columns with the phone numbers and countries as input properties, then click on Continue to outputs configuration.

Input configuration of Datablist’s phone number formatter
Input configuration of Datablist’s phone number formatter

Click on the plus icon to add a new column for the formatted phone numbers and click on Instant Run.

Output column configuration of Datablist’s phone number formatter
Output column configuration of Datablist’s phone number formatter

Now you'll see the Run Settings which allow you to:

  • Run in Async (especially good for large lists)
  • Test on the first 10 items
  • Select the number of items to process (10, 100, or custom)

Once you’ve configured them click on Run enrichment on all items.

Run settings of Datablist’s phone number formatter
Run settings of Datablist’s phone number formatter

These are the phone numbers we just formatted:

Results of Datablist’s phone number formatter
Results of Datablist’s phone number formatter

How to Update Your CRM Data

Keeping your CRM data current is crucial for business success. This are the things you need to know to update your CRM effectively:

Let’s go!

How to Find Company Domains From Company Names

This is probably one of the most used company enrichments we have, and honestly, without a domain, you can't do anything — that's why I am showing this first (and also because it just makes sense).

Step 1 of How to Find Company Domains From Company Names

Sign up for Datablist.com.

Datablist home page
Datablist home page

Upload a list with company names.

Datablist start page
Datablist start page

Step 2 of How to Find Company Domains From Company Names

Click on Enrich

Datablist collection for domain enrichment
Datablist collection for domain enrichment

Go to URLs and select Find company domains from company names

Datablist’s URLs enrichment listing, Domain Finder
Datablist’s URLs enrichment listing, Domain Finder

Search Settings Explained 🔍

There are two ways to search:

  • Default Option: Companies Dataset + Google Fallback

    Uses database first (1 credit if found) → falls back to Google if needed (2.5 credits)

  • Cheapest Option: Use only Companies Dataset

    Only searches database (1 credit)

These settings help control how company domains are found:

  • Target Country: Limit search to one country for better results
  • Accept non-root websites: This allows finding company websites that are a domain + a path (like platform.com/company) instead of only main domains (company.com)
  • Skip following domains: This helps filter out results from directory websites like Crunchbase or Northdata, that aren't actual company websites

For maximum coverage, leave the default search option enabled, select a country, and leave everything else blank.

Settings of Datablist’s Domain Finder
Settings of Datablist’s Domain Finder

Once you've configured your search, scroll down to map your column with the company names as input property and click on Continue to outputs configuration

Input configuration of Datablist’s Domain Finder
Input configuration of Datablist’s Domain Finder

Now you'll be able to create a column for the Company URL", "Company Domain," or both by clicking on the plus icons. I usually go with the company domain only. Once you've done that, click on Instant Run

Output column configuration of Datablist’s Domain Finder
Output column configuration of Datablist’s Domain Finder

Now you'll see the Run Settings which allow you to:

  • Run in Async (especially good for large lists)
  • Select the number of items to process (10, 100, or custom)

Once you’ve configured them click on Run enrichment on all items

Run settings of Datablist’s Domain Finder
Run settings of Datablist’s Domain Finder

This is how your collection will look after using the company name to domain enrichment (see costs below)

Results of Datablist’s Domain Finder
Results of Datablist’s Domain Finder

I paid 33 credits for finding 16 domains, which is 2.06 credits per domain. If you have a list with 1,000 domains, you would invest only ≈ 2,062 credits = $2.03

Datablist enrichment log, Domain Finder
Datablist enrichment log, Domain Finder

👉 Read our guide on finding company websites from company names to learn more.

How to Scrape Business Information and Company Details

In this section, we will focus on finding simple firmographic data that shows us if it's worth continuing the research and enrichment on certain accounts.

When I say firmographic data, I mean finding the data usually shown on LinkedIn, such as:

  • Headcount
  • Company Name
  • Website
  • Company headquarter
  • Specialities
  • Industry
  • Description
  • Country
  • Region
  • Founding year
  • LinkedIn URL
  • Sales Navigator ID
  • Followers count
  • Slogan

Let’s start!

Step 1 of How to Scrape Business Information and Company Details

Sign up for Datablist.com.

Datablist home page
Datablist home page

Import a list with company domains or LinkedIn URLs of the companies in your CRM.

Datablist start page
Datablist start page

Step 2 of How to Scrape Business Information and Company Details

Click on Enrich.

Datablist collection for account enrichment
Datablist collection for account enrichment

Go to Companies and select Company Enrichment.

Datablist’s companies enrichment listing, Company Enrichment
Datablist’s companies enrichment listing, Company Enrichment

There are two key settings to understand:

1. Data Source Options: The data used to start the enrichment

  • Company domain (default)
  • LinkedIn URL

2. Data Return Options & Costs: The data returned by the enrichment

  • Basic data (1 credit): Gets you industry, location, employee count, and founding year
  • LinkedIn data (5 credits): Pulls live information directly from LinkedIn pages

I suggest using the LinkedIn URL to get live LinkedIn data. If you don't have the LinkedIn URLs, use the company domain and basic data instead.

Set up of Datablist’s Company Enrichment
Set up of Datablist’s Company Enrichment

Once you've chosen your settings, map the column with the LinkedIn URLs as input property and click on Continue to outputs configuration

Input configuration of Datablist’s Company Enrichment
Input configuration of Datablist’s Company Enrichment

Now, create a column for each data point you need by clicking on the plus icons, then click on Instant Run

Output columns configuration of Datablist’s Company Enrichment
Output columns configuration of Datablist’s Company Enrichment

Now you'll see the Run Settings which allow you to:

  • Run in Async (especially good for large lists)
  • Select the number of items to process (10, 100, or custom)

Once you’ve configured them click on Run enrichment on all items

Run settings of Datablist’s Company Enrichment
Run settings of Datablist’s Company Enrichment

Here are the results I got from scraping live company details on LinkedIn:

Results of Datablist’s Company Enrichment
Results of Datablist’s Company Enrichment

But maybe you want to find details that aren't on LinkedIn, maybe you need information that would take you hours to manually research for each company. Well, if that's you: we've got you covered, since with Datablist you can have an AI agent automating repetitive research for you.

How to Scrape Hard-to-Find Company Details

This is probably my favorite part of this guide. Wait, didn't I say this already?

Yes, I did but this part is special since I’ll show you know how you can scrape hidden company data that other traditional databases all miss using an AI research agent.

Let’s illustrate an example of a manufacturing business, they might have multiple retail branches, production facilities, and specializations that aren't available in standard databases.

Here are some examples of hidden company information data you might want to research:

  • Patents and intellectual property holdings
  • Key executives' previous work experience
  • Vendor relationships and supply chain details
  • Customer success stories and case studies
  • Research and development focus areas

These data points require deeper research across websites, press releases, industry reports, and specialized databases. The AI research agent can uncover this information by analyzing multiple sources and compiling relevant details.

In this example I’ll show you how I find out how many production facilities some businesses have.

Let’s go!

Step 1 of How to Scrape Hard-to-Find Company Details

Sign up for Datablist.com.

Datablist home page
Datablist home page

Import a list of companies (With company domains).

Datablist start page
Datablist start page

Step 2 of How to Scrape Hard-to-Find Company Details

Click on Enrich.

Datablist collection with companies for automated location research
Datablist collection with companies for automated location research

Go to AI and select the AI Agent.

Datablist’s AI enrichment listing, AI Agent
Datablist’s AI enrichment listing, AI Agent

Configure a prompt, or use my example prompt below to test the tool. Learn here how to prompt the AI agent

This is what my prompt does:

  • Searches for the number of production facilities each company has
  • Identifies companies with more than five facilities
  • Labels companies with 5+ facilities as "ICP", others as "Irrelevant"
  • Locates where these facilities are situated
  • Provides a complete list of facility locations
Prompt configuration for Datablist’s AI Research Agent
Prompt configuration for Datablist’s AI Research Agent
Scrape company details

Context: I need to know how many locations these companies have to determine if they fit in our ICP

===

What I want you to do:

  • Do a Google search about how many production facilities these companies have
  • Verify that they have more than five facilities
  • Tag those with more than five as "ICP" and the rest as "Irrelevant"
  • Do a second search where those locations are
  • Give me a complete list of all locations

===

The data points you have to look for (with examples):

Operational locations such as:

  • Production sites
  • Manufacturing facilities
  • Plants
  • R&D centers

===

Mistakes to avoid: - Don't look for any retail locations. Only for operational locations. - Do not include a list for "Irrelevant" companies.

===

Here's the an example of the perfect output:

ICP Status: ICP Locations:

  • Bonn, Germany
  • Solingen, Germany
  • Wilkau-Haßlau, Germany
  • Neuss, Germany
  • Graz, Austria
  • Pontefract, UK
  • Castleford, UK
  • Wisconsin, USA

===

Here is the name of the company: /Company

Once you've set up your prompt, scroll down to configure your desired outputs. In my case, I'll configure 2 outputs

  • ICP Status
  • Locations

For that, I'll configure my first output and click on More to create a second. You can do that for as many outputs as you want

When I am done, I check the box next to Advanced Settings.

Output format configuration for Datablist’s AI Research Agent
Output format configuration for Datablist’s AI Research Agent

The advanced settings of the AI Agent in Datablist allow me to:

  • Choose the LLM model I want to use for the task
  • Set a maximum number of iterations the agent can take
  • Use the Render HTML option to give the Agent the ability to scroll websites

Once I am done with that I click on Continue to outputs configuration.

Advanced settings of Datablist’s AI Research Agent
Advanced settings of Datablist’s AI Research Agent

Now, I’ll click on the plus icons to add new columns for each output then, I click on Instant Run

Output column configuration for Datablist’s AI Research Agent
Output column configuration for Datablist’s AI Research Agent

At this point, I can now configure the Run Settings which allow me to select the number of items to process.

After configuring this, I click on Run enrichment on all items to begin the process. The AI agent will then start researching and extracting the requested information about company locations.

Run settings of Datablist’s AI Research Agent
Run settings of Datablist’s AI Research Agent

As you can see, these results go far beyond database information, revealing gems that traditional data providers could never match.

Results of Datablist’s AI Research Agent
Results of Datablist’s AI Research Agent

📘 Only Creativity is the Limit

This example was purposely exaggerated to show the great things the AI agent can do! But you could also find out if a hospital is privately held or public, or do anything else (we will help you with prompting as well.)

How to Scrape the LinkedIn Profile of a Prospect

If you want to add more "human touch" to your prospecting, adding personalization is key, but only if you know how to distinguish between "human touch" and relevancy — and for both, you need to scrape the LinkedIn profile to do so.

The only thing you need for that is the LinkedIn profile URL

Step 1 of How to Scrape the LinkedIn Profile of a Prospect

Sign up for Datablist.com

Datablist home page
Datablist home page

Upload a list with LinkedIn URLs of your prospects

Datablist start page
Datablist start page

Step 2 of How to Scrape the LinkedIn Profile of Your Prospect

Click on Enrich

Datablist collection with LinkedIn URLs to scrape
Datablist collection with LinkedIn URLs to scrape

Go to People and select LinkedIn Profile Scraper.

Datablist’s people enrichment listing, LinkedIn Profile Scraper
Datablist’s people enrichment listing, LinkedIn Profile Scraper

Here are the key settings for LinkedIn profile scraping:

  • Cache vs Real-time: By default, uses cached data. Enable real-time scraping for fresh data (50 credits per profile)
  • Work Experience: Choose how many past jobs to return (default is 3, max is 10)
  • Datablist accepts both regular LinkedIn URLs (linkedin.com/in/xxx) and Sales Navigator URLs as input

Once you configured your settings click on Continue to outputs configuration

Input configuration for Datablist’s LinkedIn Profile Scraper
Input configuration for Datablist’s LinkedIn Profile Scraper

Now click on the plus icons to add new columns for each output you want to have. Here are the data points you can choose from:

  • LinkedIn Profile URL - Standardized profile URL (format: linkedin.com/in/xxx)
  • Basic Info - Name (first, last, full), headline, and summary
  • Location - City, state, country (code and full name)
  • Current Position - Title, company name, dates, company URL, website, industry, size, description, location
  • Work History - Previous 2 positions with titles, company names, and dates
  • Additional Info - Languages spoken and number of connections

Once you’ve done this click on Instant Run

Output columns configuration for Datablist’s LinkedIn Profile Scraper
Output columns configuration for Datablist’s LinkedIn Profile Scraper

With the Run Settings in Datablist you’re able to define if you want to:

  • Run in Async (especially good for large lists)
  • Test on the first 10 items
  • Select the number of items to process (10, 100, or custom)

After configuring these settings, ****click on Run enrichment on first X items to start finding your prospects' LinkedIn data.

Run settings of Datablist’s LinkedIn Profile Scraper
Run settings of Datablist’s LinkedIn Profile Scraper

Here are the results I got:

Results of Datablist’s LinkedIn profile scraper
Results of Datablist’s LinkedIn profile scraper

Note: I didn't scrape all data since I needed only the names and domains to find the phone numbers, but you can get way more data points using this enrichment

Now let me show you how you can enrich your CRM with valid emails!

How to Find Verified Email Addresses for Your CRM

Without verified email addresses, you're like trying to deliver mail in a city with no street names. Not very effective, right?

Here’s what you need to enrich your CRM with verified email addresses:

  • The name of the contact (required)
  • The company name or domain (required)
  • LinkedIn URL (optional)

Step 1 of How to Find Verified Email Addresses for Your CRM

Sign up for Datablist.com.

Datablist home page
Datablist home page

Import your list of contacts.

Datablist start page
Datablist start page

Step 2 of How to Find Verified Email Addresses for Your CRM

Click on Enrich.

Datablist collection with people to find the emails from
Datablist collection with people to find the emails from

Go to People and use the Waterfall Email Finder.

Here’s quick explanation of how it works:

Datablist's Waterfall Email Finder uses 15+ email providers one after another to get you the emails of your prospects, and you only pay for results. Fair, isn't it?.

Datablist’s people enrichment listing, Waterfall Email Finder
Datablist’s people enrichment listing, Waterfall Email Finder

Now you’ll be able to configure your waterfall which you can do this by:

  • Use full name instead of first and last name
  • Choosing you preferred providers with your own API keys

Once you've done this, scroll down and configure your input columns

These are all optional settings. For maximum coverage, keep default settings.

Optional settings of Datablist’s waterfall email enrichment
Optional settings of Datablist’s waterfall email enrichment

Now, map you columns as input property and click on Continue to output configuration.

Input configuration for Datablist’s waterfall enrichment
Input configuration for Datablist’s waterfall enrichment

This enrichment will give you 3 outputs:

  • Email address
  • Email address status
  • MX provider

Click on the plus icons to add one column for each output then, click on Instant Run.

Output columns configuration for Datablist’s waterfall enrichment
Output columns configuration for Datablist’s waterfall enrichment

Now you'll see the Run Settings which allow you to select the number of items to process (10, 100, or custom).

After configuring this, click on Run enrichment on all items to find the emails of your prospects.

Run settings for Datablist’s waterfall enrichment
Run settings for Datablist’s waterfall enrichment

As you can see, it found all emails where data was available. Using Datablist's Waterfall Email Finder, you can easily find thousands of email addresses to enrich your CRM with accurate contact data.

Enriched email list using Datablist's waterfall email enrichment
Enriched email list using Datablist's waterfall email enrichment

Now let me show you how you can find verified mobile phone numbers!

How To Enrich Your CRM With Verified Phone Numbers

Cold callers are hunters, and while everyone else is flooding inboxes with automated messages, they use a simple cold call to cut through the noise like a hot knife through butter.

Yes, both work, but one feels a lot more personal.

Here's how to find those valuable phone numbers:

Step 1 of How to Find Verified Phone Numbers for Your CRM

Sign up for Datablist.com.

Datablist home page
Datablist home page

Upload your list containing LinkedIn profile URLs.

Datablist start page
Datablist start page

Step 2 of How to Find Verified Phone Numbers for Your CRM

Click on Enrich.

Datablist collection with no phone numbers yet
Datablist collection with no phone numbers yet

Go to People and select the Waterfall Phone Finder

Datablist people enrichment listing, phone number enrichment
Datablist people enrichment listing, phone number enrichment

Map your column with LinkedIn profile URLs as input property and click on Continue to outputs configuration.

Input column configuration for finding phone numbers
Input column configuration for finding phone numbers

Click the plus icons to add new output columns:

  • Phone Number: Returns the full international format with country prefix (e.g., +1-555-0123)
  • Country: Provides the two-letter country code (e.g., US, GB, DE) for the phone number location

Then click Instant Run.

Output column configuration for phone number finder
Output column configuration for phone number finder

Now you'll see the Run Settings which allow you to select the number of items to process (10, 100, or custom).

After configuring this, click on Run enrichment on all items to start finding phone numbers of your prospects.

Run settings of phone number enrichment
Run settings of phone number enrichment

Datablist found almost all phone numbers for me. Give it a try!.

Results of phone number enrichment
Results of phone number enrichment

Conclusion

In today's fast-paced environment, where business relationships move at lightning speed, outdated or inaccurate CRM impacts your success rate, leads to missed opportunities and wasted resources.

Regular data cleaning and enrichment should be a priority.

By implementing these enrichment strategies using tools like Datablist, you will maintain a high-quality CRM database for better engagement and higher conversion rates.

Remember, the quality of your marketing, sales, and customer service is only as good as the data behind it. Make data enrichment a regular part of your CRM maintenance routine to stay competitive.

“How you gather, manage and use information will determine whether you win or lose.” -Bill Gates

Frequently Asked Questions About CRM Cleanups

What percentage of CRM data becomes obsolete each year?

Approximately 30% of CRM data becomes obsolete annually. This includes:

  • 15-20% of email addresses that become invalid
  • 18% of phone numbers that change
  • 21% of CEO positions that turn over
  • 25-33% of people who change jobs

How much does bad CRM data cost companies?

Bad CRM data costs businesses an average of $100 per incorrect record. For large organizations, this can amount to millions in annual losses through:

  • Wasted marketing spend
  • Lost productivity
  • Missed opportunities
  • Damaged reputation from poor customer communication

What is CRM cleaning?

CRM Cleaning is the systematic process of maintaining and improving your customer database quality. It consists of 4 main pillars:

  1. Deduplication: Removing duplicate entries and consolidating records
  2. Validation: Verifying the accuracy of existing data points like email addresses and phone numbers
  3. Structuring and Formatting: Standardizing data formats and organizing information consistently
  4. Data Enrichment: Adding new, relevant information to enhance customer profiles

Regular CRM cleaning ensures your team works with accurate, up-to-date information for better decision-making and customer engagement.

How to keep a CRM clean?

  • Defining standardized input formats—for example, creating a consistent framework for taking call notes
  • Limiting certain columns to specific input types (e.g., making the phone number column a "numbers only" attribute)
  • Creating mandatory field requirements - Example: Making it required for sales reps to fill in specific fields like "Last Contact Date" to ensure data completeness

Can ChatGPT do data cleaning?

ChatGPT is not made for data cleaning — for that, you'll have much better free solutions like Datablist. That's not to say that ChatGPT can't clean any data, but it can't deal with large files, and if you have 1k+ records, it will mess up 100%.