First Steps with Data Enrichment

Data Enrichment helps you to automatically complete and update company data as well as validate business email addresses. This article will guide you through the first steps.

step-0

Company Data Search

The company data search feature compares data you provide with public sources such as legal notices, Wikidata, Société, XING, and Google Maps in real-time. To start the process, you upload an Excel or CSV file via the user interface in the Complement company data section

The minimum required columns for the company data search are either organization and / or website columns. The more data you provide with the input file, the better will be the results. For example, if you search for a company with different locations all over the world, the system will be able to pick the right location if you additionally provide country, city, and zip columns.

Columns the system understands are: organization, phone, mobile, fax, email, website, street, street2, poBox, zip, city, country (as ISO 3166-1 alpha-2 code), state (as ISO 3166-2 code), vat, and industry. You can find an example Excel template containing all supported columns here.  

In addition, the input file can contain custom columns (such as a crm_id), which will be preserved in the result file for easy identification of the resulting rows. 

organization firstName lastName website crm_id
snapADDY GmbH Jochen Seelig https://snapaddy.com 10454
E2N GmbH     https://e2n.de/ 555214
Transporeon       456465

Table 1: Example input table for company data search

Email Finder

The email finder feature validates email addresses based on patterns of publicly available email addresses of the provided domains. 

The required columns for the email finder are firstName, lastName and website. You can find an example Excel template containing all supported columns here.

In addition, the input file can contain custom columns (such as a crm_id), which will be preserved in the result file for easy identification of the resulting rows. 

firstName lastName website
Jochen Seelig https://snapaddy.com
Simon Mohr https://e2n.de

Table 2: Example input table for email finder

Results

Depending on the number of rows to be enriched, it can take from a few minutes to a few hours until the result is available. Once your data has been processed completely, you will be notified via email that the results are ready for download. The data enrichment results can then be downloaded in the Results section and will be kept available for 30 days.

Result Columns

The result file will preserve all columns (and values) that have been included in the input file. All fields that are enriched by snapADDY will be added as additional columns and prefixed with snapaddy_. 

Company data search results will contain a column snapaddy_meta_similarity, which indicates how similar the information found on the Internet is on average to the values that were provided in the input file. The score is an aggregation of several metrics developed by snapADDY specifically for this type of semantic comparison.

Here is an example: The two company names "snapADDY" and "snapaddy GmbH (Germany)" have a similarity score of 100%, because they are semantically equivalent – although not spelled the same.

Email finder results will contain a column snapaddy_meta_emailConfidenceScore. The value between 0 and 1 indicates how often this email pattern is used for the given domain (relative to all email patterns found for the domain).

We continuously crawl the entire public Web (currently over 3 billion pages) and derive patterns for a domain from email addresses we find. Let's assume we have found the following email addresses:

j.doe@snapaddy.com
m.mustermann@snapaddy.com
j.seelig@snapaddy.com
erika.mustermann@snapaddy.com

The pattern "f.lastname" would have a confidence score of 0.75, because 3 out of 4 email addresses use this pattern. 

Available Searches per Month

The number of available searches per month is based on the purchased Data Enrichment package of your organization (see: https://www.snapaddy.com/en/data-enrichment-service.html). Available searches are reset on each calendar month and are not transferred over to the next month. Rows that are not processed due to an error will not be counted towards your available searches.