Software to search intelligently for duplicates and duplicated addresses

Good data is the foundation of good marketing. And an intelligent search for matching addresses and duplicates is the best way to achieve this. Because this is the only way to reliably avoid sending multiple communications. And it is the only way to respect the individual's wish not to receive advertising, which has become mandatory since the introduction of the GDPR. And when the individual advertising campaigns are then compared with each other, with your own customer base and other data, nothing should stand in the way of your marketing success.

Matching results

Intelligent search for duplicates (dupes)

Exact hits, where every character is matched, are not the only results found: so too are near-duplicates (fuzzy matching) and duplicated addresses. In this, the following are taken into account, in particular:

  • Typos
  • Spelling variations
  • Omissions and additions
  • Misplaced words
  • Abbreviations
  • Pet names / nicknames

Everything you need for data clean-up:

  • Search for duplicates / dupes inside a table.
  • Search for duplicates / dupes between two tables, for example, to consider blacklists or to synchronize address lists.
  • Search for duplicates by postal address (fuzzy matching), phone number, email address or any other criteria.
  • Fuzzy / error-tolerant matching can deal with company names as well as addresses of private persons.

Numerous opportunities of using the result:

  • The duplicates can be deleted in the source table. Alternatively, the cleansed data can also be written in a new file.
  • The found duplicates can be marked in the original table.
  • The result can be used to enrich data. For example, a telephone number from a second table could be transferred to the first table using the matching result.
  • The result can be processed using the functionality of any stored procedure from the database.
  • Various protocols and lists can be created.

User-friendly and cost-effective:

  • No technical knowledge required. Our products are designed so that hopefully you will never need our free support.
  • See for yourself. Test our products for one week free of charge and without any restrictions.
  • Local processing of data, no need to transfer data to an external service provider. This simplifies compliance with the General Data Protection Regulation (GDPR).
  • For service providers you pay for each project individually, but you only pay once for our software. And all this with an excellent price-performance ratio. (Prices)

Fast, flexible and safe:

  • Can also be used for large databases. Parallel, and therefore particularly fast, processing on systems with several processor cores.
  • Data source (address lists and databases): Excel, Access, MS SQL Server, Azure SQL, ORACLE, MySQL, MariaDB, IBM DB2, PostgreSQL, OpenOffice Calc, LibreOffice, dBase, CSV files and text files.
  • All the program files have a digital signature. This ensures that the files are unchanged and actually originate from us. You can easily verify this digital signature: Properties for the program file (accessible via the right mouse button) -> Digital signatures -> Details -> Show certificate -> Details -> Applicant

Other functions for quality improvement:

  • Functions for selecting and enriching data.
  • Detect gender based on first name.
  • Determine the salutation of a letter.
  • Delete selected data records.
  • Correct the postal code format.
  • Merging tables.
  • Merging and splitting data fields.
  • and much more ...

Our software products:

AI or no AI?

The algorithm that our products use to find duplicates is not based on artificial intelligence in the sense of machine learning. Instead, it uses a complex rule-based algorithm. Compared to machine learning, this has the advantage that it requires less computing power. In addition, the results of such an algorithm are reproducible. The quality of the result always remains the same. For a well-defined problem such as finding duplicate addresses in address lists, such an algorithm is usually the better choice..

DataQualityTools

DataQualityTools 8

Compared to DedupeWizard, DataQualityTools offer more options to find duplicates (dedupe software / list cleansing software) and also a whole series of additional functions to process address data / address lists / mailing lists, such as a function to merge data fields and a function to determine the gender based on the first name from the address. In addition to Excel files, the program can also process dBase, ACCESS, OpenOffice / LibreOffice Calc,  CSV and text files as well as database servers such as MS SQL Server, PostgreSQL, ORACLE, IBM DB2, MariaDB and MySQL. further information ...

DedupeWizard

DedupeWizard 8

A simple software that can be used without any special technical knowledge to clean / dedupe Excel files (deduplication software / dedupe software). Duplicates, especially duplicate addresses can be found using the postal address, the telephone number and/or the email address, either within a single address list or between two address lists, such as is required for the consideration of e.g. Robinson lists, MPS lists or opt-out lists. When the deduplication is based on the postal address – i.e. name, street, postal code and city – the program then also considers typos, inverted words and additions. It is then a fuzzy / error-tolerant matching. further information ...

BatchDeduplicator

BatchDeduplicator 8

BatchDeduplicator contains essentially the same functions as DataQualityTools to dedupe lists (dedupe software / data cleaning software). Like with DataQualityTools, large databases (data sources: EXCEL, ACCESS, dBase, OpenOffice / LibreOffice Calc, CSV files and text files and database servers such as MS SQL Server, MySQL, MariaDB, ORACLE, IBM DB2 and PostgreSQL) can be processed, several million records generally do not represent a problem. The projects can be scheduled and provided with a time of execution, to thus be carried out e.g. every Tuesday at 5 pm. further information ...

Try it free! Download now!

( Release date: 2024/2/16 )

Better data doesn’t have to be expensive.