Ask the Experts Question and Answer
(Posted May 3, 2002)
Could you please give me an idea of the typical range on a per record basis for a deduplication (data cleansing) project?
Sid Adelman's Answer: There is no meaningful range. It will depend on:
- How many source files you are consolidating for the deduping.
- Whether the data entry staff were careful and motivated to minimize duplicate names. It's often easier for them to just enter a duplicate name than to identify and use a customer number that already exists.
- Whether the entry system has edits that spot duplicates and alert the person entering the data.
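As an illustration of the last point, here is a minimal sketch of an entry-time check that flags likely duplicates before a new customer is created. It is not taken from any particular product: the function names, the 0.85 similarity threshold, and the sample customer table are all assumptions made for the example, and the matching uses Python's standard difflib rather than a dedicated name-matching tool.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower()
    )
    return " ".join(cleaned.split())


def possible_duplicates(new_name, existing, threshold=0.85):
    """Return (customer_number, name, score) for similar existing customers.

    `existing` maps customer number -> name. The threshold is an assumed
    starting point and would need tuning against real data.
    """
    target = normalize(new_name)
    hits = []
    for number, name in existing.items():
        score = SequenceMatcher(None, target, normalize(name)).ratio()
        if score >= threshold:
            hits.append((number, name, round(score, 2)))
    # Best matches first, so the clerk sees the most likely customer number.
    return sorted(hits, key=lambda hit: hit[2], reverse=True)


# Hypothetical customer table, for illustration only.
customers = {1001: "Acme Corp.", 1002: "Globex Inc", 1003: "ACME Corp"}

for number, name, score in possible_duplicates("Acme Corp", customers):
    print(f"Possible duplicate of customer {number}: {name} (score {score})")
```

A check like this only alerts; whether the entry clerk reuses the existing customer number is still a matter of motivation, as noted above.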
Chuck Kelley's Answer: A typical range of what? Cost? Records combined? Each project will be different. I have seen as much as 50:1 and as little as 1.1:1 in terms of combining records. Some analysis will need to be done on how bad the data is and how much it is worth to have a "perfectly" cleansed environment. You can do a 75 percent cleansing of name and address rather inexpensively. To do a "perfect" cleanse can cost another 50 times as much. How much is it worth to your business?
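One inexpensive way to start the analysis of how bad the data is would be to estimate the combining ratio on a sample. The sketch below is only an assumption about how that might be done: it groups records on a crudely normalized name and address and reports input records per surviving record (the sense in which 50:1 and 1.1:1 are used above). The field names, the normalization rule, and the sample records are hypothetical, and exact-key matching like this will understate the ratio a real cleansing tool would find.

```python
from collections import defaultdict


def match_key(record):
    """Build a crude match key from name and address (assumed field names)."""
    def norm(text):
        cleaned = "".join(
            ch if ch.isalnum() or ch.isspace() else " " for ch in text.lower()
        )
        return " ".join(cleaned.split())

    return (norm(record["name"]), norm(record["address"]))


def consolidation_ratio(records):
    """Return input records per distinct match key, e.g. 2.0 means 2:1."""
    groups = defaultdict(list)
    for record in records:
        groups[match_key(record)].append(record)
    if not groups:
        return 1.0  # nothing to combine in an empty sample
    return len(records) / len(groups)


# Hypothetical sample records, for illustration only.
sample = [
    {"name": "Acme Corp.", "address": "12 Main St"},
    {"name": "ACME CORP", "address": "12 Main St."},
    {"name": "acme corp", "address": "12  main st"},
    {"name": "Globex Inc", "address": "9 Elm Ave"},
]

print(f"Combining ratio: {consolidation_ratio(sample):.1f}:1")
```

Running this against a sample from each source file gives a rough lower bound on how many records would be combined, which helps frame the question of how much further cleansing is worth to the business.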