-
Marketplace
-
Channel Resources
Articles from this Site
IBM Introduces New Versions of Two Software Products
Experian QAS Selected by Two State Unemployment Insurance Programs
Emerson Network Power Selects Silver Creek Systems
Talend Announces First Open Source Data Profiler
Equinox Pumps Up Data Quality with DataFlux
White Papers
Data Warehousing Ensuring Data Integrity
Making Data Work: Addressing Data Quality at the Enterprise Level
Can your SharePoint Backup Harm Your Business?
The Value Behind Integrity
Building Profitable Customer Relationships and Personalized Retention Strategies
Web Seminars
Master Data Management: Best Practices for Success
Getting In Synch: Creative Ways to Reconcile Data Between Apps
Closing the Loop: Real-Time Event Detection and Response
Books
Corporate Information Factory, 2nd Edition
The Data Warehouse Challenge: Taming Data Chaos
Data Quality for the Information Age
Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing Profits
Metadata Management for Information Control and Business Success
Talking with Your Business Partners about Data Quality
Solving for Quality
If you regularly read this column, I suspect you are someone who is involved in improving data quality within your company. You have probably had business partners or customers tell you that your data is bad. On some occasions, they have almost certainly used words that are considerably worse than bad. These are interesting interactions, and there is absolutely nothing constructive that you can do as a result of these discussions. The description bad data, or something more emotional, is simply too general to be valuable. Its as if you went to your doctor to tell her that you have a terrible headache, but you could not be specific about the nature of the pain, its location, frequency and other symptoms that would be useful for a diagnosis.
Three Steps to Solve Data Problems
Solving the bad data problem requires:
- Clearly defining the nature of the problems your business partners/customers are experiencing,
- Establishing priorities to tackle the most strategically important issues first, and
- Implementing an improvement plan with appropriate metrics and communications to your business partners/customers.
Clearly defining the data problems that your business partners/customers are experiencing, establishing priorities for remediation and communicating your enhancement successes are processes that can only be successfully completed in partnership with your business customers.
Begin with the process of defining the nature of the problems. Your business end users or customers need data to complete tasks. They tend to be focused on their tasks and not on the characteristics of the data they employ. They dont want to, or need to, understand the particulars of how the data is managed and why it may not be what they need, when they need it. Your users simply want good quality data. To assess the root causes of the bad data problem, you will need to know much more precisely what customers mean by bad data. It helps to introduce a shared vocabulary for talking about data quality issues. Timeliness, accuracy, completeness and consistency are one set of quality dimensions that can be useful.
- Timeliness: Currency of data elements.
- Accuracy: Attributes of the entity (object) are correctly represented.
- Completeness: Breadth (number of entities) and depth (number of fields defined and populated).
- Consistency: Identity, definitions, hierarchies, standards and metrics are the same within and across databases.
If your business end users understand these four data quality dimensions, you may discover, for example, that the inaccurate entity identity data issues your customers are talking about (This isnt the right address. Cant you guys do anything right?) is actually a timing problem because your extraction and update cycles are too slow. Or, another common example is a lack of market opportunity data for small businesses (This is such bad data. Where are the employee numbers for these small businesses?), which is a completeness problem that can probably never be solved because no one has the data.
Once you and your business customers have a common vocabulary that permits you to disaggregate the problems into quality components, both you and your business end users will have a much clearer picture of what could be fixed. The next step is to determine what will be fixed. No enterprise has the resources to create perfect data for all data types in all databases. You and your customers need to agree on priorities for attacking specific data problems. This is an effort that involves looking at data types (entity identity data, market opportunity data, relationship data, transaction/role data, etc.) and at the quality dimensions (timeliness, accuracy, completeness and consistency). In many cases, your highest-priority initiatives will involve enhancing only one or two quality dimensions for one or two data types. The choices made in this step should be re-evaluated regularly as your customers strategic priorities change.
Perhaps the most important component in the process of solving the bad data problem is regular communication between you and your business end users or customers. Once you both speak the same quality language and both agree on which issues should be tackled first, you need to report regularly on progress against goals. When business end users have a clear idea that progress is being made, they are often remarkably patient. There is an old proverb that states: A half a loaf is a feast for a starving man. Once you have explained the dimensions of quality and once your customers have agreed with you on near-term quality initiatives, they will understand that perfect data - the whole loaf - isnt available. They will also understand that you are working to deliver the data they need for their strategic goals, the tasty and fulfilling half loaf.
Key takeaway: Most of your business customers dont actually understand data quality, but they do know when they cannot use data to perform necessary tasks. You need to educate them about how to describe their quality symptoms, gain their agreement about quality improvement plans and communicate regularly on the successes of those plans.
Vicki P. Raeburn is president of Scofield Ridge Associates, Inc., a business consultancy focused on data governance and data quality.She has nearly 30 years of leadership experience in the information industry, where she has held positions in global business and product development, marketing, and data operations. Most recently, Raeburn was chief quality officer at Dun & Bradstreet, consulting with customers on the successful implementation of customer information solutions. She has a B.A. from New College of Florida and a Ph.D. from Yale University. She may be reached at vicki@scoridge.com.
For more information on related topics, visit the following channels:


