-
Marketplace
-
Channel Resources
Articles from this Site
Information Builders to Extend WebFOCUS to Predictive Analysis
A Statistical Stocking Stuffer for the Holidays
What are your views on the advantages and/or disadvantages ETL tools and data modeling versus code?
New Interfaces Integrated into MEGA International Modeling Software
Which would be a better choice of classes for career growth in data warehousing - ETL architecture or dimensional modeling?
White Papers
Best Practices: Eight Tips for Improving Your Professional Services Business
Metadata Management for Enterprise Applications
UML for C#
PHP Code Design
Domain-Specific Modeling: 10x Faster than UML
Web Seminars
Modeling Unstructured Data
Creative Strategies for Achieving 24/7 Uptime
Closing the Loop: Real-Time Event Detection and Response
Learning from Others: Best Practices for Data Governance
Books
Data Mining Cookbook: Modeling Data for Marketing, Risk and Customer Relationship Management
Data Modeler's Workbench: Tools and Techniques for Analysis and Design
The Data Modeling Handbook: A Best-Practice Approach to Building Quality Data Models
Data Mining Using SAS Applications
Data Mining: Concepts, Models, Methods and Algorithms
Definition Differences
Design Challenge
Your coworker calls you with a data modeling emergency. There needs to be a single definition for Customer in the data warehouse, yet he was provided with three different definitions. Each definition was provided by a well-respected business expert and is accurate within that business expert's area.
The Challenge
He needs you to recommend a solution he can use to come up with a definition that satisfies all three business experts. What would you recommend?
The Response
The tactic to get to a single Customer definition would require three main activities: seek help from the business, get the complete picture and capture it on the model.
Seek Help for a Complete Picture
It would be difficult to consolidate these three definitions into one without the help of a business owner or data steward. Jan Kamil, data warehouse analyst, would recommend asking, "Who owns the Customer data? That's the party who has to come up with or agree to a definition. If there's no definitive answer to that question, then it's time for tough love. Bribe all three experts with food, lock them in a room with you and don't let anyone out until an inclusive definition has been agreed to by all parties."
William Delaney, investment systems analyst, would recommend a complete analysis of Customer. "It would make sense that there are other business areas or stakeholders within the organization who also have yet another definition of Customer. I would propose a complete analysis of Customer and the current definition of Customer as perceived by each of the business units and stakeholders." Tom Bilcze, data architect, said, "I would first start with one-on-one discussions with each of the experts. After these exercises are complete, the analyst presents a homogenized definition formulated during the one-on-one interview process to each of the three experts."
Part of getting the complete picture involves understanding how customer data is stored in each of these three areas today. Gordon Everest, professor emeritus, recommended asking the question, "Does machine-stored Customer data already exist in each of the three areas? If not, the tactic is much simpler. If Customer data exists in one or more tables in each area, then we have the dual problem of combining data (rows) as well as combining definitions (columns)."
Based on the results of the detailed analysis, there are three ways to model this: subtype, go generic or abstract.
Subtype
Marc Bratman, application development manager, said, "I would define a Customer entity and then three subtypes for the Customer, each defined by the different business experts." A good approach to doing this, as summarized by Krishnan Seshadri, data architect, is, "Define Customer in the broadest of terms, preferably tied to the wording in the organization's mission statement. Then emphasize each business unit's view of the Customer by comparison/contrast and examples." I have found a very useful way of locating a common generic definition is to go to Google.com and type "Define: Customer." Tim Klein, data warehouse architect, recommended, "If there is little overlap or the required subtyping is too extensive, then the Customers are not the same entity and need separate names and definitions."
Go Generic
Anna-Mari Mollentze, IT engineer, suggested a generic definition, such as, "A Customer is any person or entity who interacts directly or indirectly with any business system, thus it can be a client in the marketing or sales departments, a supplier in procurement or an employee in HR." Brent G. Wilke, data warehouse architect, said, "You should try and find a 'merged' definition using the information from all three business experts before this meeting. By merged definition I mean pull out the unique terms in each definition and see if you can incorporate these unique terms into one definition."
Abstract
Dave Hay, industry expert, believes a more abstract approach should be taken. "I never have Customer as an entity or a table. I have Party that may be either a Person or an Organization. Each Party may be a Customer in an order. In addition, each Party may be a Customer in a marketing relationship. The point is that each of the definitions is for a different thing. The fact that there are multiple definitions for Customer means that the term should be thrown out." Michael Smilg, data administrator, offers a similar approach. "I would recognize that there is no such thing as a single definition of customer that would meet the needs of all three areas. Instead, the data warehouse should recognize that a Customer might have multiple roles and rigorously define and manage the distinctions across those roles."
If you would like to become a design challenger and have the opportunity to submit modeling solutions, please add your email address at www.stevehoberman.com. If you have a challenge you would like our group to tackle, please email me a description of the scenario at me@stevehoberman.com.
Steve Hoberman has worked as a business intelligence and data management practitioner and trainer since 1990. He is a Certified Business Intelligence Professional (CBIP), having achieved mastery level certification in data analysis and design. He is a popular and frequent presenter at industry conferences, both nationally and internationally. Hoberman is a columnist and frequent contributor to industry publications, as well as the author of Data Modeler's Workbench and Data Modeling Made Simple (available for purchase through the DM Review bookstore). He is the founder of the Design Challenges group, inventor of the Data Model Scorecard and a recognized innovator and thought leader in the field of data modeling. He can be reached at me@stevehoberman.com.
Graeme Simsion's latest book is out! Data Modeling Theory and Practice. Here's a link where you can read more about the book and purchase it at a discounted price.
For more information on related topics, visit the following channels:


