-
Marketplace
-
Channel Resources
Articles from this Site
SAS Unveils Solution for Managing Virtual Server Environments
IE Discovery Announces Update to InfoDox
JMR Construction Builds Virtual Infrastructure with DataCore
Composite Software Releases Data Integration Strategy Recommendation Tool
IBM Introduces New Informix Dynamic Server
White Papers
Pragmatic Approach to Compliance Data Collation
Informatica - Handling Variable Length Files Using XML
Putting Metadata to Work to Achieve the Goals of Data Governance
Enterprise Information Management - Insights and Strategies into the Direction of EIM
Automated Analysis Technology
Web Seminars
Making the Business Case for Predictive Analytics: Innovative Strategies for Maximizing ROI
Master Data Management: Best Practices for Success
Modeling Unstructured Data
Books
Data Management: Databases and Organizations, 3rd Edition
Data Modeler's Workbench: Tools and Techniques for Analysis and Design
Effective Databases for Text & Document Management
Mobile Handheld Devices - Enabling Enterprise Communications and Data Management
Mobile Data Management (MDM 2002), 3rd International Conference
Data Ownership and Collaboration
Knowledge Integrity
Some recent interest in exploring ideas about data ownership prompts me to revisit the topic of data ownership and collaboration, which we discussed in the March 2003 issue of DM Review. Recent client work associated with establishing data standards has exposed aspects of data ownership that transcend our earlier discussion, mostly inspired by the question of ownership of data (and meta data) that is exchanged.
The first question is introduced as a byproduct of a Web services application. In the client environment, a number of participants in a data exchange network coordinate their exchanges through a centralized Web service. For example, consider a product market in which suppliers register their products and prices with a centralized service, and customers visit the service searching for and potentially purchasing products. In this case, a standard for exchanging information is established by which requests for information are propagated to the suppliers and by which their responses are forwarded to the service. The service is essentially participating as a value-added pass-through for data.
The first question is: who "owns" the data provided by the centralized service? On one hand, one might simply say that the data suppliers own the data because they are supplying it. However, there is an equally good argument that the centralized service owns the data because the service collects and synthesizes the product that is eventually displayed to the data consumer. The question again focuses on responsibility for the quality of the data. However, because there is a possibility that flaws can be introduced in more than one administrative domain, the boundaries of governance may allow data of low quality to propagate under everyone's radar.
The second question, which is the more interesting one, is who owns the meta data associated with data exchange? In fact, let's take this one step further - who owns data standards in general? Rather, let's look at it this way: in an environment where there is an expectation for conformance to a data standard, who is responsible for ensuring the quality of data being passed?
As a concrete example, the introduction of XML as a framework for defining schemas for data interchange has only created an explosion in the proliferation of data standards. (This has prompted one of my colleagues to mention, tongue-in-cheek, that it is great to have so many standards to choose from.) However, embedded within XML schemas and even within traditional fixed-field format layouts are tons of data quality rules - data domain definitions, format constraints, cross-field constraints. For example, in a schema we recently worked on, the client introduced data elements that were what you might call "conditionally mandatory"- a group of elements is optional, but if the group is present, one of a set of three data elements must be present.
Validation of some of these rules can be performed simply through schema validation. Others, such as our conditional mandatory selection of fields, must be handled by an application - schema validation won't cut it. Consider the scenario: A requestor sends the request and anticipates a response. The service handles the request and sends it to the suppliers, who respond. The service forwards the response to the original requestor, who is the ultimate consumer of the data. Who is responsible for validating the data - the requestor, the service or the supplier?
It is more relevant to consider whose business process is most impacted by data that does not conform to the standard. In addition, data standards are effectively "common areas" to be tended by contributions from all participants. By considering the business impacts of not addressing governance of the standards that delineate the exchange of data, all participants will put their shared investment at risk.
David Loshin is the president of Knowledge Integrity, Inc., a consulting and development company focusing on customized information management solutions including information quality solutions consulting, information quality training and business rules solutions. Loshin is the author of Enterprise Knowledge Management - The Data Quality Approach (Morgan Kaufmann, 2001) and Business Intelligence - The Savvy Manager's Guide and is a frequent speaker on maximizing the value of information. Loshin may be reached at loshin@knowledge-integrity.com.
For more information on related topics, visit the following channels:


