Portals eNewsletters Web Seminars dataWarehouse.com DM Review Magazine
DM Review | Covering Business Intelligence, Integration & Analytics
   Covering Business Intelligence, Integration & Analytics Advanced Search

View all Portals

Scheduled Events

White Paper Library
Research Papers

View Job Listings
Post a job


DM Review Home
Current Magazine Issue
Magazine Archives
Online Columnists
Ask the Experts
Industry News
Search DM Review

Buyer's Guide
Industry Events Calendar
Monthly Product Guides
Software Demo Lab
Vendor Listings

About Us
Press Releases
Advertising/Media Kit
Magazine Subscriptions
Editorial Calendar
Contact Us
Customer Service

Meta Data & Knowledge Management:
Managed Meta Data Environment: A Complete Walk-Through, Part 1

  Column published in DM Review Magazine
April 2004 Issue
  By David Marco

Author's Note: This column begins an eight-part series adapted from the book Universal Meta Data Models by David Marco and Michael Jennings (John Wiley & Sons).

Almost every corporation and government agency has already built, is in the process of building or is looking to build a managed meta data environment (MME). Many organizations, however, are making fundamental mistakes. An enterprise may build many meta data repositories (or "islands of meta data") that are not linked together and, as a result, do not provide as much value (see sidebar).

Let's take a quick meta data management quiz. What is the most common form of meta data architecture? It is likely that most of you will answer centralized; but the real answer is "bad architecture." Most meta data repository architectures are built the same way data warehouse architectures were built: badly. The data warehouse architecture issue resulted in many Global 2000 companies rebuilding their data warehousing applications, sometimes from the ground up. Many of the meta data repositories being built or already in use need to be completely rebuilt.

The goal of this eight-part series is to ensure that your MME's architecture is constructed on a rock solid foundation that provides your organization with a significant advantage over those with poorly architected MMEs. During this series, I will present the complete MME architecture and walk through, in detail, each of the six major components and the sustainment of the MME.

MME Overview

The managed meta data environment represents the architectural components, people and processes that are required to properly and systematically gather, retain and disseminate meta data throughout the enterprise. The MME encapsulates the concepts of meta data repositories, catalogs, data dictionaries and any other term that people use to refer to the systematic management of meta data. Some people mistakenly describe an MME as a data warehouse for meta data. In actuality, an MME is an operational system and, as such, is architected in a vastly different manner than a data warehouse.

Companies that are looking to truly and efficiently manage meta data from an enterprise perspective need a fully functional MME. It is important to note that a company should not try to store all of their meta data in an MME, just as the company would not try to store all of their data in a data warehouse. Without the MME's components, it is very difficult to effectively manage meta data in a large organization. The six components of the MME, shown in Figure 1 on page 51, are: meta data sourcing layer, meta data integration layer, meta data repository, meta data management layer, meta data marts and meta data delivery layer.

Figure 1: Managed Meta Data Environment

The MME can be used in either the centralized, decentralized or distributed architecture approaches. Centralized architecture offers a single, uniform and consistent meta model that mandates the schema for defining and organizing the varied meta data stored in a global meta data repository. This allows for a consolidated approach to administering and sharing meta data across the enterprise. Decentralized architecture creates a uniform and consistent meta model that mandates the schema for defining and organizing a global subset of meta data to be stored in a global meta data repository and in the designated shared meta data elements that appear in local meta data repositories. All meta data that is shared and reused among the various local repositories must first go through the global repository; however, sharing and access to the local meta data are independent of the global repository. Distributed architecture includes several disjointed and autonomous meta data repositories that have their own meta models to dictate their internal meta data content and organization, with each repository solely responsible for the sharing and administration of its meta data. The global meta data repository will not hold meta data that appears in the local repositories; instead, it will have pointers to the meta data in the local repositories and meta data on how to access it (see Chapter 7 of Building and Managing the Meta Data Repository for a more detailed walk-through of these approaches). At EWSolutions, we have built MMEs that use each of these three architectural approaches, and some implementations use combinations of these techniques in one MME.

Next month, I will begin my detailed walk-through of each of the six MME components, starting with the meta data sourcing layer.

Where's My Meta Data Architecture?

AT EWSOLUTIONS, ONE OF OUR CLIENTS is a large pharmaceutical company. Because knowledge is the lifeblood of any pharmaceutical company, these types of firms tend to have very large meta data requirements and staffs. Our client decided to hold a "Meta Data Day," and they asked me to give a keynote address to begin the day. Between 60 and 80 people attended their "Meta Data Day."

A series of workshops followed the keynote address. We counted four separate meta data repositories in production and three other separate new meta data repository initiatives starting -- a classic "islands of meta data" problem. This is not an approach that leads to long-term positive results, as much of the most valuable meta data functionality will come from the relationships that the meta data has with itself. For example, it is highly valuable to view a physical column name (technical meta data) and then drill across to the business definition (business meta data) of that physical column name.


Check out DMReview.com's resource portals for additional related content, white papers, books and other resources.

David Marco is an internationally recognized expert in the fields of enterprise architecture, data warehousing and business intelligence and is the world's foremost authority on meta data. He is the author of Universal Meta Data Models (Wiley, 2004) and Building and Managing the Meta Data Repository: A Full Life-Cycle Guide (Wiley, 2000). Marco has taught at the University of Chicago and DePaul University, and in 2004 he was selected to the prestigious Crain's Chicago Business "Top 40 Under 40."  He is the founder and president of Enterprise Warehousing Solutions, Inc., a GSA schedule and Chicago-headquartered strategic partner and systems integrator dedicated to providing companies and large government agencies with best-in-class business intelligence solutions using data warehousing and meta data repository technologies. He may be reached at (866) EWS-1100 or via e-mail at DMarco@EWSolutions.com.

Solutions Marketplace
Provided by IndustryBrains

Data Validation Tools: FREE Trial
Protect against fraud, waste and excess marketing costs by cleaning your customer database of inaccurate, incomplete or undeliverable addresses. Add on phone check, name parsing and geo-coding as needed. FREE trial of Data Quality dev tools here.

Speed Databases 2500% - World's Fastest Storage
Faster databases support more concurrent users and handle more simultaneous transactions. Register for FREE whitepaper, Increase Application Performance With Solid State Disk. Texas Memory Systems - makers of the World's Fastest Storage

Manage Data Center from Virtually Anywhere!
Learn how SecureLinx remote IT management products can quickly and easily give you the ability to securely manage data center equipment (servers, switches, routers, telecom equipment) from anywhere, at any time... even if the network is down.

Design Databases with ER/Studio: Free Trial
ER/Studio delivers next-generation data modeling. Multiple, distinct physical models based on a single logical model give you the tools you need to manage complex database environments and critical metadata in an intuitive user interface.

Free EII Buyer's Guide
Understand EII - Trends. Tech. Apps. Calculate ROI. Download Now.

Click here to advertise in this space

View Full Issue View Full Magazine Issue
E-mail This Column E-Mail This Column
Printer Friendly Version Printer-Friendly Version
Related Content Related Content
Request Reprints Request Reprints
Site Map Terms of Use Privacy Policy
SourceMedia (c) 2006 DM Review and SourceMedia, Inc. All rights reserved.
SourceMedia is an Investcorp company.
Use, duplication, or sale of this service, or data contained herein, is strictly prohibited.