Portals eNewsletters Web Seminars dataWarehouse.com DM Review Magazine
DM Review | Covering Business Intelligence, Integration & Analytics
   Covering Business Intelligence, Integration & Analytics Advanced Search
advertisement

Resource Portals
Analytic Applications
Business Intelligence
Business Performance Management
Data Integration
Data Quality
Data Warehousing Basics
EDM
EII
ETL
More Portals...

Advertisement

Information Center
DM Review Home
Conference & Expo
Web Seminars & Archives
Newsletters
Current Magazine Issue
Magazine Archives
Online Columnists
Ask the Experts
Industry News
Search DM Review

General Resources
Bookstore
Industry Events Calendar
Vendor Listings
White Paper Library
Glossary
Software Demo Lab
Monthly Product Guides
Buyer's Guide

General Resources
About Us
Press Releases
Awards
Media Kit
Reprints
Magazine Subscriptions
Editorial Calendar
Contact Us
Customer Service

Noted in Passing: The Author of a Great Idea

  Article published in DM Review Magazine
July 2003 Issue
 
  By Lou Agosta

Dr. Edgar Codd, IBM Fellow (retired), computer pioneer and creator of the relational database, passed away Friday, April 18, 2003, at the age of 79 of heart failure.

The relational database is the dominant design for the storage and manipulation of data relevant to commercial business applications, data warehousing systems and business intelligence, and is likely to remain so for the foreseeable future.

Specific niche markets in emerging domains such as genomics and the representation of bioinformatics may require or benefit from a new paradigm for the representation of data, but the dominance of the relational model in commercial business operations is secure. Bank accounts, credit cards, stock trading, travel reservations, online auctions and innumerable other now-routine data transactions, as well as data warehouses that address them, all rely on Codd's model. This model was first implemented by Larry Ellison in an early version of what would become the flagship Oracle database. This immediately captured IBM's attention and helped to overcome internal skepticism about Codd's work, which then provided the basis for IBM's own prototype SQL/DS (1981) and DB2 (1983). Other early commercial products based on the relational model included Sybase SQL Server, which originally shared code with what is now Microsoft SQL Server, Teradata (NCR) and Ingres (Computer Associates).

One source of the power of the relational model is that it says nothing about the physical implementation of the data but provides a simple set of logical operations - union, intersection and negation - from which virtually all other transformations of the data can be derived. Another source is its simplicity. The model is supposed to be implemented by means of a symbol system that employs a small, simple set of English language statements (declarations such as select, insert, update and delete) to data organized intuitively into rows and columns (tables). The irony is that a system's strengths can also become its weaknesses. If anything, the challenge is that the relational model is so simple and elementary. "Simple" does not always mean "easy" - it gets under the radar of common sense and the everyday sloppiness with which most people are comfortable. When taken to an elementary level, even simplicity can be challenging. In addition, the subsequent choice of the term "relational algebra" by the colleagues at the lab to describe the syntax of the relational model was obviously coined by software developers and never vetted by the marketing department. It retains a certain aura of mystery - and can inspire fear in the heart of the many people for whom algebra was not a good experience in high school. Yes, logical abstraction and mathematical (set) theory is to be found in the background. Yet the intuitiveness of the rows and columns of the table is clear in an entirely different context as spreadsheets have come to dominate business analysis on the desktop. What the relational model has had all along (in contrast to the desktop) is integrity - data integrity. Another advantage is that the form of the relational SQL statements is declarative - the user tells the system what to do, not how to do it, which is left to the underlying database itself. This is in contrast to procedural programming languages such as C that require experts to disentangle the convoluted syntax and thus will remain the domain of specialists. Structured query language (SQL) has always been available to business analysts who were willing to make a modest extra effort without having to become full-fledged developers. It is now an interface for data mining, ETL (extract, transform and load) and a variety of business intelligence analyses. Thus, if you are looking for a great idea that has still not been exhausted - and, one way to identify a great idea is by its inexhaustibility - see Codd's 1970 paper, "A Relational Model of Data for Large Shared Data Banks" (reprinted from Communications of the ACM, Vol. 13, No. 6, June 1970 at http://www.acm.org/classics/nov95/).

This assertion can be appreciated in the prediction that all other paradigms without exception will be assimilated to the relational one. That has already happened with object-oriented databases (one or two of which have found a niche in specialty verticals such as publishing or avionics). Object-relational extensions are now a feature of the standard relational database in the form of user-defined data types, user-defined functions and inheritance mechanisms. This will likewise happen with in-memory databases, OLAP databases and XML databases. This leads to a clear recommendation for clients - absent very specific industry-specific requirements. Do not rush to purchase one of these special-purpose data stores, but rather wait for the functionality to be assimilated in the next release of your standard relational database.

Like so many touched by genius, Codd has gone from being a voice crying in the wilderness, to being merely impractical, to being obvious such that everyone knew it all along. Some readers will take heart from the example of Codd with their own struggles in the corporate jungle in that he received a less than satisfactory review from superiors at IBM in Poughkeepsie, New York, early in his career, leading him to move west to California seek new opportunities at the IBM Santa Teresa Lab. As they say, the rest is history...

...............................................................................

For more information on related topics visit the following related portals...
Databases.

Lou Agosta is the lead industry analyst at Forrester Research, Inc. in data warehousing, data quality and predictive analytics (data mining), and the author of The Essential Guide to Data Warehousing (Prentice Hall PTR, 2000). Please send comments or questions to lagosta@acm.org.

 

 

Solutions Marketplace
Provided by IndustryBrains

Bowne Global Solutions: Language Services
World's largest language services firm offers translation/localization, interpretation, and tech writing. With offices in 24 countries and more than 2,000 staff, we go beyond words with an in depth understanding of your business and target markets

Award-Winning Database Administration Tools
Embarcadero Technologies Offers a Full Suite of Powerful Software Tools for Designing, Optimizing, Securing, Migrating, and Managing Enterprise Databases. Come See Why 97 of the Fortune 100 Depend on Embarcadero!

Online Backup and Recovery for Business Servers
Fully managed online backup and recovery service for business servers. Backs up data to a secure offsite facility, making it immediately available for recovery 24x7x365. 30-day trial.

NEW Glasshouse White Paper from ADIC
Learn to integrate disk into your backup system; evaluate real benefits and costs of different disk backup approaches; choose between disk arrays and virtual tape libraries; and build long-term disaster recovery protection into a disk backup system.

Data Mining: Strategy, Methods & Practice
Learn how experts build and deploy predictive models by attending The Modeling Agency's vendor-neutral courses. Leverage valuable information hidden within your data through predictive analytics. Click through to view upcoming events.

Click here to advertise in this space


View Full Issue View Full Magazine Issue
E-mail This Article E-Mail This Article
Printer Friendly Version Printer-Friendly Version
Related Content Related Content
Request Reprints Request Reprints
Advertisement
advertisement
Site Map Terms of Use Privacy Policy
SourceMedia (c) 2005 DM Review and SourceMedia, Inc. All rights reserved.
Use, duplication, or sale of this service, or data contained herein, is strictly prohibited.