Portals eNewsletters Web Seminars dataWarehouse.com DM Review Magazine
DM Review | Information Is Your Business
   Information Is Your Business Advanced Search
advertisement

RESOURCE PORTALS
Business Intelligence
Compliance
Corporate Performance Management
Data Management
Data Integration
Data Quality
Data Visualization
Data Warehousing Basics
ETL
Master Data Management
View all Portals

WEB SEMINARS
Scheduled Events

RESEARCH VAULT
White Paper Library
Research Papers

CAREERZONE

Advertisement

INFORMATION CENTER
DM Review Home
Newsletters
Current Magazine Issue
Magazine Archives
Online Columnists
Ask the Experts
Industry News
Search DM Review

GENERAL RESOURCES
TechEvaluationsCenter:
Evaluate IT solutions
Bookstore
Buyer's Guide
Glossary
Industry Events Calendar
Monthly Product Guides
Software Demo Lab
Vendor Listings

DM REVIEW
About Us
Press Releases
Awards
Advertising/Media Kit
Reprints
Magazine Subscriptions
Editorial Calendar
Contact Us
Customer Service

Ask the Experts Question and Answer

Ask the Expert

Meet the Experts
Ask a Question (Names of individuals and companies will not be used.)
Question Archive
Ask the Experts Home

Q:  

Do I need a data warehouse to do data mining? My IT department moves at a snail's pace. By the time my unit sees a data warehouse delivered... anyway, my transactional database is ripe for data mining applications.

A:  

Sid Adelman?s Answer: No, you don?t absolutely need a data warehouse to begin data mining. But you do need clean, accurate data that is well understood and the creation of a data warehouse usually (and should) include cleaning up and documenting the data. Even if you pull the data from a clean data warehouse, you still have additional work to transform the data and get it ready for data mining.

Ross Armstrong?s Answer: You probably go to the restaurant and ask to see the dessert menu first. Get to the good stuff as soon as you can. You don?t need a warehouse first, but there are a number of considerations you should plan for. The process of preparing the data for mining is important. Remember, the mining algorithm analyzes the absolute values without consideration for context. If you have a column that changes valid values or changes meaning over time, then the results of the mining will be skewed unless you remove this data first. If there are spikes or errors in the data, then they have to be removed. Since the mining process might take a few weeks or months, it is necessary to not update the data for that period of time. This may not be practical in a transactional system. In short, you generally don?t need a data warehouse, but you?ll probably need something similar to put the data into to mine it.

(Posted )


ARCHIVE OF QUESTIONS & ANSWERS FOR DATA QUALITY
BACK TO THE LIST OF CATEGORIES



Advertisement
advertisement
Site Map Terms of Use Privacy Policy
SourceMedia (c) 2006 DM Review and SourceMedia, Inc. All rights reserved.
SourceMedia is an Investcorp company.
Use, duplication, or sale of this service, or data contained herein, is strictly prohibited.