What is the typical rate within data warehousing for promoting changes that effect transformation and data cleansing for existing subject areas that are in production?
What is the typical rate within data warehousing for promoting changes that effect transformation and data cleansing for existing subject areas that are in production? My experience is from the operational side where changes or releases to Production are carefully planned out and done on a quarterly and sometimes monthly basis.
Douglas Hackney's Answer: Just after release, there is commonly a flurry of changes and modifications as data quality and other issues are revealed by use and utilization. Shops with more process rigor and structured processes will usually settle into a bi-monthly or monthly frequency of update to the system.
David Marco's Answer: Making changes to a production data warehouse environment is very similar to that of operational systems. Program errors/bugs are typically changed ASAP as the errors arise. Data warehouse enhancements/modifications occur on a scheduled basis...monthly, quarterly, as needed, etc).
Sid Adelman's Answer: The ETL process must be controlled just like the controls you have for operational systems. Changes to the ETL process come about from changes to the source systems (which shouldn't happen that frequently), identification of data quality problems that require changes in the cleansing process and requests for new data. The latter two requirements come about more frequently than the first and so you need to be more flexible and be able to react more quickly than you would with traditional operational systems.
For more information on related topics visit the following related portals...
Data Acquisition, Replication,
Data Quality and
Douglas Hackney is the president of Enterprise Group Ltd., a consulting and knowledge-transfer company specializing in designing and implementing data warehouses and associated information delivery systems. He can be reached at www.egltd.com.
David Marco is an internationally recognized expert in the fields of enterprise architecture, data warehousing and business intelligence and is the world's foremost authority on meta data. He is the author of Universal Meta Data Models (Wiley, 2004) and Building and Managing the Meta Data Repository: A Full Life-Cycle Guide (Wiley, 2000). Marco has taught at the University of Chicago and DePaul University, and in 2004 he was selected to the prestigious Crain's Chicago Business "Top 40 Under 40." He is the founder and president of Enterprise Warehousing Solutions, Inc., a GSA schedule and Chicago-headquartered strategic partner and systems integrator dedicated to providing companies and large government agencies with best-in-class business intelligence solutions using data warehousing and meta data repository technologies. He may be reached at (866) EWS-1100 or via e-mail at DMarco@EWSolutions.com.
Sid Adelman is a principal in Sid Adelman & Associates, an organization specializing in planning and implementing data warehouses, in data warehouse and BI assessments, and in establishing effective data architectures and strategies. He is a regular speaker at DW conferences. Adelman chairs the "Ask the Experts" column on www.dmreview.com. He is a frequent contributor to journals that focus on data warehousing. He co-authored Data Warehouse Project Management and is the principal author on Impossible Data Warehouse Situations with Solutions from the Experts. His new book, Data Strategy, is scheduled for publication this year. He can be reached at 818-783-9634 or firstname.lastname@example.org. Visit his Web site at www.sidadelman.com.
Provided by IndustryBrains
|Data Validation Tools: FREE Trial|
Protect against fraud, waste and excess marketing costs by cleaning your customer database of inaccurate, incomplete or undeliverable addresses. Add on phone check, name parsing and geo-coding as needed. FREE trial of Data Quality dev tools here.
|Backup SQL Server or Exchange Continuously|
FREE WHITE PAPER. Recover SQL Server, Exchange or NTFS data within minutes with TimeSpring?s continuous data protection (CDP) software. No protection gaps, no scheduling requirements, no backup related slowdowns and no backup windows to manage.
|Manage Data Center from Virtually Anywhere!|
Learn how SecureLinx remote IT management products can quickly and easily give you the ability to securely manage data center equipment (servers, switches, routers, telecom equipment) from anywhere, at any time... even if the network is down.
|Design Databases with ER/Studio: Free Trial|
ER/Studio delivers next-generation data modeling. Multiple, distinct physical models based on a single logical model give you the tools you need to manage complex database environments and critical metadata in an intuitive user interface.
|Free EII Buyer's Guide|
Understand EII - Trends. Tech. Apps. Calculate ROI. Download Now.
|Click here to advertise in this space|