-
Marketplace
-
Channel Resources
Articles from this Site
Technologies to Take Your Enterprise-Wide BI and DW Program to the Next Level
Data Integration Will Break Out of the Silo: BI and DW Trends, 8 for '08
U.S. Airforce Selects Teksouth
The Song Remains the Same - Data Shadows Systems Continue to be Pervasive for Reporting and Analytics: BI and DW Trends, 8 for '08
Data Warehousing Meets Data Archiving in Information Lifecycle Management
White Papers
Databasing in the 90s: Data and What We're Doing with It!
Spend Data Warehouse on Steroids
Debugging PL/SQL with AgileInfoSoftware OraDebug
Tune Oracle SQL Performance with AgileInfoSoftware
Data Warehouses: What are they and how will they benefit your organization?
Books
One of our customers outsourced their data warehouse. Now they want to bring it back in house. What all factors should be considered and what are the risks?
|
Q: |
One of our customers outsourced their data warehouse. Now they want to bring it back in house. What all factors should be considered and what are the risks? |
|
A: |
Sid Adelman's Answer: The biggest risk seems to be the lack of intelligent decision making capability of your customer's management to have misunderstood the issues and underestimated the problems of outsourcing before they signed the contract. My guess is they will be making comparably inappropriate decisions as they insource the system. Having said that some other risks are:
Outsourcing is either the result of the bean counters being powerful enough to force through the decision or the result of an ineffectual IT organization which had been unable to support the business. The question is what has changed? Danette McGilvray's Answer: I would first find why they want to bring the data warehouse back in house. What are all the reasons for this decision? I suspect one of the reasons is that the business has concerns with the quality of the information coming out of the warehouse. I will focus my comments on information quality since it is often overlooked completely or postponed "until the transition is complete." The quality questions were probably not adequately addressed during the initial development and implementation of the warehouse. A common - yet ineffective approach - is to say "Let's just get the warehouse built. We will take care of the information quality after we are in production." What is left is a warehouse that was loaded with poor quality data. To further add to the problem, ongoing processes that load, maintain, and use the data were not developed with data quality in mind. The result is knowledge workers who will not use the warehouse because they do not trust the data. Along with the common attitude just expressed of addressing quality after implementation is another perspective that includes the statement, "Yes, we have a data quality team. It is working in parallel with the project." This statement indicates those responsible for data quality are actually separate from the warehouse project. Don't repeat that mistake. Any project team formed to bring the data warehouse back in house should include those who bring the data quality point of view to the project. The data quality issues and tasks must be integrated into the overall project tasks and timeline. What do I mean by the data quality point of view? Look at the life cycle of the information coming into the warehouse - planning for the data (this is where the project team comes in); obtaining the data (from source systems); applying the data (by knowledge workers, for example); maintaining or updating the data; and disposing (archiving or deleting) the data. Use the acronym POAMD to help you remember the information life cycle. This is similar to the four essential database operations: CRUD (Create-Read-Update-Delete). However, the information life cycle approach also takes the business needs, processes and people viewpoints into account along with the technology aspect. For example, in the obtain phase data comes into the warehouse from a source system. Do you have processes for validating the data? What happens if the data does not pass the validation tests? Do you have processes and responsibilities in place to handle content problems? Are there people on both the warehouse and source system sides who are responsible for communicating with each other and resolving the issues? For the maintain phase, what if the source system finds out several weeks after a load that an incorrect file was sent? Who takes care of the issue? Can you catch the problem at load time? Do you have metrics in place to monitor key information? These questions provide only an introduction into what needs to be considered. To touch on the question of risk - if data quality is one of the concerns, I strongly recommend completing some assessments to determine the extent of the quality problem. The results will help you estimate the time and cost needed to make the warehouse acceptable and useful to the business. The assessments could include such activities as data profiling, user interviews, process analysis, etc. The assessments you choose depend on the nature of the data quality concerns. The risk if you do not do this? You will not factor in some of the most important activities needed to ensure the data warehouse is a trusted source for the business. Yes, there is a cost to these activities. But a bigger cost is the business not using the warehouse because they have no confidence in the information. Clay Rehm's Answer: Since the data warehouse has already been developed, it will most likely be stored in a specific RDBMS. Is that RDBMS supported in your organization? If not, how easy will it be to perform a data conversion to your database? The warehouse is being populated through ETL processes and the business rules should be described there. Will you be able to access and modify those rules? Does your organization use the same ETL tools that the outsourcer used? The warehouse should have had data stewards or data owners. Who will those owners now be? Do they understand their roles in data quality, security, access, etc.? |
Sid Adelman is a principal in Sid Adelman & Associates, an organization specializing in planning and implementing data warehouses, in data warehouse and BI assessments, and in establishing effective data architectures and strategies. He is a regular speaker at DW conferences. Adelman chairs the "Ask the Experts" column on www.dmreview.com. He is a frequent contributor to journals that focus on data warehousing. He co-authored Data Warehouse Project Management and is the principal author on Impossible Data Warehouse Situations with Solutions from the Experts and Data Strategy. He can be reached at (818) 783-9634 or visit his Web site at www.sidadelman.com.
Clay Rehm, CCP, PMP, is president of Rehm Technology (www.rehmtech.com), a consulting firm specializing in data integration solutions. Rehm provides hands-on expertise in project management, assessments, methodologies, data modeling, database design, metadata and systems analysis, design and development. He has worked in multiple platforms and his experience spans operational and data warehouse environments. Rehm is a technical book editor and is a co-author of the book, Impossible Data Warehouse Situations with Solutions from the Experts. In addition, he is a Certified Computing Professional (CCP), a certified Project Management Professional (PMP), holds a Bachelors of Science degree in Computer Science and a Masters Degree in Software Engineering from Carroll College. He can be reached at clay.rehm@rehmtech.com.
Danette McGilvray is president and principal of Granite Falls Consulting, Inc., a firm specializing in information quality management to support key business processes around customer satisfaction, decision support and operational excellence. Projects include enterprise data integration programs, data warehousing strategies and best practices for large-scale ERP data migrations for Fortune 50 organizations. For more than ten years she led information quality initiatives at Hewlett-Packard and Agilent Technologies. An accomplished program manager and facilitator, she is an internationally respected expert on data profiling, metrics, quality, audits, benchmarking, and tool acquisition and implementation. McGilvray is an invited speaker at conferences throughout the U.S. and Europe, where she trains other industry experts in enterprise information management and data stewardship. You can reach her at danette@gfalls.com.
For more information on related topics, visit the following channels:


