This article relies too much on references to primary sources. (December 2010) (Learn how and when to remove this template message) |
The Kimball Lifecycle is a methodology for developing data warehouses, and has been developed by Ralph Kimball and a variety of colleagues.
According to Kimball et al., this phase is the start of the lifecycle. It is a planning phase in which project is a single iteration of the lifecycle while program is the broader coordination of resources. When launching a project/program Kimball et al. suggests following three focus areas:
This is an ongoing discipline in the project. The purpose is to keep the project/program on course, develop a communication plan and manage expectations.
This phase/milestone of the project is about making the project team understand the business requirements. Its purpose is to establish a foundation for all the following activities in the lifecycle. Kimball et al. makes it clear that it is important for the project team to talk with the business users and be prepared to focus on listening and to document the interview. An output of this step is the Enterprise bus matrix.
The top track holds two milestones:
Dimensional Modeling is a process in which the business requirements are used to design dimensional models for the system.
Physical Design is the phase where the database is designed. It involves the database environment as well as security.
ETL Design & Development is the design of some of the heavy procedures in the DW/BI-system (Extract, Transform, Load). Kimball et al. suggests four parts to this process, which are further divided into 34 subsystems (Kimball et al., 2008):
BI Application Design deals with designing and selecting some applications to support the business requirements. BI Application Development use the design to develop and validate applications to support the business requirements.
When the three tracks are complete they all end up in the final deployment. This phase requires planning and should include pre-deployment testing, documentation, training and maintenance and support.
When the deployment has finished the system will need proper maintenance to stay alive. This includes data reconciliation, execution and monitoring and performance tuning.
As the project can be seen as part of the larger iterative program, it is likely that the system will want to expand. There will be projects to add new data as well as reaching new segments of the business areas. The lifecycle then starts over again.
Kimball, R., Ross, M., Thornthwaite, W., Mundy, J., & Becker, B. (2008). The data warehouse lifecycle toolkit (2nd ed.). Wiley Publishing, Inc. ISBN:978-0-470-14977-5