MPCS 53017: Final Project Report
Due by Wednesday, March 19, 2014
Final Report
In the final report, you will put together all the documents you
created throughout the quarter and show some sample data from your data
warehouse.
Your submission should include:
- Revised description of your data warehouse (2-3 paragraphs).
Please note any important changes you made after going through the
design and development process.
- The current star schema(s) of your data warehouse.
- Up to 5 Source-to-Target Data Maps and their SQL implementation.
Please note any other languages used (Python, shell scripts, Java,
etc.).
- Six questions about the data in your data warehouse, expressed in
English, including at least 2 reporting-type questions. A simple
example of a reporting-type question is "compare total dollar sales
for two consecutive months across two product categories".
- For each of the 6 questions, show the SQL query or the
sequence of SQL queries that gathers the data necessary to answer the
question. For some questions, you may find it easier to use other
software (e.g. statistical tools such as R, Matlab and Stata) to
implement part of the sequence. In such cases, state the input and
output of each such tool; you do not need to provide the actual code.
- Show actual data warehouse results for at least one reporting query
and one other query. Your results should include the query and an
excerpt of the resulting relation (up to 20 rows).
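
As an illustration of the reporting-type question quoted above, here is
a minimal sketch, not a required format. It assumes a hypothetical star
schema (fact_sales, dim_date, dim_product), made-up sample rows, and an
in-memory SQLite database driven from Python; your own table names,
columns, and database will differ. The LIMIT clause keeps the excerpt
within the 20-row cap.

```python
import sqlite3

# Hypothetical star schema with made-up rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE fact_sales (
    date_key INTEGER REFERENCES dim_date(date_key),
    product_key INTEGER REFERENCES dim_product(product_key),
    sales_dollars REAL
);
INSERT INTO dim_date VALUES (1, 2014, 1), (2, 2014, 2);
INSERT INTO dim_product VALUES (10, 'Books'), (20, 'Toys');
INSERT INTO fact_sales VALUES
    (1, 10, 100.0), (1, 10, 50.0), (1, 20, 80.0),
    (2, 10, 120.0), (2, 20, 30.0), (2, 20, 60.0);
""")

# Reporting query: total dollars per category for two consecutive months,
# pivoted into one column per month so the months can be compared side by side.
REPORT_QUERY = """
SELECT p.category,
       SUM(CASE WHEN d.month = 1 THEN f.sales_dollars ELSE 0 END) AS jan_total,
       SUM(CASE WHEN d.month = 2 THEN f.sales_dollars ELSE 0 END) AS feb_total
FROM fact_sales AS f
JOIN dim_date AS d ON d.date_key = f.date_key
JOIN dim_product AS p ON p.product_key = f.product_key
WHERE d.year = 2014
  AND d.month IN (1, 2)
  AND p.category IN ('Books', 'Toys')
GROUP BY p.category
ORDER BY p.category
LIMIT 20
"""

# Fetch the excerpt and print the query followed by its rows.
rows = conn.execute(REPORT_QUERY).fetchall()
print(REPORT_QUERY.strip())
for row in rows:
    print(row)
```

Each output row holds a category followed by its two monthly totals,
which is the shape of excerpt the last item above asks for.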
You will submit your report via phoenixforge. Please add all of
the relevant files to the reports directory and commit them.
Instructions on how to use phoenixforge are available here.
You should also prepare a team presentation (15-30 minutes) for
our class on March 17. The presentation can include a live demo
of querying your data warehouse, a discussion of your design process,
and any interesting problems you had to solve in order to implement the
ETL process. You can also discuss how your initial ideas (questions,
data sources, approach) changed throughout the quarter.