MPCS 53017: Draft proposal
Due on Monday, February 3, 2014
Draft proposal
First, form a team with one or two classmates. If you would like to
work on the project on your own, please discuss it with the
instructor. You can organize your team and distribute the work among
team members in any way you like, but please make sure that everyone
understands (though not necessarily implements) all aspects of the
project.
Your draft proposal should discuss the following points:
- An overview of the proposed project.
- What key questions would you like to answer by building the
proposed data warehouse? Please be as specific as possible, and list
up to 10 questions. For example, "analyzing crime in Chicago" is too
general, while "exploring the relationship between crime and weather
patterns such as precipitation, temperature, and wind" is right on
point.
- What data sources would you like to use? Please be as exhaustive
as possible, but prioritize the data sources, since you will likely
end up using only a few of your top choices. Please include specific
datasets, such as City of Chicago: Crimes - 2001 to present.
- For each data source, list details such as whether it is available
via an API or as a flat file, its size in number of tuples and disk
volume, and any limitations. For example, the crime dataset contains
5M+ records, is about 1.7GB, and is available both as a downloadable
CSV file and via an API.
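The size details asked for above (number of tuples and disk volume) can be measured directly once a flat file has been downloaded. The following is only a minimal sketch, assuming a local UTF-8 CSV file with a single header row; the function name summarize_csv and its has_header parameter are hypothetical and not part of the assignment.

```python
import csv
import os


def summarize_csv(path, has_header=True):
    """Return (record_count, size_in_bytes) for a local CSV file.

    Assumes the file is UTF-8 encoded. The csv module is used instead of
    counting raw lines so that quoted fields containing newlines are
    still counted as a single record.
    """
    size_bytes = os.path.getsize(path)
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        if has_header:
            next(reader, None)  # skip the header row if the file has one
        record_count = sum(1 for _ in reader)
    return record_count, size_bytes
```

Calling summarize_csv on a downloaded file returns the record count and the size in bytes, which can be reported in the proposal alongside the access method and any limitations of the data source.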
Please avoid any data sources that require screen-scraping
(extracting data from the HTML pages of a web site), and list only
data sources that are publicly available.
Submit your proposal by emailing your first draft as a PDF or text
document to the instructor. Please use the following title for your
submission: mpcsdw draft proposal.