MGS
8040 Data Mining
Assignment
2: Business Application and Data
1. Write
a few sentences describing how data-mining might be used in a particular
industry.
Answer the following pertaining to the
industry application you discussed above.
2. Identify
one or more dependent variables (y), at least one of which is categorical.
3. Identify
some independent variables that might be used to predict the categorical y.
4. Create
a spreadsheet with 10 sample records (you can make up the data) that show
reasonable values for any five Xs and the Y. Copy and
paste as a table in the Word document that you will turn in. Clearly describe
what each variable is.
5. What
would be a reasonable outcome period for this model to predict y?
6. What
would be a reasonable sample time frame for the prediction/classification
problem? Give a date range, and explain in a sentence or two what must happen
during that time frame for a record to qualify to be in your sample.