
GRADING

The document outlines a grading rubric for a data mining assignment. It provides points for various tasks related to data cleaning, exploration, and modification. It awards points for appropriately dropping or justifying variables, dummy coding, identifying data types, discussing missing data strategies, binning variables, identifying outliers, descriptive statistics, frequencies, and correlations. It also awards points for technical execution and presentation of these tasks.

Uploaded by

Yo Tu

Grading Rubric for Week 4, Assignment 2: Data Mining Part 1

CODING/CLEANSING                                                  Points Available  Points Earned
Drop any variables? Appropriate? Justified?                       3
Appropriate dummy codes for alphanumeric data
Dummy code key                                                    2
Binary/nominal/ordinal identified correctly                       5
Discussion of dummy coding decisions - including, perhaps,
  not coding every variable provided                              5
Discussion of appropriate missing value strategy                  5
Recognize the "cost" of the missing value strategy used           5
Bin at least one variable - discussion of binning -
  appropriate variable / bin size                                 3
Recognize the "cost" of the binning strategy used                 2
Identification of outliers and/or potentially incorrect data      3
Decision for addressing outliers and/or potentially incorrect
  data including the "cost" of this decision                      2

EXPLORE
Discussion of descriptive statistics on continuous variables      5
Discussion of frequency statistics on categorical variables       5
Discussion of at least 3 correlations                             5

MODIFY
Variable transformation (done correctly, appropriate variable)
  in addition to binning and dummy coding
Selection of appropriate transformation and discussion            5

SUBMISSION
Prepare your discussion and labeled summary tables in
  one document                                                    5
Professionalism (organization, labeled variables/sheets,
  tone, typos, etc.)                                              5
Excel file posted                                                 REQUIRED
SUBTOTAL FOR CONCEPTS/DISCUSSION                                  65

GRAND TOTAL (Discussion and technical elements)                   100
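
As an illustration of the coding/cleansing items above, here is a minimal sketch in Python. The rubric expects the work in Excel, so Python is a substitute used only for illustration. The five listings, the column names (`price`, `sqft`, `style`), the bin cut points, the choice of median imputation, and the 1.5 x IQR outlier rule are all assumptions made for this example; none of them come from the assignment's Real Estate data.

```python
from statistics import median, quantiles

# Hypothetical listings for illustration only; not the assignment's data.
listings = [
    {"price": 210_000, "sqft": 1400, "style": "ranch"},
    {"price": 255_000, "sqft": None, "style": "colonial"},
    {"price": 198_000, "sqft": 1250, "style": "ranch"},
    {"price": 1_950_000, "sqft": 1600, "style": "condo"},
    {"price": 240_000, "sqft": 1500, "style": "colonial"},
]

# Dummy coding: one 0/1 column per category, with a key documenting the codes.
# (A model would typically drop one column as the reference category.)
styles = sorted({row["style"] for row in listings})
dummy_key = {style: f"style_{style}" for style in styles}
for row in listings:
    for style, column in dummy_key.items():
        row[column] = int(row["style"] == style)

# Missing value strategy (assumed here: median imputation).
# Cost: imputed values shrink the variable's spread and are not real observations.
known_sqft = [row["sqft"] for row in listings if row["sqft"] is not None]
sqft_median = median(known_sqft)
imputed_count = sum(1 for row in listings if row["sqft"] is None)
for row in listings:
    if row["sqft"] is None:
        row["sqft"] = sqft_median

# Binning. Cost: detail within each bin is lost.
def sqft_bin(sqft):
    """Assign a size bin; the cut points 1300 and 1550 are arbitrary."""
    if sqft < 1300:
        return "small"
    if sqft < 1550:
        return "medium"
    return "large"

for row in listings:
    row["sqft_bin"] = sqft_bin(row["sqft"])

# Outlier screening with the 1.5 * IQR rule (one common convention among several).
prices = [row["price"] for row in listings]
q1, _, q3 = quantiles(prices, n=4, method="inclusive")
fence = 1.5 * (q3 - q1)
outliers = [p for p in prices if p < q1 - fence or p > q3 + fence]

print(dummy_key)
print(f"imputed {imputed_count} sqft value(s) with median {sqft_median}")
print([row["sqft_bin"] for row in listings])
print("possible outliers:", outliers)
```

Each step leaves something to discuss in the write-up: which category the dummy key maps to which column, how many values were imputed, what the bin boundaries hide, and whether a flagged price is an error or a genuine high-end listing.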


Grading Rubric for Week 4, Assignment 2: Data Mining Part 1 (Real Estate data)

TECHNICAL ELEMENTS                                                Points Available  Points Earned
Execution of appropriate missing value strategy                   3
Execution of dummy coding                                         3
  Technique:
Bin at least one variable
  Technique:
  Done correctly                                                  3
  Presented professionally                                        1
Appropriate use of pivot table
  Done correctly                                                  3
  Presented professionally                                        1
Descriptive statistics on continuous variables
  Technique:
  Done correctly                                                  5
  Presented professionally                                        2
Frequency statistics on categorical variables
  Technique:
  Done correctly                                                  5
  Presented professionally                                        2
Correlation table
  Done correctly                                                  5
  Presented professionally                                        2
SUBTOTAL FOR TECHNICAL ELEMENTS                                   35
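
The exploration items above (descriptive statistics, frequency statistics, and a correlation table) can be sketched as follows. This is an illustrative Python stand-in for what the rubric expects to be done in Excel, for example with a pivot table for the frequencies. All values and variable names (`sqft`, `price`, `age`, `style`) are hypothetical and do not come from the assignment's data.

```python
from collections import Counter
from itertools import combinations
from math import sqrt
from statistics import mean, median, stdev

# Hypothetical values for illustration only; not the assignment's data.
continuous = {
    "sqft": [1250, 1400, 1500, 1600, 1750],
    "price": [198_000, 210_000, 240_000, 255_000, 280_000],
    "age": [40, 35, 20, 15, 5],
}
style = ["ranch", "colonial", "ranch", "condo", "colonial"]

def describe(values):
    """Descriptive statistics for one continuous variable."""
    return {
        "n": len(values),
        "mean": mean(values),
        "median": median(values),
        "stdev": stdev(values),  # sample standard deviation
        "min": min(values),
        "max": max(values),
    }

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Descriptive statistics on continuous variables.
summary = {name: describe(values) for name, values in continuous.items()}

# Frequency statistics on a categorical variable: counts and percentages.
counts = Counter(style)
frequencies = {cat: (n, 100 * n / len(style)) for cat, n in counts.items()}

# Correlation table: every pair of continuous variables (3 pairs here).
correlations = {
    (a, b): pearson(continuous[a], continuous[b])
    for a, b in combinations(continuous, 2)
}

for name, stats in summary.items():
    print(name, stats)
for cat, (n, pct) in frequencies.items():
    print(f"{cat}: {n} ({pct:.0f}%)")
for (a, b), r in correlations.items():
    print(f"{a} vs {b}: r = {r:.2f}")
```

Labeling each table (variable names, units, and what each statistic is) is what the "Presented professionally" rows reward; the numbers alone do not earn those points.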
