The document discusses using decision trees to build a model on fraud data that classifies individuals with taxable incomes of $30,000 or less as "Risky" and those with higher incomes as "Good". The model would use the following variables: whether the individual is an undergrad, their marital status, taxable income, work experience, and whether they live in an urban area.
Download as TXT, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
219 views
Problem Statement (Fraudcheck)
The document discusses using decision trees to build a model on fraud data that classifies individuals with taxable incomes of $30,000 or less as "Risky" and those with higher incomes as "Good". The model would use the following variables: whether the individual is an undergrad, their marital status, taxable income, work experience, and whether they live in an urban area.
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 1
Use decision trees to prepare a model on fraud data
treating those who have taxable_income <= 30000 as "Risky" and others are "Good"
Data Description :
Undergrad : person is under graduated or not
Marital.Status : marital status of a person Taxable.Income : Taxable income is the amount of how much tax an individual owes to the government Work Experience : Work experience of an individual person Urban : Whether that person belongs to urban area or not