
Solution Manual Practical Applications of Data Mining by Sang C. Suh

Solution Manual for Practical Applications of Data Mining, 1st Edition. Author: Sang C. Suh. The package includes the solution manual and the lecturer's PowerPoint slides for the book. The solution manual covers chapters 2 to 8, with one Word file per chapter, 222 pages in total. Contact me to access the complete document. Email: [email protected] WhatsApp: https://wa.me/message/2H3BV2L5TTSUF1 Telegram: https://t.me/solutionmanual

Uploaded by

Shamoon Yahoo
Copyright
© © All Rights Reserved


2.1) Use the database table in Fig. 2.28, which contains ten market transactions. Use the
Apriori algorithm to complete the association rule generation process. Experiment
with different values of support and confidence to observe how they control the
number of association rules generated.
Answer:

TID Items
1 1,3,4,5,6,7,9,10
2 2,4,5,7,8,9,10
3 1,2,3,4,10
4 2,4,5,6,7,10
5 1,3,4,5,7,8
6 3,5,7,8,9,10
7 1,3,4,8,9,10
8 2,5,6,7,8,9
9 1,3,4,5,6,7,10
10 1,2,3,4,5,7,8,9,10

C1 (candidate 1-itemsets)

Item Support
1 6
2 5
3 7
4 8
5 8
6 4
7 8
8 6
9 6
10 8

L1 (frequent 1-itemsets)

Item Support
3 7
4 8
5 8
7 8
10 8

© 2012 Jones and Bartlett Learning, LLC



Consider the items whose support is greater than 60% (a support count above 6 out of 10 transactions). These items are 3, 4, 5, 7, and 10. Pairing them gives the candidate 2-itemsets:
Item Sets Support
{3,4} 6
{3,5} 5
{3,7} 5
{3,10} 6
{4,5} 6
{4,7} 6
{4,10} 7
{5,7} 8
{5,10} 6
{7,10} 6

Keeping only the item sets whose support is greater than 60% leaves {4,10} and {5,7}, which involve items 4, 5, 7, and 10.
L2

Item Sets Support
{4,10} 7
{5,7} 8

Deriving the rules from L2 gives the following confidence levels, where the confidence of X => Y is support({X,Y}) / support({X}):

Rule Support Confidence
4 => 10 7 87.5%
5 => 7 8 100%
7 => 5 8 100%
10 => 4 7 87.5%
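
To experiment with other support and confidence values, the process above can be sketched in Python. This is a minimal illustration, not the textbook's implementation: the function names (`apriori`, `rules`) and the integer `min_count` threshold (support count strictly above this value) are assumptions chosen to mirror the "greater than 60%" cut used in the answer.

```python
from itertools import combinations

# Transactions copied from the table for problem 2.1 (TID 1..10).
transactions = [
    {1, 3, 4, 5, 6, 7, 9, 10},
    {2, 4, 5, 7, 8, 9, 10},
    {1, 2, 3, 4, 10},
    {2, 4, 5, 6, 7, 10},
    {1, 3, 4, 5, 7, 8},
    {3, 5, 7, 8, 9, 10},
    {1, 3, 4, 8, 9, 10},
    {2, 5, 6, 7, 8, 9},
    {1, 3, 4, 5, 6, 7, 10},
    {1, 2, 3, 4, 5, 7, 8, 9, 10},
]


def support_count(itemset):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)


def apriori(min_count):
    """Return {itemset: count} for itemsets whose count is strictly above min_count."""
    items = sorted(set().union(*transactions))
    level = [frozenset([i]) for i in items]
    frequent = {}
    k = 1
    while level:
        kept = {c: support_count(c) for c in level}
        kept = {c: s for c, s in kept.items() if s > min_count}
        frequent.update(kept)
        k += 1
        # Join step: unions of two frequent itemsets that have exactly k items.
        candidates = {a | b for a in kept for b in kept if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        level = [c for c in candidates
                 if all(frozenset(s) in kept for s in combinations(c, k - 1))]
    return frequent


def rules(frequent, min_confidence):
    """List (lhs, rhs, support_count, confidence) for rules meeting min_confidence."""
    out = []
    for itemset, count in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                confidence = count / frequent[lhs]
                if confidence >= min_confidence:
                    out.append((sorted(lhs), sorted(itemset - lhs), count, confidence))
    return sorted(out)


if __name__ == "__main__":
    for lhs, rhs, count, confidence in rules(apriori(6), 0.8):
        print(lhs, "=>", rhs, "support", count, "confidence", confidence)
```

Lowering `min_count` (for example to 5) admits more items into L1 and so more candidate pairs, while raising `min_confidence` filters the resulting rules; this is the trade-off the exercise asks you to observe.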

2.2) Consider the weighted directed graph shown in Fig. 2.29. Generate all possible
composite association rules meeting the support values of 1.0, 1.5, and 2.0.
Answer:

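Support factor (S.F.) = X / Y, where
X = (sum of the edge weights along the path) / (n - 1), with n the number of nodes on the path

Y = (sum of all edge weights in the graph) / |E|, with |E| the number of edges
For example, in the given graph the support factor for AB = (3 / 1) / (34 / 11) = 3 / 3.09 = 0.97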


Path S.F.
AB 3/3.09
AE 5/3.09
BA 2/3.09
BC 4/3.09
BE 2/3.09
CD 2/3.09
DE 3/3.09
DC 3/3.09
ED 4/3.09
EA 1/3.09
EC 5/3.09
ABC 3.5/3.09
ABE 2.5/3.09
AED 4.5/3.09
AEC 5/3.09
BAE 3.5/3.09
BCD 3/3.09
BEA 1.5/3.09
BED 3/3.09
BEC 3.5/3.09
CDE 2.5/3.09
DEA 2/3.09
DEC 4/3.09
EDC 3.5/3.09
EAB 2/3.09
ECD 3.5/3.09
ABCD 3/3.09
ABED 3/3.09
ABEC 3.33/3.09
AEDC 4/3.09
AECD 4/3.09
BAED 3.67/3.09
BAEC 4/3.09
BCDE 3/3.09
BEDC 3/3.09
BECD 3/3.09
CDEA 2/3.09
DEAB 2.33/3.09
EABC 2.67/3.09
ABCDE 3/3.09
ABEDC 3/3.09
ABECD 3/3.09
BAEDC 3.5/3.09
BAECD 3.5/3.09
BCDEA 2.5/3.09
CDEAB 2.25/3.09
DEABC 2.75/3.09
EABCD 2.5/3.09

The paths with a support factor >= 1.0 (and below 1.5) are

BC, ED, ABC, AED, BAE, BEC, DEC, EDC, ECD, ABEC, AEDC, AECD, BAED,
BAEC, BAEDC, BAECD
The paths with a support factor >= 1.5 are
AE, EC, AEC


There are no paths having support factor >= 2.0
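
The support-factor arithmetic can be checked with a short Python sketch. It assumes the edge weights implied by the single-edge rows of the path table (Fig. 2.29 itself is not reproduced here), and the helper names `support_factor`, `simple_paths`, and `paths_meeting` are hypothetical.

```python
# Edge weights inferred from the single-edge rows of the path table
# (an assumption, since the figure is not available in this text).
weights = {
    "AB": 3, "AE": 5, "BA": 2, "BC": 4, "BE": 2, "CD": 2,
    "DE": 3, "DC": 3, "ED": 4, "EA": 1, "EC": 5,
}


def average_edge_weight():
    """Y in the formula: total edge weight divided by the number of edges."""
    return sum(weights.values()) / len(weights)


def support_factor(path):
    """X / Y, where X is the mean weight of the edges along the path."""
    edges = [path[i:i + 2] for i in range(len(path) - 1)]
    x = sum(weights[e] for e in edges) / (len(path) - 1)
    return x / average_edge_weight()


def simple_paths():
    """Enumerate every directed path that visits no node twice."""
    nodes = sorted({node for edge in weights for node in edge})
    found = []

    def extend(path):
        for edge in weights:
            if edge[0] == path[-1] and edge[1] not in path:
                found.append(path + edge[1])
                extend(path + edge[1])

    for node in nodes:
        extend(node)
    return found


def paths_meeting(threshold):
    """Sorted list of simple paths whose support factor is at least threshold."""
    return sorted(p for p in simple_paths() if support_factor(p) >= threshold)


if __name__ == "__main__":
    for threshold in (1.0, 1.5, 2.0):
        print(threshold, paths_meeting(threshold))
```

Note that `paths_meeting(1.0)` also returns the paths that clear 1.5, because it applies a plain lower bound.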

2.3) The database table shown in Figure 2.30 contains a dataset with the following five attributes:
color (yellow, purple), size (large, small), act (stretch, dip), age (adult, child), and inflated (T, F). Follow
the method used in Section 2.6 for generating compact rules to derive all rules with full confidence.

Answer:

Color Size Act Age Inflated


Yellow Small Stretch Adult T
Yellow Small Stretch Child T
Yellow Small Dip Adult T
Yellow Small Dip Child T
Yellow Large Stretch Adult T
Yellow Large Stretch Child F
Yellow Large Dip Adult F
Yellow Large Dip Child F
Purple Small Stretch Adult T
Purple Small Stretch Child F
Purple Small Dip Adult F
Purple Small Dip Child F
Purple Large Stretch Adult T
Purple Large Stretch Child F
Purple Large Dip Adult F
Purple Large Dip Child F

Color Count
Yellow 8
Purple 8

Size Count
Small 8
Large 8

Rules:
If (Color=Yellow) and (Size=Small) and (Act=Stretch) then Inflated is T.
If (Color=Yellow) and (Size=Large) and (Act=Dip) then Inflated is F.
If (Color=Purple) and (Size=Large) and (Act=Dip) then Inflated is F.

Two-attribute subtables for Color, Size, and Act:

Color \ Size Small(count) Large(count)
Yellow 4 4
Purple 4 4

Color \ Act Stretch(count) Dip(count)
Yellow 4 4
Purple 4 4

Size \ Act Stretch(count) Dip(count)
Small 4 4
Large 4 4

SADs from Color, Size, and Act to Inflated:


Inflated T F
Color
Yellow 5/8 3/8
Purple 2/8 6/8

Inflated T F
Size
Small 5/8 3/8
Large 2/8 6/8

Inflated T F
Act
Stretch 5/8 3/8
Dip 2/8 6/8

SADs from two condition events to Inflated:


Inflated
Color Size T F
Yellow Small 1 0
Yellow Large 1/4 3/4
Purple Small 1/4 3/4
Purple Large 1/4 3/4


Inflated
Color Act T F
Yellow Stretch 3/4 1/4
Yellow Dip 1/2 1/2
Purple Stretch 1/2 1/2
Purple Dip 0 1

Inflated
Act Size T F
Stretch Small 3/4 1/4
Stretch Large 1/2 1/2
Dip Small 1/2 1/2
Dip Large 0 1

Rule 1: If (Color=Yellow) and (Size=Small) THEN (Inflated=T)

Rule 2: If (Color=Purple) and (Act=Dip) THEN (Inflated=F)

Rule 3: If (Act=Dip) and (Size=Large) THEN (Inflated=F)

SADs with three condition events:

Inflated
Color Size Act T F
Yellow Large Stretch 1/2 1/2
Purple Small Stretch 1/2 1/2
Purple Large Stretch 1/2 1/2
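
The full-confidence rules over Color, Size, and Act can be cross-checked by brute force. The sketch below is not the Section 2.6 procedure itself; it simply groups the 16 rows by every combination of condition attributes and keeps the groups in which all rows share one outcome. The function name `full_confidence_rules` is an assumption.

```python
from collections import Counter
from itertools import combinations

ATTRS = ("Color", "Size", "Act", "Age")

# The 16 rows of the table for problem 2.3; the last field is Inflated.
rows = [
    ("Yellow", "Small", "Stretch", "Adult", "T"),
    ("Yellow", "Small", "Stretch", "Child", "T"),
    ("Yellow", "Small", "Dip", "Adult", "T"),
    ("Yellow", "Small", "Dip", "Child", "T"),
    ("Yellow", "Large", "Stretch", "Adult", "T"),
    ("Yellow", "Large", "Stretch", "Child", "F"),
    ("Yellow", "Large", "Dip", "Adult", "F"),
    ("Yellow", "Large", "Dip", "Child", "F"),
    ("Purple", "Small", "Stretch", "Adult", "T"),
    ("Purple", "Small", "Stretch", "Child", "F"),
    ("Purple", "Small", "Dip", "Adult", "F"),
    ("Purple", "Small", "Dip", "Child", "F"),
    ("Purple", "Large", "Stretch", "Adult", "T"),
    ("Purple", "Large", "Stretch", "Child", "F"),
    ("Purple", "Large", "Dip", "Adult", "F"),
    ("Purple", "Large", "Dip", "Child", "F"),
]


def full_confidence_rules(condition_attrs=("Color", "Size", "Act")):
    """Return (conditions, outcome) pairs that hold for every matching row.

    Shorter rules are found first; a longer rule is skipped when a shorter
    rule already covers it, which keeps the rule set compact.
    """
    found = []
    for k in range(1, len(condition_attrs) + 1):
        for attrs in combinations(condition_attrs, k):
            idx = [ATTRS.index(a) for a in attrs]
            outcomes = {}
            for row in rows:
                key = tuple(row[i] for i in idx)
                outcomes.setdefault(key, Counter())[row[-1]] += 1
            for key, counter in outcomes.items():
                if len(counter) != 1:
                    continue  # mixed outcomes, confidence below 100%
                cond = dict(zip(attrs, key))
                if any(prev.items() <= cond.items() for prev, _ in found):
                    continue  # already implied by a shorter rule
                found.append((cond, next(iter(counter))))
    return found


if __name__ == "__main__":
    for cond, outcome in full_confidence_rules():
        print(cond, "=> Inflated =", outcome)
```

Passing `condition_attrs=ATTRS` would also bring Age into the search, which the answer above does not do.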

2.4) The major-salary table in Fig 2.31 shows 30 instances of IT professionals in terms of four
attributes: age, education, major and salary. Construct conceptual hierarchies for age, education,
major and salary. Generate interesting learning tasks for deriving discrimination and characteristic
rules from the table. Use the AO induction method to derive either discrimination or characteristic
rules based on the learning tasks generated.

Answer:

The conceptual hierarchies for Age, Edu, Salary, and Major are built over the following value sets:

Age: {28,29,31,32,34,35,36,37,38,39,41,42,45,49,52,54,55,56}

Major: {EE, CS, MIS, DS}

Edu: {Ph.D, B.S, M.S}

Salary: {41k,43k,44k,45k,46k,55k,56k,58k,60k,62k,64k,65k,67k,68k,69k,70k,72k,73k,75k,77k}

Let us assume that

Age (28-to-48) is Middle Age (M)

Age (49-to-56) is Old Age (O)

Major (EE) is Electrical

Major (CS, MIS, DS) is Science

Salary: (40k-50k) is M1

(51k-60k) is M2

(61k-70k) is M3

(71k-80k) is M4

(81k-90k) is M5

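Using attribute removal, we drop the "Edu" field. After removing it and generalizing the remaining attributes, the table is as follows:

Age Major Salary Vote
M Electrical M3 2
M Electrical M2 2
M Electrical M1 2
O Electrical M3 1
M Electrical M1 2
M Science M2 1
M Science M4 4
M Science M3 1
O Science M3 1
M Science M3 3
O Science M3 2
O Science M4 1
M Science M3 1
M Science M1 2
M Science M1 1
M Science M3 2
M Science M4 1
M Science M2 1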


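Merging identical tuples and adding their votes gives:

Age Major Salary Vote
M Electrical M3 2
M Electrical M2 2
M Electrical M1 4
O Electrical M3 1
M Science M1 3
M Science M2 2
M Science M3 7
M Science M4 5
O Science M3 3
O Science M4 1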
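
The vote-merging step of attribute-oriented induction can be sketched as follows. The rows are the generalized (Age, Major, Salary, Vote) tuples from the first table above, with the Major column filled in on the assumption that the first five rows are Electrical and the rest Science (the layout that makes the merged totals agree). The function name `merge_votes` is hypothetical.

```python
from collections import Counter

# Generalized tuples (Age, Major, Salary, Vote) after removing "Edu".
generalized = [
    ("M", "Electrical", "M3", 2),
    ("M", "Electrical", "M2", 2),
    ("M", "Electrical", "M1", 2),
    ("O", "Electrical", "M3", 1),
    ("M", "Electrical", "M1", 2),
    ("M", "Science", "M2", 1),
    ("M", "Science", "M4", 4),
    ("M", "Science", "M3", 1),
    ("O", "Science", "M3", 1),
    ("M", "Science", "M3", 3),
    ("O", "Science", "M3", 2),
    ("O", "Science", "M4", 1),
    ("M", "Science", "M3", 1),
    ("M", "Science", "M1", 2),
    ("M", "Science", "M1", 1),
    ("M", "Science", "M3", 2),
    ("M", "Science", "M4", 1),
    ("M", "Science", "M2", 1),
]


def merge_votes(rows):
    """Collapse identical (Age, Major, Salary) tuples and sum their votes."""
    votes = Counter()
    for age, major, salary, vote in rows:
        votes[(age, major, salary)] += vote
    return votes


if __name__ == "__main__":
    for key, vote in sorted(merge_votes(generalized).items()):
        print(*key, vote)
```

The merged votes still sum to the 30 instances of the original table, which is a quick sanity check on any generalization step.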

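Column groups: Age (20-39, 50-59), Spectacle (Myope, Hypermetrope), Astigmatic (Yes, No), TearProduction (Reduced, Normal), Contact lens (Soft, Hard, None)

ID 20-39 50-59 Myope Hypermetrope Yes No Reduced Normal Soft Hard None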
1 1 0 1 0 0 1 1 0 0 0 1
2 1 0 1 0 0 1 0 1 1 0 0
3 1 0 1 0 1 0 1 0 0 0 1
4 1 0 1 0 1 0 0 1 0 1 0
5 1 0 0 1 0 1 1 0 0 0 1
6 1 0 0 1 0 1 0 1 1 0 0
7 1 0 0 1 1 0 1 0 0 0 1
8 1 0 0 1 1 0 0 1 0 1 0
9 1 0 1 0 0 1 1 0 0 0 1
10 1 0 1 0 0 1 0 1 1 0 0
11 1 0 1 0 1 0 1 0 0 0 1
12 1 0 1 0 1 0 0 1 0 1 0
13 1 0 0 1 0 1 1 0 0 0 1
14 1 0 0 1 0 1 0 1 1 0 0
15 1 0 0 1 1 0 1 0 0 0 1
16 1 0 0 1 1 0 0 1 0 0 1
17 0 1 1 0 0 1 1 0 0 0 1
18 0 1 1 0 0 1 0 1 0 0 1
19 0 1 1 0 1 0 1 0 0 0 1
20 0 1 1 0 1 0 0 1 0 1 0
21 0 1 0 1 0 1 1 0 0 0 1
22 0 1 0 1 0 1 0 1 1 0 0
23 0 1 0 1 1 0 1 0 0 0 1
24 0 1 0 1 1 0 0 1 0 0 1


Fig 2.32 shows a contact lenses table that contains information about contact lens prescriptions.
From the table, derive quantitative association rules by mapping the table to Boolean association rules.

Rules Support Confidence
<Age:20..39> && <Spectacle:Hypermetrope> => <Astigmatic:No> 16.7% 50%
<Age:50..59> && <Spectacle:Myope> => <Contact:None> 12.5% 75%

2.5) Figure 2.32 shows a contact lens table that contains information about contact lens
prescriptions (hard, soft, and no contact lens). From the table, derive quantitative association rules by
mapping the table to Boolean association rules.

ID Age Spectacle Astigmatic Tear Production Contact lens
1 21 Myope No Reduced None
2 24 Myope No Normal Soft
3 20 Myope Yes Reduced None
4 26 Myope Yes Normal Hard
5 27 Hypermetrope No Reduced None
6 22 Hypermetrope No Normal Soft
7 28 Hypermetrope Yes Reduced None
8 27 Hypermetrope Yes Normal Hard
9 38 Myope No Reduced None
10 32 Myope No Normal Soft
11 36 Myope Yes Reduced None
12 37 Myope Yes Normal Hard
13 33 Hypermetrope No Reduced None
14 32 Hypermetrope No Normal Soft
15 39 Hypermetrope Yes Reduced None
16 34 Hypermetrope Yes Normal None
17 52 Myope No Reduced None
18 51 Myope No Normal None
19 50 Myope Yes Reduced None
20 54 Myope Yes Normal Hard
21 52 Hypermetrope No Reduced None
22 55 Hypermetrope No Normal Soft
23 58 Hypermetrope Yes Reduced None
24 54 Hypermetrope Yes Normal None


Answer:

Rearranging the table using Boolean entries

Dividing the age column into 3 categories: 20..29, 30..39, 50..59
Abbreviations used in the table:
Spec: Spectacle,
Astig: Astigmatism,
Tear Prd: Tear production.

Column order: ID, Age <20..29>, Age <30..39>, Age <50..59>, Spec: Myope, Spec: Hypermetrope, Astig: No, Astig: Yes, Tear Prd: Normal, Tear Prd: Reduced, Lens: None, Lens: Soft, Lens: Hard

ID 20..29 30..39 50..59 Myope Hyper No Yes Norm Red None Soft Hard
1 1 0 0 1 0 1 0 0 1 1 0 0
2 1 0 0 1 0 1 0 1 0 0 1 0
3 1 0 0 1 0 0 1 0 1 1 0 0
4 1 0 0 1 0 0 1 1 0 0 0 1
5 1 0 0 0 1 1 0 0 1 1 0 0
6 1 0 0 0 1 1 0 1 0 0 1 0
7 1 0 0 0 1 0 1 0 1 1 0 0
8 1 0 0 0 1 0 1 1 0 0 0 1
9 0 1 0 1 0 1 0 0 1 1 0 0
10 0 1 0 1 0 1 0 1 0 0 1 0
11 0 1 0 1 0 0 1 0 1 1 0 0
12 0 1 0 1 0 0 1 1 0 0 0 1
13 0 1 0 0 1 1 0 0 1 1 0 0
14 0 1 0 0 1 1 0 1 0 0 1 0
15 0 1 0 0 1 0 1 0 1 1 0 0
16 0 1 0 0 1 0 1 1 0 1 0 0
17 0 0 1 1 0 1 0 0 1 1 0 0
18 0 0 1 1 0 1 0 1 0 1 0 0
19 0 0 1 1 0 0 1 0 1 1 0 0
20 0 0 1 1 0 0 1 1 0 0 0 1
21 0 0 1 0 1 1 0 0 1 1 0 0
22 0 0 1 0 1 1 0 1 0 0 1 0
23 0 0 1 0 1 0 1 0 1 1 0 0
24 0 0 1 0 1 0 1 1 0 1 0 0
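
The mapping from a quantitative record to Boolean columns can be sketched as below. The column order follows the Boolean table above; the names `AGE_BINS`, `COLUMNS`, and `to_boolean` are illustrative assumptions, and any spectacle value other than "Myope" is treated as the hypermetrope category.

```python
# Age intervals used to discretize the quantitative Age attribute.
AGE_BINS = [(20, 29), (30, 39), (50, 59)]

COLUMNS = [
    "Age<20..29>", "Age<30..39>", "Age<50..59>",
    "Spec:Myope", "Spec:Hypermetrope",
    "Astig:No", "Astig:Yes",
    "TearPrd:Normal", "TearPrd:Reduced",
    "Lens:None", "Lens:Soft", "Lens:Hard",
]


def to_boolean(record):
    """Map (age, spectacle, astigmatic, tear production, lens) to 0/1 flags."""
    age, spec, astig, tear, lens = record
    flags = [int(low <= age <= high) for low, high in AGE_BINS]
    flags += [int(spec == "Myope"), int(spec != "Myope")]
    flags += [int(astig == "No"), int(astig == "Yes")]
    flags += [int(tear == "Normal"), int(tear == "Reduced")]
    flags += [int(lens == value) for value in ("None", "Soft", "Hard")]
    return flags


if __name__ == "__main__":
    # Records 1 and 16 of the contact lens table.
    for record in [(21, "Myope", "No", "Reduced", "None"),
                   (34, "Hypermetrope", "Yes", "Normal", "None")]:
        print(dict(zip(COLUMNS, to_boolean(record))))
```

Each attribute contributes exactly one 1 per record, so every Boolean row has five ones, one per original attribute.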
1. (Tear Production: Reduced) => (Contact lens: None)
Support: 50%, Confidence: 100%
2. (Tear Production: Reduced)^(Astigmatism: Yes) => (Contact lens: None)
Support: 25%, Confidence: 100%
3. (Tear Production: Reduced)^(Astigmatism: Yes)^(Spectacle: Myope) => (Contact lens: None)
Support: 12.5%, Confidence: 100%
4. (Astigmatism: No)^(Tear Production: Reduced)^(Spectacle: Myope) => (Contact lens: None)
Support: 12.5%, Confidence: 100%
5. (Astigmatism: No)^(Tear Production: Normal)^(Spectacle: Hypermetrope) => (Contact lens: Soft)
Support: 12.5%, Confidence: 100%
6. (Spectacle: Myope)^(Astigmatism: No)^(Tear Production: Reduced) => (Contact lens: None)
Support: 12.5%, Confidence: 100%
7. (Tear Production: Normal)^(Astigmatism: Yes) => (Contact lens: Hard)
Support: 16.7%, Confidence: 66.7%
8. (Age: <20..29>)^(Spectacle: Hypermetrope)^(Astigmatism: No)^(Tear Production: Reduced) => (Contact lens: None)
Support: 4.2%, Confidence: 100%
9. (Age: <30..39>)^(Astigmatism: No)^(Tear Production: Normal) => (Contact lens: Soft)
Support: 8.3%, Confidence: 100%
10. (Age: <30..39>)^(Astigmatism: No)^(Tear Production: Normal)^(Spectacle: Hypermetrope) => (Contact lens: Soft)
Support: 4.2%, Confidence: 100%
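
Support and confidence for any candidate rule can be recomputed directly from the 24 records. The sketch below rebuilds the table from its regular structure (age group, spectacle, astigmatism, and tear production cycle in the same order as IDs 1 to 24) and pairs each row with the lens column copied from the table; `support_confidence` and the dictionary keys are hypothetical names.

```python
from itertools import product

# Contact lens column for IDs 1..24, copied from the table for problem 2.5.
LENS = [
    "None", "Soft", "None", "Hard", "None", "Soft", "None", "Hard",
    "None", "Soft", "None", "Hard", "None", "Soft", "None", "None",
    "None", "None", "None", "Hard", "None", "Soft", "None", "None",
]

# The other attributes follow a regular pattern that matches the ID order.
records = [
    {"age": age, "spec": spec, "astig": astig, "tear": tear, "lens": lens}
    for (age, spec, astig, tear), lens in zip(
        product(("20-29", "30-39", "50-59"),
                ("Myope", "Hypermetrope"),
                ("No", "Yes"),
                ("Reduced", "Normal")),
        LENS,
    )
]


def support_confidence(antecedent, consequent):
    """Return (support, confidence) of antecedent => consequent as fractions."""
    lhs = [r for r in records
           if all(r[k] == v for k, v in antecedent.items())]
    both = [r for r in lhs
            if all(r[k] == v for k, v in consequent.items())]
    support = len(both) / len(records)
    confidence = len(both) / len(lhs) if lhs else 0.0
    return support, confidence


if __name__ == "__main__":
    s, c = support_confidence({"tear": "Normal", "astig": "Yes"}, {"lens": "Hard"})
    print(f"support {s:.1%}, confidence {c:.1%}")
```

Because support counts rows matching both sides over all 24 records, adding a condition to the antecedent can only lower support or leave it unchanged.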
