Page History: Data Mining (Fall, 2012)
Compare Page Revisions
120
119
118
117
116
115
114
113
112
111
110
109
108
107
106
105
104
103
102
101
100
99
98
97
96
95
94
93
92
91
90
89
88
87
86
85
84
83
82
81
80
79
78
77
76
75
74
73
72
71
70
69
68
67
66
65
64
63
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
47
46
45
44
43
42
41
40
39
38
37
36
35
34
33
32
31
30
29
28
27
26
25
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
0
Current
120
119
118
117
116
115
114
113
112
111
110
109
108
107
106
105
104
103
102
101
100
99
98
97
96
95
94
93
92
91
90
89
88
87
86
85
84
83
82
81
80
79
78
77
76
75
74
73
72
71
70
69
68
67
66
65
64
63
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
47
46
45
44
43
42
41
40
39
38
37
36
35
34
33
32
31
30
29
28
27
26
25
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
0
« Older Revision
-
Back to Page History
-
Newer Revision »
Page Revision: 2012/08/17 13:04
Information
Course Number
: 081202B3
To
: M. Sc. students of Department of Computer Science and Technology, Nanjing University.
Number of Students
: 150
Classroom
: 233, Computer Science and Technology Building, Xianlin Campus
Time
: 16:00 -- 17:50, Wednesday
Office Hour
: 14:30 - 15:30, Wednesday (Rm 917)
Text Book
: D. Hand, H. Mannila, P. Smyth. Principles of Data Mining. MIT Press, MA:Cambridge, 2001.
Main Reference Books
:
J. Han, M. Kamber. Data Mining: Concepts and Techniques, 2nd edition. Morgan Kaufmann Publishers, 2006
I. H. Witten, E. Frank. Data Mining: Practical Machine Learning Tools and Techniques, 2nd edition. Morgan Kaufmann Publishers, 2005
P.-N. Tan, M. Steinbach, V. Kumar. Introduction to Data Mining, Addison-Wesley, 2006.
Grading
: Final exam (50%) + assignment 1 (12.5%) + assignment 2 (12.5%) + assignment 3 (12.5%) + assignment 4 (12.5%)
TA
: Mr.
Chao Qian
Assignments
Assignment 1:
A visualization task
Assignment 2:
A classification task
Assignment 3:
A clustering task
Assignment 4:
Mining a real-world data
Lectures
9/12
: Introduction
9/19
: Data quality
9/26
: Data visualization
Links
Weka
An open source (Java) machine learning/data mining algorithms software.
R Project
An open source platform for statistic computing using R script language.
ACM SIGKDD
The website of the ACM Special Interest Group on Knowledge Discovery and Data Mining.
ACM SIGKDD Explorations Newsletters
A magazine of SIGKDD.
KDnuggets
A website for data mining resources.
The end