|
ABSTRACT
The primary aim of most data mining algorithms is to facilitate the discovery of concise and interpretable information from large amounts of data. However, many of the current formalizations of data mining algorithms have not quite reached this goal. One of the reasons for this is that the focus on using purely automated techniques has imposed several constraints on data mining algorithms. For example, any data mining problem such as clustering or association rules requires the specification of particular problem formulations, objective functions, and parameters. Such systems fail to take the user's needs into account very effectively. This makes it necessary to keep the user in the loop in a way which is both efficient and interpretable. One unique way of achieving this is by leveraging human visual perceptions on intermediate data mining results. Such a system combines the computational power of a computer and the intuitive abilities of a human to provide solutions which cannot be achieved by either. This paper will discuss a number of recent approaches to several data mining algorithms along these lines.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Charu C. Aggarwal , Joel L. Wolf , Philip S. Yu , Cecilia Procopiuc , Jong Soo Park, Fast algorithms for projected clustering, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.61-72, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
 |
3
|
|
| |
4
|
C. C. Aggarwal. Towards Exploratory Instance Centered Classification of High Dimensional Data. IBM Research Report, 2002.
|
| |
5
|
C. C. Aggarwal. Towards Meaningful High Dimensional Nearest Neighbor Search by Human-Computer Interaction. ICDE Conference, 2002.
|
| |
6
|
|
 |
7
|
|
 |
8
|
|
 |
9
|
Rakesh Agrawal , Johannes Gehrke , Dimitrios Gunopulos , Prabhakar Raghavan, Automatic subspace clustering of high dimensional data for data mining applications, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.94-105, June 01-04, 1998, Seattle, Washington, United States
|
 |
10
|
Mihael Ankerst , Martin Ester , Hans-Peter Kriegel, Towards an effective cooperation of the user and the computer for classification, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.179-188, August 20-23, 2000, Boston, Massachusetts, United States
[doi> 10.1145/347090.347124]
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
M. Ester, H.-P. Kriegel, J. Sander, M. Wimmer, X. Xu. Density-Connected Sets and their Application for Trend Detection in Spatial databases. Proceedings of the KDD Conference, 1997.
|
| |
15
|
C. Faloutsos , R. Barber , M. Flickner , J. Hafner , W. Niblack , D. Petkovic , W. Equitz, Efficient and effective querying by image content, Journal of Intelligent Information Systems, v.3 n.3-4, p.231-262, July 1994
[doi> 10.1007/BF00962238]
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
 |
19
|
Heike Hofmann , Arno P. J. M. Siebes , Adalbert F. X. Wilhelm, Visualizing association rules with interactive mosaic plots, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.227-235, August 20-23, 2000, Boston, Massachusetts, United States
[doi> 10.1145/347090.347133]
|
| |
20
|
|
 |
21
|
|
| |
22
|
|
| |
23
|
D. A. Keim. Visual Support for Query Specification and Data Mining. Shaker Publishing Company, Aachen, Germany 1995.
|
| |
24
|
|
| |
25
|
|
| |
26
|
Y. Rui, T. S. Huang, S. Mehrotra, Content-based image retrieval with relevance feedback in MARS. Proceedings of the IEEE Conference on Image Processing, 1997.
|
| |
27
|
|
| |
28
|
|
| |
29
|
|
| |
30
|
B. W. Silverman. Density Estimation for Statistics and Data Analysis, Chapman and Hall, 1986.
|
 |
31
|
|
| |
32
|
|
| |
33
|
|
| |
34
|
|
CITED BY 7
|
|
|
|
|
Jessica Lin , Eamonn Keogh , Stefano Lonardi , Jeffrey P. Lankford , Donna M. Nystrom, Visually mining and monitoring massive time series, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|