|
ABSTRACT
Existing data mining algorithms on graphs look for nodes satisfying specific properties, such as specific notions of structural similarity or specific measures of link-based importance. While such analyses for predetermined properties can be effective in well-understood domains, sometimes identifying an appropriate property for analysis can be a challenge, and focusing on a single property may neglect other important aspects of the data. In this paper, we develop a foundation for mining the properties themselves. We present a theoretical framework defining the space of graph properties, a variety of mining queries enabled by the framework, techniques to handle the enormous size of the query space, and an experimental system called F-Miner that demonstrates the utility and feasibility of property mining.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
Sergey Brin. Extracting patterns and relations from the World Wide Web. In Proceedings of the WebDB Workshop at the 6th International Conference on Extending Database Technology (EDBT), Valencia, Spain, March 1998.
|
| |
6
|
Luc Dehaspe and Hannu TT Toivonen. Discovery of Relational Association Rules, pages 189--212. Springer-Verlag Heidelberg, Germany, 2001. http://citeseer.nj.nec.com/486120.html.
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
Christoph Helma, Stefan Kramer, and Luc De Raedt. The molecular feature miner MolFea. In Proceedings of the Beilstein-Institut Workshop, Bozen, Italy, May 2002.
|
 |
11
|
|
| |
12
|
|
 |
13
|
|
| |
14
|
Glen Jeh and Jennifer Widom. Mining the space of graph properties. Technical report, Stanford University Database Group, 2003. http://dbpubs.stanford.edu/pub/2003-10.
|
 |
15
|
|
| |
16
|
M. M. Kessler. Bibliographic coupling between scientific papers. American Documentation, 14:10--25, 1963.
|
| |
17
|
|
| |
18
|
Michihiro Kuramochi and George Karypis. An efficient algorithm for discovering frequent subgraphs. Technical report, Department of Computer Science, University of Minnesota, 2002. http://www.cs.umn.edu/~kuram/papers/fsg-long.pdf.
|
| |
19
|
Surnjani Djoko Lawrence B. Holder, Diane J. Cook. Substructure discovery in the SUBDUE system. In Proceedings of the AAAI Workshop on Knowledge Discovery in Databases (KDD), Seattle, Washington, USA, July 1994.
|
| |
20
|
Stephen Muggleton and Luc De Raedt. Inductive logic programming: Theory and methods. Journal of Logic Programming, 19/20:629--679, 1994.
|
| |
21
|
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The Page-Rank citation ranking: Bringing order to the Web. Technical report, Stanford University Database Group, 1998. http://citeseer.nj.nec.com/368196.html.
|
| |
22
|
Henry Small. Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science, 24:265--269, 1973.
|
| |
23
|
|
| |
24
|
|
| |
25
|
|
 |
26
|
|
|