ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
An optimal algorithm for similarity based entity association
Full text PdfPdf (530 KB)
Source ACM Southeast Regional Conference archive
Proceedings of the 33rd annual on Southeast regional conference table of contents
Clemson, South Carolina
SESSION: Database systems I table of contents
Pages: 30 - 36  
Year of Publication: 1995
ISBN:0-89791747-2
Author
Olivier Brissac  IREMIA, Faculté des Sciences, La REUNION. France
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 10,   Citation Count: 0
Additional Information:

abstract   references  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1122018.1122024
What is a DOI?

ABSTRACT

Some inductive learning systems enable structural descriptions of examples, based on elementary entities whose value can be either symbolic or numerical. This paper studies such systems, especially the ones that use a similarity measure defined between elementary entities. As it is the case in our system [12], [13], in a learning process, the matching step leads to a generalization of the training set. The main benefit of a similarity based matching step is to enable symbolic as well as numeric values processing. Given a pair of examples, learning algorithms using a similarity measure begin with computing this measure for all entities pairs taken in both examples. When it comes to choosing which entities are to be associated, a greedy method is used: entities are associated by pairs in decreasing similarity order. This paper proposes an alternative to this greedy choice based on a network flow algorithm providing an optimal result, according to the given similarity function. Furthermore, we study a generalization of this approach and we show the general case to be NP-complete. After a discussion on theoretical and practical use of the optimal method, we give some directions for further works.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
J.P Barthelemy and A Cobstein, "Complexité algorithmique et problèmes de communication", Masson/CNET-ENST, 1992.
 
4
C Berge, "Théorie des graphes et ses applications", Dunod, 1963.
 
5
G Bisson, "Une mesure de similarite pour apprender en LPO", Proceedings of RFIA 91, 1991.
 
6
G Bisson, "Learning in FOL with a similarity measure", Proceedings of AAAI, 1992.
 
7
M Botta and A Giordana, "SMART+ : A multistartegy learning tool", proceedings of 13th International Joint Conference on Artificial Intelligence (IJCAI), 1993.
 
8
E Diday, "Knowledge representation and symbolic data analysis", proceedings of 2nd international workshop in data expert knowledge and decision, 1989.
 
9
 
10
J.G Ganascia, "Agape et Charade: Deux techniques d'apprentissage symbolique appliquees a la construction de bases de connaissances", Phd Universite de Paris-Sud, 1987.
 
11
M Gondran and M Minoux, "Graphes et Algorithmes", Eyrolles, 1990.
 
12
M Liquière and J Sallantin, "INNE (INduction in NEtworks) A structural learning algorithm for noisy examples", Proceedings of the 4th European Working Session on Learning (EWSL), Morgan Kaufmann, 1989, pp. 111--123.
 
13
M Liquière, "Graphs and structural similarities", Proceedings of IFCS'93, New approaches in Classification and Data Analysis, Studies in Classification, Data Analysis and Knowledge Acquisition, Springer Verlag, 1993.
 
14
M Minoux, "Programmation mathématique, théorie et algorithmes", Dunod, volume 1, 1983.
 
15
B Roy, "Algebre moderne et theorie des graphes", Dunod, volume 1:2, 1970.
 
16
 
17