ACM Home Page

Searching within The Guide for: Keywords:"entity resolution"  (start a new search)

Found 28 of 1,394,228

REFINE YOUR SEARCH

ADVANCED SEARCH
Advanced search.  Advanced Search

FEEDBACK
Please provide us with feedback. Please provide us with feedback

Found 28 of 1,394,228

Results 1 - 20 of 28
Sort by in
Result page: 1   2    next    >>
1
A strategy for allowing meaningful and comparable scores in approximate matching
December 2009
Information Systems , Volume 34 Issue 8
Publisher: Elsevier Science Ltd.
Additional Information:full citation, abstract, references
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 0

Approximate data matching aims at assessing whether two distinct instances of data represent the same real-world object. The comparison between data values is usually done by applying a similarity function which returns a similarity score. If this score ...


Keywords: Data cleaning, Data integration, Deduplication, Entity resolution, Similarity querying
2
Scaling up duplicate detection in graph data
October 2008
CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (296.51 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 11,   Downloads (12 Months): 109,   Downloads (Overall): 109,    Citation Count: 0

Duplicate detection determines different representations of real-world objects in a database. Recent research has considered the use of relationships among object representations to improve duplicate detection. In the general case where relationships ...


Keywords: data quality, duplicate detection, entity resolution, scalability
3
Automatic record linkage using seeded nearest neighbour and support vector machine classification
August 2008
KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (415.92 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 24,   Downloads (12 Months): 311,   Downloads (Overall): 407,    Citation Count: 1

The task of linking databases is an important step in an increasing number of data mining projects, because linked data can contain information that is not available otherwise, or that would require time-consuming and expensive collection of specific ...


Keywords: data linkage, data matching, deduplication, entity resolution, nearest neighbour, support vector machine
4
Pair-Wise entity resolution: overview and challenges
November 2006
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (91.71 KB)
Additional Information:full citation, abstract, index terms
Bibliometrics:  Downloads (6 Weeks): 8,   Downloads (12 Months): 72,   Downloads (Overall): 268,    Citation Count: 1

Information integration is one of the oldest and most important computer science problems: Information from diverse sources must be combined, so that users can access and manipulate the information in a unified way. One of the central problems in information ...


Keywords: data cleaning, entity resolution
5
Qualitative effects of knowledge rules and user feedback in probabilistic data integration
Maurice Keulen, Ander Keijzer
October 2009
The VLDB Journal — The International Journal on Very Large Data Bases , Volume 18 Issue 5
Publisher: Springer-Verlag New York, Inc.
Full text available: PdfPdf (1.83 MB)
Additional Information:full citation, abstract
Bibliometrics:  Downloads (6 Weeks): 3,   Downloads (12 Months): 3,   Downloads (Overall): 3,    Citation Count: 0

In data integration efforts, portal development in particular, much development time is devoted to entity resolution. Often advanced similarity measurement techniques are used to remove semantic duplicates or solve other semantic conflicts. It proves ...


Keywords: Data integration, Data quality, Entity resolution, Uncertain databases, User feedback
6
A Latent Topic Model for Complete Entity Resolution
March 2009
ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering
Publisher: IEEE Computer Society
Full text available: Publisher SitePublisher Site
Additional Information:full citation, abstract
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 0

In bibliographies like DBLP and Citeseer, there are three kinds of entity-name problems that need to be solved. First, multiple entities share one name, which is called the name sharing problem. Second, one entity has different names, which is called ...


Keywords: Entity resolution, name disambiguation, topic model, LDA
7
A cluster-based approach to XML similarity joins
Leonardo A. Ribeiro, Theo Härder, Fernanda S. Pimenta
September 2009
IDEAS '09: Proceedings of the 2009 International Database Engineering & Applications Symposium
Publisher: ACM
Full text available: PdfPdf (900.97 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 9,   Downloads (12 Months): 9,   Downloads (Overall): 9,    Citation Count: 0

A natural consequence of the widespread adoption of XML as standard for information representation and exchange is the redundant storage of large amounts of persistent XML documents. Compared to relational data tables, data represented in XML format ...


Keywords: XML, clustering, entity resolution, similarity joins, similarity measures, xml databases
8
Exploiting context analysis for combining multiple entity resolution systems
June 2009
SIGMOD '09: Proceedings of the 35th SIGMOD international conference on Management of data
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (468.33 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 49,   Downloads (12 Months): 200,   Downloads (Overall): 200,    Citation Count: 0

Entity Resolution (ER) is an important real world problem that has attracted significant research interest over the past few years. It deals with determining which object descriptions co-refer in a dataset. Due to its practical significance for data ...


Keywords: context analysis, entity resolution, er ensemble
9
Domain-independent data cleaning via analysis of entity-relationship graph
June 2006
Transactions on Database Systems (TODS) , Volume 31 Issue 2
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (1.27 MB)
Additional Information:full citation, appendices and supplements, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 13,   Downloads (12 Months): 192,   Downloads (Overall): 1056,    Citation Count: 11

In this article, we address the problem of reference disambiguation. Specifically, we consider a situation where entities in the database are referred to using descriptions (e.g., a set of instantiated attributes). The objective of reference disambiguation ...


Keywords: Connection strength, RelDC, data cleaning, entity resolution, graph analysis, reference disambiguation, relationship analysis
10
Query-time entity resolution
August 2006
KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (751.81 KB)
Additional Information:full citation, abstract, references, cited by, index terms
Bibliometrics:  Downloads (6 Weeks): 15,   Downloads (12 Months): 70,   Downloads (Overall): 336,    Citation Count: 6

The goal of entity resolution is to reconcile database references corresponding to the same real-world entities. Given the abundance of publicly available databases where entities are not resolved, we motivate the problem of quickly processing queries ...


Keywords: adaptive, entity resolution, query, relations
11
WEST: Modern Technologies for Web People Search
March 2009
ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering
Publisher: IEEE Computer Society
Full text available: Publisher SitePublisher Site
Additional Information:full citation, abstract
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 1

In this paper we describe WEST (Web Entity Search Technologies) system that we have developed to improve people search over the Internet. Recently the problem of Web People Search (WePS) has attracted significant attention from both the industry and ...


Keywords: Web People Search, Entity Resolution, Disambiguation
12
Adaptive sorted neighborhood methods for efficient record linkage
June 2007
JCDL '07: Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (422.93 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 5,   Downloads (12 Months): 65,   Downloads (Overall): 231,    Citation Count: 1

Traditionally, record linkage algorithms have played an important role in maintaining digital libraries - i.e., identifying matching citations or authors for consolidation in updating or integrating digital libraries. As such, a variety of record linkage ...


Keywords: citation matching, entity resolution, record linkage, sorted neighborhood
13
Entity resolution with iterative blocking
June 2009
SIGMOD '09: Proceedings of the 35th SIGMOD international conference on Management of data
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (487.23 KB)
Additional Information:full citation, abstract, references, cited by, index terms
Bibliometrics:  Downloads (6 Weeks): 40,   Downloads (12 Months): 187,   Downloads (Overall): 187,    Citation Count: 1

Entity Resolution (ER) is the problem of identifying which records in a database refer to the same real-world entity. An exhaustive ER process involves computing the similarities between pairs of records, which can be very expensive for large datasets. ...


Keywords: blocking, entity resolution, iterative blocking
14
Duplicate Record Detection: A Survey
January 2007
IEEE Transactions on Knowledge and Data Engineering , Volume 19 Issue 1
Publisher: IEEE Educational Activities Department
Full text available: Publisher SitePublisher Site
Additional Information:full citation, abstract, index terms
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 25

Often, in the real world, entities have two or more representations in databases. Duplicate records do not share a common key and/or they contain errors that make duplicate matching a difficult task. Errors are introduced as the result of transcription ...


Keywords: Duplicate detection, data cleaning, data integration, record linkage, data deduplication, instance identification, database hardening, name matching, identity uncertainty, entity resolution, fuzzy duplicate detection, entity matching.
15
Consolidation of References to Persons in Bibliographic Databases
December 2008
ICADL 08: Proceedings of the 11th International Conference on Asian Digital Libraries: Universal and Ubiquitous Access to Information
Publisher: Springer-Verlag
Additional Information:full citation, abstract
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 0

Entity resolution is the process of determining if, in a specific context, two or more references correspond to the same entity. In this work, we address this problem in the context of references to persons as they are found in bibliographic data, specifically ...


Keywords: Entity resolution, bibliographic metadata, machine learning, similarity metrics
16
Geocode Matching and Privacy Preservation
May 2009
Privacy, Security, and Trust in KDD
Publisher: Springer-Verlag
Additional Information:full citation, abstract
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 0

Geocoding is the process of matching addresses to geographic locations, such as latitudes and longitudes, or local census areas. In many applications, addresses are the key to geo-spatial data analysis and mining. Privacy and confidentiality are of paramount ...


Keywords: Data matching, entity resolution, geocoding, privacy preservation, record linkage, secure multi-party computations
17
Web based linkage
November 2007
WIDM '07: Proceedings of the 9th annual ACM international workshop on Web information and data management
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (624.67 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 11,   Downloads (12 Months): 58,   Downloads (Overall): 178,    Citation Count: 3

When a variety of names are used for the same real-world entity, the problem of detecting all such variants has been known as the (record) linkage or entity resolution problem. In this paper, toward this problem, we propose a novel approach ...


Keywords: entity resolution, record linkage
18
A strategy for allowing meaningful and comparable scores in approximate matching
Carina F. Dorneles, Marcos Freitas Nunes, Carlos A. Heuser, Viviane P. Moreira, Altigran S. da Silva, Edleno S. de Moura
December 2009
Information Systems , Volume 34 Issue 8
Publisher: Elsevier Science Ltd.
Additional Information:full citation, abstract, references
Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Downloads (Overall): n/a,    Citation Count: 0

Approximate data matching aims at assessing whether two distinct instances of data represent the same real-world object. The comparison between data values is usually done by applying a similarity function which returns a similarity score. If this score ...


Keywords: Data cleaning, Data integration, Deduplication, Entity resolution, Similarity querying
19
Search engine driven author disambiguation
June 2006
JCDL '06: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (86.41 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 7,   Downloads (12 Months): 56,   Downloads (Overall): 219,    Citation Count: 9

In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of ...


Keywords: IHF, author disambiguation, entity resolution
20
Adaptive graphical approach to entity resolution
June 2007
JCDL '07: Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM Request Permissions Request Permissions   
Full text available: PdfPdf (523.81 KB)
Additional Information:full citation, abstract, references, index terms
Bibliometrics:  Downloads (6 Weeks): 10,   Downloads (12 Months): 126,   Downloads (Overall): 357,    Citation Count: 4

Entity resolution is a very common Information Quality (IQ) problem with many different applications. In digital libraries, it is related to problems of citation matching and author name disambiguation; in Natural Language Processing, it is related ...


Keywords: SNA, entity relationship graph, entity resolution, graph analysis, self-tuning
Result page: 1   2    next    >>