Found 28 of 1,394,228
Results 1 - 20 of 28

1. A strategy for allowing meaningful and comparable scores in approximate matching
December 2009
Information Systems, Volume 34 Issue 8
Publisher: Elsevier Science Ltd.
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 0
Approximate data matching aims at assessing whether two distinct instances of data represent the same real-world object. The comparison between data values is usually done by applying a similarity function which returns a similarity score. If this score ...
Keywords: Data cleaning, Data integration, Deduplication, Entity resolution, Similarity querying

2. Scaling up duplicate detection in graph data
October 2008
CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management
Publisher: ACM
Full text available: Pdf (296.51 KB)
Bibliometrics: Downloads (6 Weeks): 11, Downloads (12 Months): 109, Downloads (Overall): 109, Citation Count: 0
Duplicate detection determines different representations of real-world objects in a database. Recent research has considered the use of relationships among object representations to improve duplicate detection. In the general case where relationships ...
Keywords: data quality, duplicate detection, entity resolution, scalability

3. Automatic record linkage using seeded nearest neighbour and support vector machine classification
August 2008
KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM
Full text available: Pdf (415.92 KB)
Bibliometrics: Downloads (6 Weeks): 24, Downloads (12 Months): 311, Downloads (Overall): 407, Citation Count: 1
The task of linking databases is an important step in an increasing number of data mining projects, because linked data can contain information that is not available otherwise, or that would require time-consuming and expensive collection of specific ...
Keywords: data linkage, data matching, deduplication, entity resolution, nearest neighbour, support vector machine

4. Pair-Wise entity resolution: overview and challenges
November 2006
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
Publisher: ACM
Full text available: Pdf (91.71 KB)
Bibliometrics: Downloads (6 Weeks): 8, Downloads (12 Months): 72, Downloads (Overall): 268, Citation Count: 1
Information integration is one of the oldest and most important computer science problems: Information from diverse sources must be combined, so that users can access and manipulate the information in a unified way. One of the central problems in information ...
Keywords: data cleaning, entity resolution

5. Qualitative effects of knowledge rules and user feedback in probabilistic data integration
Maurice Keulen, Ander Keijzer
October 2009
The VLDB Journal — The International Journal on Very Large Data Bases, Volume 18 Issue 5
Publisher: Springer-Verlag New York, Inc.
Full text available: Pdf (1.83 MB)
Bibliometrics: Downloads (6 Weeks): 3, Downloads (12 Months): 3, Downloads (Overall): 3, Citation Count: 0
In data integration efforts, portal development in particular, much development time is devoted to entity resolution. Often advanced similarity measurement techniques are used to remove semantic duplicates or solve other semantic conflicts. It proves ...
Keywords: Data integration, Data quality, Entity resolution, Uncertain databases, User feedback

6. A Latent Topic Model for Complete Entity Resolution
March 2009
ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering
Publisher: IEEE Computer Society
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 0
In bibliographies like DBLP and Citeseer, there are three kinds of entity-name problems that need to be solved. First, multiple entities share one name, which is called the name sharing problem. Second, one entity has different names, which is called ...
Keywords: Entity resolution, name disambiguation, topic model, LDA

7. A cluster-based approach to XML similarity joins
Leonardo A. Ribeiro, Theo Härder, Fernanda S. Pimenta
September 2009
IDEAS '09: Proceedings of the 2009 International Database Engineering & Applications Symposium
Publisher: ACM
Full text available: Pdf (900.97 KB)
Bibliometrics: Downloads (6 Weeks): 9, Downloads (12 Months): 9, Downloads (Overall): 9, Citation Count: 0
A natural consequence of the widespread adoption of XML as standard for information representation and exchange is the redundant storage of large amounts of persistent XML documents. Compared to relational data tables, data represented in XML format ...
Keywords: XML, clustering, entity resolution, similarity joins, similarity measures, xml databases

8. Exploiting context analysis for combining multiple entity resolution systems
June 2009
SIGMOD '09: Proceedings of the 35th SIGMOD international conference on Management of data
Publisher: ACM
Full text available: Pdf (468.33 KB)
Bibliometrics: Downloads (6 Weeks): 49, Downloads (12 Months): 200, Downloads (Overall): 200, Citation Count: 0
Entity Resolution (ER) is an important real world problem that has attracted significant research interest over the past few years. It deals with determining which object descriptions co-refer in a dataset. Due to its practical significance for data ...
Keywords: context analysis, entity resolution, er ensemble

9. Domain-independent data cleaning via analysis of entity-relationship graph
June 2006
Transactions on Database Systems (TODS), Volume 31 Issue 2
Publisher: ACM
Full text available: Pdf (1.27 MB)
Bibliometrics: Downloads (6 Weeks): 13, Downloads (12 Months): 192, Downloads (Overall): 1056, Citation Count: 11
In this article, we address the problem of reference disambiguation. Specifically, we consider a situation where entities in the database are referred to using descriptions (e.g., a set of instantiated attributes). The objective of reference disambiguation ...
Keywords: Connection strength, RelDC, data cleaning, entity resolution, graph analysis, reference disambiguation, relationship analysis

10. Query-time entity resolution
August 2006
KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM
Full text available: Pdf (751.81 KB)
Bibliometrics: Downloads (6 Weeks): 15, Downloads (12 Months): 70, Downloads (Overall): 336, Citation Count: 6
The goal of entity resolution is to reconcile database references corresponding to the same real-world entities. Given the abundance of publicly available databases where entities are not resolved, we motivate the problem of quickly processing queries ...
Keywords: adaptive, entity resolution, query, relations

11. WEST: Modern Technologies for Web People Search
March 2009
ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering
Publisher: IEEE Computer Society
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 1
In this paper we describe WEST (Web Entity Search Technologies) system that we have developed to improve people search over the Internet. Recently the problem of Web People Search (WePS) has attracted significant attention from both the industry and ...
Keywords: Web People Search, Entity Resolution, Disambiguation

12. Adaptive sorted neighborhood methods for efficient record linkage
June 2007
JCDL '07: Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM
Full text available: Pdf (422.93 KB)
Bibliometrics: Downloads (6 Weeks): 5, Downloads (12 Months): 65, Downloads (Overall): 231, Citation Count: 1
Traditionally, record linkage algorithms have played an important role in maintaining digital libraries - i.e., identifying matching citations or authors for consolidation in updating or integrating digital libraries. As such, a variety of record linkage ...
Keywords: citation matching, entity resolution, record linkage, sorted neighborhood

13. Entity resolution with iterative blocking
June 2009
SIGMOD '09: Proceedings of the 35th SIGMOD international conference on Management of data
Publisher: ACM
Full text available: Pdf (487.23 KB)
Bibliometrics: Downloads (6 Weeks): 40, Downloads (12 Months): 187, Downloads (Overall): 187, Citation Count: 1
Entity Resolution (ER) is the problem of identifying which records in a database refer to the same real-world entity. An exhaustive ER process involves computing the similarities between pairs of records, which can be very expensive for large datasets. ...
Keywords: blocking, entity resolution, iterative blocking

14. Duplicate Record Detection: A Survey
January 2007
IEEE Transactions on Knowledge and Data Engineering, Volume 19 Issue 1
Publisher: IEEE Educational Activities Department
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 25
Often, in the real world, entities have two or more representations in databases. Duplicate records do not share a common key and/or they contain errors that make duplicate matching a difficult task. Errors are introduced as the result of transcription ...
Keywords: Duplicate detection, data cleaning, data integration, record linkage, data deduplication, instance identification, database hardening, name matching, identity uncertainty, entity resolution, fuzzy duplicate detection, entity matching.

15. Consolidation of References to Persons in Bibliographic Databases
December 2008
ICADL 08: Proceedings of the 11th International Conference on Asian Digital Libraries: Universal and Ubiquitous Access to Information
Publisher: Springer-Verlag
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 0
Entity resolution is the process of determining if, in a specific context, two or more references correspond to the same entity. In this work, we address this problem in the context of references to persons as they are found in bibliographic data, specifically ...
Keywords: Entity resolution, bibliographic metadata, machine learning, similarity metrics

16. Geocode Matching and Privacy Preservation
May 2009
Privacy, Security, and Trust in KDD
Publisher: Springer-Verlag
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 0
Geocoding is the process of matching addresses to geographic locations, such as latitudes and longitudes, or local census areas. In many applications, addresses are the key to geo-spatial data analysis and mining. Privacy and confidentiality are of paramount ...
Keywords: Data matching, entity resolution, geocoding, privacy preservation, record linkage, secure multi-party computations

17. Web based linkage
November 2007
WIDM '07: Proceedings of the 9th annual ACM international workshop on Web information and data management
Publisher: ACM
Full text available: Pdf (624.67 KB)
Bibliometrics: Downloads (6 Weeks): 11, Downloads (12 Months): 58, Downloads (Overall): 178, Citation Count: 3
When a variety of names are used for the same real-world entity, the problem of detecting all such variants has been known as the (record) linkage or entity resolution problem. In this paper, toward this problem, we propose a novel approach ...
Keywords: entity resolution, record linkage

18. A strategy for allowing meaningful and comparable scores in approximate matching
Carina F. Dorneles, Marcos Freitas Nunes, Carlos A. Heuser, Viviane P. Moreira, Altigran S. da Silva, Edleno S. de Moura
December 2009
Information Systems, Volume 34 Issue 8
Publisher: Elsevier Science Ltd.
Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Downloads (Overall): n/a, Citation Count: 0
Approximate data matching aims at assessing whether two distinct instances of data represent the same real-world object. The comparison between data values is usually done by applying a similarity function which returns a similarity score. If this score ...
Keywords: Data cleaning, Data integration, Deduplication, Entity resolution, Similarity querying

19. Search engine driven author disambiguation
June 2006
JCDL '06: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM
Full text available: Pdf (86.41 KB)
Bibliometrics: Downloads (6 Weeks): 7, Downloads (12 Months): 56, Downloads (Overall): 219, Citation Count: 9
In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of ...
Keywords: IHF, author disambiguation, entity resolution

20. Adaptive graphical approach to entity resolution
June 2007
JCDL '07: Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM
Full text available: Pdf (523.81 KB)
Bibliometrics: Downloads (6 Weeks): 10, Downloads (12 Months): 126, Downloads (Overall): 357, Citation Count: 4
Entity resolution is a very common Information Quality (IQ) problem with many different applications. In digital libraries, it is related to problems of citation matching and author name disambiguation; in Natural Language Processing, it is related ...
Keywords: SNA, entity relationship graph, entity resolution, graph analysis, self-tuning