An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries

Byung Won On, Ergin Elmacioglu, Dongwon Lee, Jaewoo Kang, Jian Pei

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Citations (Scopus)

Abstract

We study how to resolve entities that contain a group of related elements in them (e.g., an author entity with a list of citations or an intermediate result by GROUP BY SQL query). Such entities, named as grouped-entities, frequently occur in many applications. By exploiting contextual information mined from the group of elements per entity in addition to syntactic similarity, we show that our approach, Quasi-Cllque, improves precision and recall unto 91% when used together with a variety of existing entity resolution solutions, but never worsens them.

Original languageEnglish
Title of host publicationProceedings of the ACM/IEEE Joint Conference on Digital Libraries
Pages51-52
Number of pages2
Volume2006
DOIs
Publication statusPublished - 2006 Dec 1
Externally publishedYes
Event6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06 - Chapel Hill, NC, United States
Duration: 2006 Jun 112006 Jun 15

Other

Other6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06
CountryUnited States
CityChapel Hill, NC
Period06/6/1106/6/15

Fingerprint

Digital libraries
Syntactics

Keywords

  • Entity resolution
  • Graph partition
  • Name disambiguation

ASJC Scopus subject areas

  • Engineering(all)

Cite this

On, B. W., Elmacioglu, E., Lee, D., Kang, J., & Pei, J. (2006). An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (Vol. 2006, pp. 51-52) https://doi.org/10.1145/1141753.1141761

An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries. / On, Byung Won; Elmacioglu, Ergin; Lee, Dongwon; Kang, Jaewoo; Pei, Jian.

Proceedings of the ACM/IEEE Joint Conference on Digital Libraries. Vol. 2006 2006. p. 51-52.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

On, BW, Elmacioglu, E, Lee, D, Kang, J & Pei, J 2006, An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries. in Proceedings of the ACM/IEEE Joint Conference on Digital Libraries. vol. 2006, pp. 51-52, 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06, Chapel Hill, NC, United States, 06/6/11. https://doi.org/10.1145/1141753.1141761
On BW, Elmacioglu E, Lee D, Kang J, Pei J. An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries. Vol. 2006. 2006. p. 51-52 https://doi.org/10.1145/1141753.1141761
On, Byung Won ; Elmacioglu, Ergin ; Lee, Dongwon ; Kang, Jaewoo ; Pei, Jian. / An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries. Vol. 2006 2006. pp. 51-52
@inproceedings{10e13855523e42b0927af1d4cb563c1d,
title = "An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries",
abstract = "We study how to resolve entities that contain a group of related elements in them (e.g., an author entity with a list of citations or an intermediate result by GROUP BY SQL query). Such entities, named as grouped-entities, frequently occur in many applications. By exploiting contextual information mined from the group of elements per entity in addition to syntactic similarity, we show that our approach, Quasi-Cllque, improves precision and recall unto 91{\%} when used together with a variety of existing entity resolution solutions, but never worsens them.",
keywords = "Entity resolution, Graph partition, Name disambiguation",
author = "On, {Byung Won} and Ergin Elmacioglu and Dongwon Lee and Jaewoo Kang and Jian Pei",
year = "2006",
month = "12",
day = "1",
doi = "10.1145/1141753.1141761",
language = "English",
isbn = "1595933549",
volume = "2006",
pages = "51--52",
booktitle = "Proceedings of the ACM/IEEE Joint Conference on Digital Libraries",

}

TY - GEN

T1 - An effective approach to entity resolution problem using Quasi-Clique and its application to digital libraries

AU - On, Byung Won

AU - Elmacioglu, Ergin

AU - Lee, Dongwon

AU - Kang, Jaewoo

AU - Pei, Jian

PY - 2006/12/1

Y1 - 2006/12/1

N2 - We study how to resolve entities that contain a group of related elements in them (e.g., an author entity with a list of citations or an intermediate result by GROUP BY SQL query). Such entities, named as grouped-entities, frequently occur in many applications. By exploiting contextual information mined from the group of elements per entity in addition to syntactic similarity, we show that our approach, Quasi-Cllque, improves precision and recall unto 91% when used together with a variety of existing entity resolution solutions, but never worsens them.

AB - We study how to resolve entities that contain a group of related elements in them (e.g., an author entity with a list of citations or an intermediate result by GROUP BY SQL query). Such entities, named as grouped-entities, frequently occur in many applications. By exploiting contextual information mined from the group of elements per entity in addition to syntactic similarity, we show that our approach, Quasi-Cllque, improves precision and recall unto 91% when used together with a variety of existing entity resolution solutions, but never worsens them.

KW - Entity resolution

KW - Graph partition

KW - Name disambiguation

UR - http://www.scopus.com/inward/record.url?scp=34247276592&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34247276592&partnerID=8YFLogxK

U2 - 10.1145/1141753.1141761

DO - 10.1145/1141753.1141761

M3 - Conference contribution

AN - SCOPUS:34247276592

SN - 1595933549

SN - 9781595933546

VL - 2006

SP - 51

EP - 52

BT - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries

ER -