Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Semantic blocking for Record Linkage

  • J. Nin
  • , Víctor Muntés-Mulero
  • , Norbert MartíNez-Bazan
  • , Josep L. Larriba-Pey

Producción científica: Capítulo del libroContribución a congreso/conferenciarevisión exhaustiva

Resumen

Record Linkage (RL) is an important component of data cleaning and integration and data processing in general. For years, many efforts have focused on improving the performance of the RL process, either by reducing the number of record comparisons or reducing the number of attribute comparisons, which reduces the computational time, but increases the amount of error. However, the real bottleneck of RL is the post-process, where the results have to be reviewed by experts that decide which pairs or groups of records are real links and which are false hits. In this paper we show that exploiting the semantic relationships (e.g. foreign key), established between one or more data sources, makes it possible to find a new sort of semantic blocking method that improves the number of hits and reduces the amount of review effort.

Idioma originalInglés
Título de la publicación alojadaArtificial Intelligence Research and Development
Páginas141-149
Número de páginas9
EstadoPublicada - 2007
Publicado de forma externa
Evento10th International Conference of the Catalan Association for Artificial Intelligence, CCIA 2007 - Sant Julia de Loria, Andorra
Duración: 25 oct 200726 oct 2007

Serie de la publicación

NombreFrontiers in Artificial Intelligence and Applications
Volumen163
ISSN (versión impresa)0922-6389

Conferencia

Conferencia10th International Conference of the Catalan Association for Artificial Intelligence, CCIA 2007
País/TerritorioAndorra
CiudadSant Julia de Loria
Período25/10/0726/10/07

Huella

Profundice en los temas de investigación de 'Semantic blocking for Record Linkage'. En conjunto forman una huella única.

Cómo citar