Universidade de Lisboa Repositório da Universidade de Lisboa

Please use this identifier to cite or link to this item: http://hdl.handle.net/10455/3050

Title: Semantic Similarity Match for Data Quality
Authors: Martins, Fernando; Falcão, André; Couto, Francisco M.
Keywords: semantic similarity; data cleaning; data quality; wordnet; similarity match
Issue Date: Oct-2007
Publisher: Department of Informatics, University of Lisbon
Series/Report No.: di-fcul-tr-07-25
Abstract: Data quality is a critical aspect of applications that support business operations. Entities are often represented more than once in data repositories, and because duplicate records do not share a common key, they are hard to detect. Duplicate detection over text is usually performed with lexical approaches, which do not capture the sense of the text. The task becomes harder when duplicates must be detected by text sense. This work presents a semantic similarity approach, based on a text sense matching mechanism, that detects text units that are similar in sense. The goal of the proposed approach is therefore to perform the duplicate detection task in a data quality process.
URI: http://hdl.handle.net/10455/3050
Appears in Collections: FC-DI - Technical Reports

Files in This Item:

File: 07-25.pdf | Size: 215.96 kB | Format: Adobe PDF


 
