📞 +91-7667918914 | ✉️ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 5, ISSUE 7, JULY 2016

Using Rule Based and Blocking Approaches to accomplish Entity Identification for Data Cleaning

Ankita Saxena, Prof. Ranjana Dahake

DOI: 10.17148/IJARCCE.2016.5776

Abstract: In today�s scenario entity appear in multiple data sources so it is necessary to identify the records referring to the same real-world entity, which is named as Entity Resolution (ER).ER is one of the most substantial problems in data cleaning and ascends in many applications such as information integration and information retrieval. Familiar ER approaches are in sufficient to identify records based on pair wise likeness comparisons, which assumes that records referring to the same entity are more similar to each other than otherwise. However for certain circumstances this assumption does not always hold in practice and likeness comparisons do not work well when such assumption breaks. So to overcome outdated ER drawback a new set of rules which could describe the complex matching conditions between records and entities is proposed such as rule discovery algorithm, rule based ER algorithm along with blocking scheme methods to get more resolved classified entity set.



Keywords: Entity Resolution, Data Cleaning, Rule Learning and Meta blocking.

How to Cite:

[1] Ankita Saxena, Prof. Ranjana Dahake, “Using Rule Based and Blocking Approaches to accomplish Entity Identification for Data Cleaning,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.5776