Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

Restoring Consistency in Ontological Multidimensional Data Models via Weighted Repairs

dc.contributor.advisorChiang, Fei
dc.contributor.authorHaque, Enamul
dc.contributor.departmentComputing and Softwareen_US
dc.date.accessioned2020-04-17T13:27:00Z
dc.date.available2020-04-17T13:27:00Z
dc.date.issued2020
dc.descriptionThis can be considered as a multidisciplinary research where ideas from Operations Research, Data Science and Logic came together to solve an inconsistency handling problem in a special type of ontology.en_US
dc.description.abstractHigh data quality is a prerequisite for accurate data analysis. However, data inconsistencies often arise in real data, leading to untrusted decision making downstream in the data analysis pipeline. In this research, we study the problem of inconsistency detection and repair of the Ontology Multi-dimensional Data Model (OMD). We propose a framework of data quality assessment, and repair for the OMD. We formally define a weight-based repair-by-deletion semantics, and present an automatic weight generation mechanism that considers multiple input criteria. Our methods are rooted in multi-criteria decision making that consider the correlation, contrast, and conflict that may exist among multiple criteria, and is often needed in the data cleaning domain. After weight generation we present a dynamic programming based Min-Sum algorithm to identify minimal weight solution. We then apply evolutionary optimization techniques and demonstrate improved performance using medical datasets, making it realizable in practice.en_US
dc.description.degreeMaster of Computer Science (MCS)en_US
dc.description.degreetypeThesisen_US
dc.description.layabstractAccurate data analysis requires high quality data as input. In this research, we study inconsistency in an ontology known as Ontology Multi-dimensional Data (OMD) Model and propose algorithms to repair them based on their automatically generated relative weights. We proposed two techniques to restore consistency, one provides optimal results but takes longer time compared to the other one, which produces sub-optimal results but fast enough for practical purposes, shown with experiments on datasets.en_US
dc.identifier.urihttp://hdl.handle.net/11375/25386
dc.language.isoenen_US
dc.subjectLogicen_US
dc.subjectData Scienceen_US
dc.subjectData Cleaningen_US
dc.subjectMCDMen_US
dc.subjectCRITICen_US
dc.subjectOMDen_US
dc.subjectGenetic Algorithmsen_US
dc.subjectDatabaseen_US
dc.subjectDatalogen_US
dc.titleRestoring Consistency in Ontological Multidimensional Data Models via Weighted Repairsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Haque_Enamul_2020April_MSc.pdf
Size:
2.86 MB
Format:
Adobe Portable Document Format
Description:
Thesis Data Science

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: