Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/25386
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Chiang, Fei | - |
dc.contributor.author | Haque, Enamul | - |
dc.date.accessioned | 2020-04-17T13:27:00Z | - |
dc.date.available | 2020-04-17T13:27:00Z | - |
dc.date.issued | 2020 | - |
dc.identifier.uri | http://hdl.handle.net/11375/25386 | - |
dc.description | This can be considered as a multidisciplinary research where ideas from Operations Research, Data Science and Logic came together to solve an inconsistency handling problem in a special type of ontology. | en_US |
dc.description.abstract | High data quality is a prerequisite for accurate data analysis. However, data inconsistencies often arise in real data, leading to untrusted decision making downstream in the data analysis pipeline. In this research, we study the problem of inconsistency detection and repair of the Ontology Multi-dimensional Data Model (OMD). We propose a framework of data quality assessment, and repair for the OMD. We formally define a weight-based repair-by-deletion semantics, and present an automatic weight generation mechanism that considers multiple input criteria. Our methods are rooted in multi-criteria decision making that consider the correlation, contrast, and conflict that may exist among multiple criteria, and is often needed in the data cleaning domain. After weight generation we present a dynamic programming based Min-Sum algorithm to identify minimal weight solution. We then apply evolutionary optimization techniques and demonstrate improved performance using medical datasets, making it realizable in practice. | en_US |
dc.language.iso | en | en_US |
dc.subject | Logic | en_US |
dc.subject | Data Science | en_US |
dc.subject | Data Cleaning | en_US |
dc.subject | MCDM | en_US |
dc.subject | CRITIC | en_US |
dc.subject | OMD | en_US |
dc.subject | Genetic Algorithms | en_US |
dc.subject | Database | en_US |
dc.subject | Datalog | en_US |
dc.title | Restoring Consistency in Ontological Multidimensional Data Models via Weighted Repairs | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Computing and Software | en_US |
dc.description.degreetype | Thesis | en_US |
dc.description.degree | Master of Computer Science (MCS) | en_US |
dc.description.layabstract | Accurate data analysis requires high quality data as input. In this research, we study inconsistency in an ontology known as Ontology Multi-dimensional Data (OMD) Model and propose algorithms to repair them based on their automatically generated relative weights. We proposed two techniques to restore consistency, one provides optimal results but takes longer time compared to the other one, which produces sub-optimal results but fast enough for practical purposes, shown with experiments on datasets. | en_US |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Haque_Enamul_2020April_MSc.pdf | Thesis Data Science | 2.93 MB | Adobe PDF | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.