Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

Multi-label Classification and Sentiment Analysis on Textual Records

dc.contributor.advisorChen, Jun
dc.contributor.authorGuo, Xintong
dc.contributor.departmentElectrical and Computer Engineeringen_US
dc.date.accessioned2019-07-24T18:13:10Z
dc.date.available2019-07-24T18:13:10Z
dc.date.issued2019
dc.description.abstractIn this thesis we have present effective approaches for two classic Nature Language Processing tasks: Multi-label Text Classification(MLTC) and Sentiment Analysis(SA) based on two datasets. For MLTC, a robust deep learning approach based on convolution neural network(CNN) has been introduced. We have done this on almost one million records with a related label list consists of 20 labels. We have divided our data set into three parts, training set, validation set and test set. Our CNN based model achieved great result measured in F1 score. For SA, data set was more informative and well-structured compared with MLTC. A traditional word embedding method, Word2Vec was used for generating word vector of each text records. Following that, we employed several classic deep learning models such as Bi-LSTM, RCNN, Attention mechanism and CNN to extract sentiment features. In the next step, a classification frame was designed to graded. At last, the start-of-art language model, BERT which use transfer learning method was employed. In conclusion, we compared performance of RNN-based model, CNN-based model and pre-trained language model on classification task and discuss their applicability.en_US
dc.description.degreeMaster of Science in Electrical and Computer Engineering (MSECE)en_US
dc.description.degreetypeThesisen_US
dc.description.layabstractThis theis purposed two deep learning solution to both multi-label classification problem and sentiment analysis problem.en_US
dc.identifier.urihttp://hdl.handle.net/11375/24627
dc.language.isoenen_US
dc.subjectNLPen_US
dc.subjectsentiment analysisen_US
dc.subjectmulti-label classificationen_US
dc.subjectmachine learningen_US
dc.subjectdeep learningen_US
dc.titleMulti-label Classification and Sentiment Analysis on Textual Recordsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Guo_Xintong_201907_MASc.pdf
Size:
2.03 MB
Format:
Adobe Portable Document Format
Description:
Xintong Guo master of science thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: