Confidence Distillation for Efficient Action Recognition

dc.contributor.advisor: Chiang, Fei
dc.contributor.advisor: Zheng, Rong
dc.contributor.author: Manzuri Shalmani, Shervin
dc.contributor.department: Computing and Software (en_US)
dc.date.accessioned: 2021-01-03T01:29:08Z
dc.date.available: 2021-01-03T01:29:08Z
dc.date.issued: 2020
dc.description.abstract: Modern neural networks are powerful predictive models. However, they perform poorly at recognizing when their predictions may be wrong and at measuring the certainty of their beliefs. With one of the most common activation functions, ReLU and its variants, even a well-calibrated model can produce incorrect but high-confidence predictions. In the related task of action recognition, most current classification methods are based on clip-level classifiers that densely sample a given video for non-overlapping, same-sized clips and combine the results with an aggregation function (typically averaging) to obtain video-level predictions. While this approach has been shown to be effective, it is sub-optimal in recognition accuracy and has a high computational overhead. To mitigate both of these issues, we propose the confidence distillation framework, which first teaches the student a representation of the teacher's uncertainty and second divides the task of full-video prediction between the student and teacher models. We conduct extensive experiments on three action recognition datasets and demonstrate that our framework achieves state-of-the-art results in action recognition accuracy and computational efficiency. (en_US)
dc.description.degree: Master of Science (MSc) (en_US)
dc.description.degreetype: Thesis (en_US)
dc.description.layabstract: We devise a distillation loss function to train an efficient sampler/classifier for video-based action recognition tasks. (en_US)
dc.identifier.uri: http://hdl.handle.net/11375/26127
dc.language.iso: en (en_US)
dc.subject: Deep Learning (en_US)
dc.subject: Computer Vision (en_US)
dc.subject: Artificial Intelligence (en_US)
dc.subject: Efficient Inference (en_US)
dc.subject: Regularization (en_US)
dc.subject: Loss Function (en_US)
dc.subject: Machine Learning (en_US)
dc.subject: Distillation (en_US)
dc.title: Confidence Distillation for Efficient Action Recognition (en_US)
dc.type: Thesis (en_US)

Files

Original bundle

Name: manzurishalmani_shervin_finalsubmission202012_masters.pdf
Size: 7.76 MB
Format: Adobe Portable Document Format
Description: Primary Thesis File

License bundle

Name: license.txt
Size: 1.68 KB
Format: Item-specific license agreed upon to submission