Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

Accelerating Object Detection and Tracking Pipelines for Efficient Edge Video Analytics

dc.contributor.advisorZheng, Rong
dc.contributor.advisorRazavi, Saiedeh
dc.contributor.authorXu, Renjie
dc.contributor.departmentComputing and Softwareen_US
dc.date.accessioned2025-10-08T18:07:51Z
dc.date.available2025-10-08T18:07:51Z
dc.date.issued2025
dc.description.abstractEdge computing enables rapid video analytics by processing data closer to the source, thereby reducing end-to-end latency. This gives rise to the paradigm of edge video analytics (EVA). Object detection and object tracking are key building blocks of video analytics pipelines (VAPs), as their outputs directly impact the performance of downstream tasks. In real-world applications like traffic monitoring, timely and accurate responses are critical, as delayed or inaccurate results can compromise safety. However, achieving such an accuracy-efficiency balance at the edge is particularly challenging due to two main factors: the compute-intensive nature of modern Convolutional Neural Network (CNN)- or Vision Transformer (ViT)-based models, and the limited computational and communication resources on edge devices. This thesis aims to improve the efficiency of object detection and tracking pipelines without sacrificing accuracy, enabling efficient and reliable EVA. Conventional pipelines often adopt fixed configurations (e.g., frame resolution and backbone model) or process entire frames uniformly, overlooking the dynamic and spatially diverse nature of video content, resulting in considerable resource waste. To address these limitations, we propose three novel approaches: FastTuner, a model-agnostic framework that dynamically selects the optimal frame resolution and backbone model at runtime to accelerate multi-object tracking (MOT) pipelines; BlockHybrid, which leverages a policy network to classify each frame into “hard” and “easy” blocks, and processes them with either a block-wise detector or a lightweight tracker accordingly; and SEED, an end-to-end framework that couples block selection with block execution, enabling unified and efficient selection and execution of informative blocks in ViT-based object detectors. Extensive evaluations across multiple datasets and deployment scenarios demonstrate the effectiveness and generality of the proposed methods. Together, these contributions pave the way for more adaptive and scalable video analytics in real-world edge environments.en_US
dc.description.degreeDoctor of Philosophy (PhD)en_US
dc.description.degreetypeThesisen_US
dc.description.layabstractVideo analytics is a technique that can extract insightful information from videos, driving real-world applications such as traffic monitoring, where rapid and accurate responses are critical for safety. However, existing video analytics pipelines are compute-intensive, making them difficult to run efficiently on resource-constrained edge devices. This thesis proposes three novel approaches that significantly accelerate video analytics without compromising accuracy. These approaches intelligently adjust how videos are analyzed by selecting appropriate resolutions and processing models, and by focusing only on the most informative parts of each frame, greatly reducing unnecessary computation and communication. Extensive experiments demonstrate that the proposed approaches enhance the trade-off between accuracy and efficiency, providing a strong foundation for efficient and reliable edge video analytics.en_US
dc.identifier.urihttp://hdl.handle.net/11375/32494
dc.language.isoenen_US
dc.subjectEdge Computingen_US
dc.subjectVideo Analyticsen_US
dc.subjectObject Detectionen_US
dc.subjectObject Trackingen_US
dc.subjectComputer Visionen_US
dc.subjectDeep Learningen_US
dc.titleAccelerating Object Detection and Tracking Pipelines for Efficient Edge Video Analyticsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Xu_Renjie_202509_PhD.pdf
Size:
6.99 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: