Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/32494
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorZheng, Rong-
dc.contributor.advisorRazavi, Saiedeh-
dc.contributor.authorXu, Renjie-
dc.date.accessioned2025-10-08T18:07:51Z-
dc.date.available2025-10-08T18:07:51Z-
dc.date.issued2025-
dc.identifier.urihttp://hdl.handle.net/11375/32494-
dc.description.abstractEdge computing enables rapid video analytics by processing data closer to the source, thereby reducing end-to-end latency. This gives rise to the paradigm of edge video analytics (EVA). Object detection and object tracking are key building blocks of video analytics pipelines (VAPs), as their outputs directly impact the performance of downstream tasks. In real-world applications like traffic monitoring, timely and accurate responses are critical, as delayed or inaccurate results can compromise safety. However, achieving such an accuracy-efficiency balance at the edge is particularly challenging due to two main factors: the compute-intensive nature of modern Convolutional Neural Network (CNN)- or Vision Transformer (ViT)-based models, and the limited computational and communication resources on edge devices. This thesis aims to improve the efficiency of object detection and tracking pipelines without sacrificing accuracy, enabling efficient and reliable EVA. Conventional pipelines often adopt fixed configurations (e.g., frame resolution and backbone model) or process entire frames uniformly, overlooking the dynamic and spatially diverse nature of video content, resulting in considerable resource waste. To address these limitations, we propose three novel approaches: FastTuner, a model-agnostic framework that dynamically selects the optimal frame resolution and backbone model at runtime to accelerate multi-object tracking (MOT) pipelines; BlockHybrid, which leverages a policy network to classify each frame into “hard” and “easy” blocks, and processes them with either a block-wise detector or a lightweight tracker accordingly; and SEED, an end-to-end framework that couples block selection with block execution, enabling unified and efficient selection and execution of informative blocks in ViT-based object detectors. Extensive evaluations across multiple datasets and deployment scenarios demonstrate the effectiveness and generality of the proposed methods. Together, these contributions pave the way for more adaptive and scalable video analytics in real-world edge environments.en_US
dc.language.isoenen_US
dc.subjectEdge Computingen_US
dc.subjectVideo Analyticsen_US
dc.subjectObject Detectionen_US
dc.subjectObject Trackingen_US
dc.subjectComputer Visionen_US
dc.subjectDeep Learningen_US
dc.titleAccelerating Object Detection and Tracking Pipelines for Efficient Edge Video Analyticsen_US
dc.typeThesisen_US
dc.contributor.departmentComputing and Softwareen_US
dc.description.degreetypeThesisen_US
dc.description.degreeDoctor of Philosophy (PhD)en_US
dc.description.layabstractVideo analytics is a technique that can extract insightful information from videos, driving real-world applications such as traffic monitoring, where rapid and accurate responses are critical for safety. However, existing video analytics pipelines are compute-intensive, making them difficult to run efficiently on resource-constrained edge devices. This thesis proposes three novel approaches that significantly accelerate video analytics without compromising accuracy. These approaches intelligently adjust how videos are analyzed by selecting appropriate resolutions and processing models, and by focusing only on the most informative parts of each frame, greatly reducing unnecessary computation and communication. Extensive experiments demonstrate that the proposed approaches enhance the trade-off between accuracy and efficiency, providing a strong foundation for efficient and reliable edge video analytics.en_US
Appears in Collections:Open Access Dissertations and Theses

Files in This Item:
File Description SizeFormat 
Xu_Renjie_202509_PhD.pdf
Open Access
7.16 MBAdobe PDFView/Open
Show simple item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue