Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/31624
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Ali, Emadi | - |
dc.contributor.author | Saad, Sara | - |
dc.date.accessioned | 2025-05-06T14:21:21Z | - |
dc.date.available | 2025-05-06T14:21:21Z | - |
dc.date.issued | 2025 | - |
dc.identifier.uri | http://hdl.handle.net/11375/31624 | - |
dc.description.abstract | Bias in automated face detection—where certain underrepresented groups consistently experience degraded or erroneous inference—remains a significant challenge in computer vision. This thesis proposes an efficient, fairness-aware face detection framework that mitigates bias across both demographic groups and other facial attributes, while leveraging GPU acceleration for real-time performance. The core contribution is a novel joint training approach that integrates a Variational Autoencoder (VAE) branch and a reinforcement learning–based rarity-driven resampling strategy into a pre-trained Faster R-CNN detector. This design enables the model to learn latent representations of underrepresented facial features (e.g., rare demographic or appearance traits) and rebalance the training process accordingly—notably without requiring explicit demographic or attribute labels. A rarity-aware sampling policy, guided by the VAE’s latent space and treated as a black-box optimization problem via reinforcement learning, preferentially up-samples infrequent examples during training to improve fairness for minority groups. Using face detection with a ResNet-50 backbone as a challenging testbed, we demonstrate that this approach consistently narrows the performance gap between well-represented and underrepresented groups without degrading overall accuracy. Trained on the CelebA dataset and evaluated on CelebA and FairFace benchmarks, the proposed model achieves significantly higher detection precision (mAP) and localization quality (mean IoU) compared to a baseline Faster R-CNN detector without VAE integration. Specifically, on CelebA, we observed improvement in mean IoU for all attributes (40 attribute groups) ranging from a minimum of +0.8% (Blurry, N = 1926, p < 0.05) to a maximum of +5.3% (Rosy Cheeks, N = 3206, p < 0.001). Similarly, on the FairFace dataset (14 race and gender subgroups), mean IoU improved by +1.3% (Female, N = 30001, p < 0.001) up to +3.8% (Male, N = 31971, p < 0.001). To ensure practical deployment, the trained model was exported via ONNX Runtime and optimized for GPU inference. On GPU, the ONNX-exported model attains nearly a 19× throughput speed-up over CPU execution, enabling real-time face detection. In summary, this research demonstrates a fast and fair face detection method that significantly improves performance on potentially biased datasets without sacrificing efficiency or accuracy. | en_US |
dc.language.iso | en | en_US |
dc.title | GPU ACCELERATED BIAS MITIGATION IN FACE DETECTION USING VARIATIONAL AUTO ENCODERS AND ONNX RUNTIME | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Electrical and Computer Engineering | en_US |
dc.description.degreetype | Thesis | en_US |
dc.description.degree | Master of Applied Science (MASc) | en_US |
dc.description.layabstract | Face detection systems often struggle to perform fairly across different people—detection can be less reliable for some races or genders. This thesis introduces a new face detector designed to work equitably. The system is trained on a large face dataset (CelebA) and evaluated on a balanced demographic dataset (FairFace), addressing biases in both demographics (race, gender) and other factors (like eyeglasses or lighting). The approach uses an advanced AI technique called a Variational Autoencoder, combined with reinforcement learning and a state-of-the-art face detection model, to recognize and compensate for underrepresented facial features. As a result, the system learns to avoid bias and improves detection accuracy for diverse groups. Importantly, it runs efficiently on modern graphics processors in real-time using ONNX Runtime, making the model deployable on different hardware without losing speed. In summary, this research demonstrates a fast and fair face detection method that better serves all demographic groups. | en_US |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Sara_Saad_MASc_Thesis_April29_2025.pdf | 7.4 MB | Adobe PDF | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.