Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

Prior-guided Neural Compression of Visual Data

dc.contributor.advisorWu, Xiaolin
dc.contributor.authorXu, Hao
dc.contributor.departmentElectrical and Computer Engineeringen_US
dc.date.accessioned2025-08-28T15:17:22Z
dc.date.available2025-08-28T15:17:22Z
dc.date.issued2025
dc.description.abstractIn recent years, deep learning methods have been widely applied in the field of visual data compression, some of which have delivered the best rate-distortion performances in history. The target domain of neural data compression started from that of still images and then rapidly expanded into various modalities, including videos, point clouds, and the emerging 3D Gaussian Splatting (3DGS) representations. However, steady progresses of the neural compression research aside, some technical challenges still remain. One of them is the high computational costs of current neural compression models. Although more complex neural network architectures often yield improved compression performance, the marginal benefit of blindly increasing model complexity is diminishing. For most applications the rapidly increased cost cannot be justified for ever small coding gains. Unless better cost-performance ratio is achieved, neural visual data compression methods are unlikely to replace traditional compression standards that are deeply entrenched in real world. To address this issue, we advocate to incorporate known priors, such as signal sparsity and efficient space covering structures in quantization, into the design of neural compression architecture. Compared with the mainstream pure big data-driven black box learning approach, our prior-guided design approach can lead to significant coding gains with no or very small overheads in either model size or computational complexity, improving cost-effectiveness in practical deployment. With such motivations we design, implement and experiment with a series of neural compression models for three visual data modalities: images, 3D point clouds, and 3DGS scene representations. Extensive experimental results demonstrate that our prior-guided neural compression models can deliver rate-distortion performance comparable to state-of-the-art methods at a much reduced resource level. In future research, we plan to further optimize the proposed paradigm to discover even more efficient and practical neural compression models, and in parallel we will expand its applications to other data modalities, such as volumetric video and panoramic video.en_US
dc.description.degreeDoctor of Philosophy (PhD)en_US
dc.description.degreetypeThesisen_US
dc.identifier.urihttp://hdl.handle.net/11375/32258
dc.language.isoenen_US
dc.titlePrior-guided Neural Compression of Visual Dataen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Xu_Hao_202508_PhD.pdf
Size:
73.82 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: