Prior-guided Neural Compression of Visual Data

Xu, Hao

Prior-guided Neural Compression of Visual Data

dc.contributor.advisor	Wu, Xiaolin
dc.contributor.author	Xu, Hao
dc.contributor.department	Electrical and Computer Engineering	en_US
dc.date.accessioned	2025-08-28T15:17:22Z
dc.date.available	2025-08-28T15:17:22Z
dc.date.issued	2025
dc.description.abstract	In recent years, deep learning methods have been widely applied in the ﬁeld of visual data compression, some of which have delivered the best rate-distortion performances in history. The target domain of neural data compression started from that of still images and then rapidly expanded into various modalities, including videos, point clouds, and the emerging 3D Gaussian Splatting (3DGS) representations. However, steady progresses of the neural compression research aside, some technical challenges still remain. One of them is the high computational costs of current neural compression models. Although more complex neural network architectures often yield improved compression performance, the marginal beneﬁt of blindly increasing model complexity is diminishing. For most applications the rapidly increased cost cannot be justiﬁed for ever small coding gains. Unless better cost-performance ratio is achieved, neural visual data compression methods are unlikely to replace traditional compression standards that are deeply entrenched in real world. To address this issue, we advocate to incorporate known priors, such as signal sparsity and eﬃcient space covering structures in quantization, into the design of neural compression architecture. Compared with the mainstream pure big data-driven black box learning approach, our prior-guided design approach can lead to signiﬁcant coding gains with no or very small overheads in either model size or computational complexity, improving cost-eﬀectiveness in practical deployment. With such motivations we design, implement and experiment with a series of neural compression models for three visual data modalities: images, 3D point clouds, and 3DGS scene representations. Extensive experimental results demonstrate that our prior-guided neural compression models can deliver rate-distortion performance comparable to state-of-the-art methods at a much reduced resource level. In future research, we plan to further optimize the proposed paradigm to discover even more eﬃcient and practical neural compression models, and in parallel we will expand its applications to other data modalities, such as volumetric video and panoramic video.	en_US
dc.description.degree	Doctor of Philosophy (PhD)	en_US
dc.description.degreetype	Thesis	en_US
dc.identifier.uri	http://hdl.handle.net/11375/32258
dc.language.iso	en	en_US
dc.title	Prior-guided Neural Compression of Visual Data	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Xu_Hao_202508_PhD.pdf
Size:: 73.82 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.68 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Open Access Dissertations and Theses