Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

On the Rate-Distortion-Perception Tradeoff for Lossy Compression

dc.contributor.advisorChen, Jun
dc.contributor.authorQian, Jingjing
dc.contributor.departmentElectrical and Computer Engineeringen_US
dc.date.accessioned2023-10-02T18:17:53Z
dc.date.available2023-10-02T18:17:53Z
dc.date.issued2023
dc.description.abstractDeep generative models when utilized in lossy image compression tasks can reconstruct realistic looking outputs even at extremely low bit-rates, while traditional compression methods often exhibit noticeable artifacts under similar conditions. As a result, there has been a substantial surge of interest in both the information theoretic aspects and the practical architectures of deep learning based image compression. This thesis makes contributions to the emerging framework of rate-distortion-perception theory. The main results are summarized as follows: 1. We investigate the tradeoff among rate, distortion, and perception for binary sources. The distortion considered here is the Hamming distortion and the perception quality is measured by the total variation distance. We first derive a closed-form expression for the rate-distortion-perception tradeoff in the one-shot setting. This is followed by a complete characterization of the achievable distortion-perception region for a general representation. We then consider the universal setting in which the encoder is one-size-fits-all, and derive upper and lower bounds on the minimum rate penalty. Finally, we study successive refinement for both point-wise and set-wise versions of perception-constrained lossy compression. A necessary and sufficient condition for point-wise successive refinement and a sufficient condition for the successive refinability of universal representations are provided. 2. Next, we characterize the expression for the rate-distortion-perception function of vector Gaussian sources, which extends the result in the scalar counterpart, and show that in the high-perceptual-quality regime, each component of the reconstruction (including high-frequency components) is strictly correlated with that of the source, which is in contrast to the traditional water-filling solution. This result is obtained by optimizing over all possible encoder-decoder pairs subject to the distortion and perception constraints. We then consider the notion of universal representation where the encoder is fixed and the decoder is adapted to achieve different distortion-perception pairs. We characterize the achievable distortion-perception region for a fixed representation and demonstrate that the corresponding distortion-perception tradeoff is approximately optimal. Our findings significantly enrich the nascent rate-distortion-perception theory, establishing a solid foundation for the field of learned image compression.en_US
dc.description.degreeDoctor of Philosophy (PhD)en_US
dc.description.degreetypeNoneen_US
dc.identifier.urihttp://hdl.handle.net/11375/28976
dc.language.isoenen_US
dc.subjectLossy compressionen_US
dc.subjectInformation Theoryen_US
dc.subjectRate-Distortion-Perception Tradeoffen_US
dc.titleOn the Rate-Distortion-Perception Tradeoff for Lossy Compressionen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Qian_Jingjing_202309_PhD.pdf
Size:
2.08 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: