Quantization of random orthonormal matrices with application to adaptive transform coding

Boragolla, B. N. Weerasinghe Mudiyanselage Rashmi Amadini

Quantization of random orthonormal matrices with application to adaptive transform coding

dc.contributor.author	Boragolla, B. N. Weerasinghe Mudiyanselage Rashmi Amadini
dc.contributor.examiningcommittee	Sherif, Sherif (Electrical and Computer Engineering)
dc.contributor.examiningcommittee	Ashraf, Ahmed (Electrical and Computer Engineering)
dc.contributor.examiningcommittee	Bajić, Ivan (Simon Fraser University)
dc.contributor.supervisor	Yahampath, Pradeepa
dc.date.accessioned	2024-10-03T17:03:30Z
dc.date.available	2024-10-03T17:03:30Z
dc.date.issued	2024-09-23
dc.date.submitted	2024-10-01T11:43:38Z	en_US
dc.date.submitted	2024-10-03T16:31:53Z	en_US
dc.degree.discipline	Electrical and Computer Engineering
dc.degree.level	Doctor of Philosophy (Ph.D.)
dc.description.abstract	This thesis explores the quantization of orthonormal random matrices, an important component in the adaptive transform coding of images and video. While vector quantization in Euclidean space has been extensively studied, applying these methods to orthonormal matrices is challenging due to the inherent orthogonality constraints. Directly solving the constrained optimization problem in Euclidean space is difficult. Therefore, this research investigates several novel constructive approaches to the quantization of orthonormal random matrices, specifically for application in video and image compression. These matrices serve as approximations to the Karhunen-Loève Transforms (KLTs) of pixel blocks in images. The main objective of this thesis is to design an optimal codebook of transform matrices that can be utilized in adaptive transform coding, thereby enhancing compression efficiency. This thesis first investigates solving the constrained optimization problem in Euclidean space as an unconstrained optimization problem on the orthonormal matrix manifold. The primary goal is to minimize the mean square error (MSE) of transform coding, and this thesis derives an objective function for minimization based on high-rate analysis. Since the minimization problem lacks a closed-form solution, this thesis proposes a coordinate descent algorithm, which is guaranteed to converge to a local minimum. The proposed algorithm can be used to design both separable and non-separable transforms from sample image data. Experimental results demonstrate that adaptive transform coding using codebooks designed by the proposed algorithm outperforms both non-adaptive coding based on the widely used two-dimensional discrete cosine transform (2D-DCT) and adaptive coding using transform codebooks designed by various recently reported methods. This thesis further investigates a model-based approach to transform matrix codebook learning by modeling natural image blocks as finite lattice non-causal homogeneous Gauss Markov random fields (GMRFs) with Neumann boundary conditions. The proposed method involves the estimation of GMRF parameters. While the standard approach for GMRF parameter estimation is maximum likelihood, this thesis introduces a novel method based on high-rate analysis of transform coding, which focuses on minimizing the MSE of transform coding. In the proposed codebook design approach, the quantization of an orthonormal matrix is carried out in the parameter space of GMRF, transforming the complex task of quantizing a large matrix with orthonormality constraints into a much simpler task of vector quantization in a reduced-dimensional space. A very important advantage of the proposed method is that it can be easily used to design transforms for variable block-size adaptive transform coding. Experimental results are presented for adaptive transform coding of still images which compares the proposed approach against various other alternatives recently reported in the literature. Finally, this thesis addresses the problem of predictive quantization of a random orthonormal matrix process. While predictive quantization is commonly used to quantize correlated processes in Euclidean space, these methods are not directly applicable to processes on a manifold. The approach proposed in this thesis views the prediction problem as one of tracking KLT matrices on the orthonormal matrix manifold. A definition for matrix prediction error on the manifold is proposed. Numerical results demonstrate the effectiveness of utilizing predictive quantization of transform matrices to improve transform coding gain.
dc.description.note	February 2025
dc.identifier.uri	http://hdl.handle.net/1993/38649
dc.language.iso	eng
dc.rights	open access	en_US
dc.subject	Quantization of Random Orthonormal Matrices
dc.subject	Adaptive Transform Coding
dc.subject	GMRF
dc.subject	Predictive quantization
dc.subject	Stiefel Manifold
dc.title	Quantization of random orthonormal matrices with application to adaptive transform coding
dc.type	doctoral thesis	en_US
local.subject.manitoba	no
oaire.awardTitle	University of Manitoba Graduate Fellowship (UMGF)
project.funder.identifier	http://dx.doi.org/10.13039/501100000038, http://dx.doi.org/10.13039/100010318
project.funder.name	Natural Sciences and Engineering Research Council of Canada, University of Manitoba

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Boragolla_Rashmi.pdf
Size:: 9.6 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 770 B
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

FGS - Electronic Theses and Practica