Rate Distortion Theory for Causal Video Coding: Characterization, Computation Algorithm, and Comparison

En-Hui Yang; Lin Zheng; Da-Ke He; Zhen Zhang

doi:10.1109/TIT.2011.2159043

Source

IEEE Transactions on Information Theory, vol. 57, no. 8, pp. 5258-5280, 2011

Abstract

Causal video coding is considered from an information-theoretic point of view. Video source frames $X_1, X_2, \ldots, X_N$ are encoded frame by frame: the encoder for each frame $X_k$ can use all previous frames and all previously encoded frames, while the corresponding decoder can use only the previously encoded frames. Each frame $X_k$ is itself modeled as a source $X_k = \{X_{k}(i) \}_{i=1}^{\infty}$. A novel computation approach is proposed to analytically characterize, numerically compute, and compare the minimum total rate of causal video coding $R_{c}^*(D_1, \ldots,D_N)$ required to achieve a given distortion (quality) level $D_1, \ldots,D_N > 0$. Among other things, the approach includes an iterative algorithm with global convergence for computing $R_{c}^*(D_1, \ldots,D_N)$. The global convergence of the algorithm further makes it possible to demonstrate a somewhat surprising result, dubbed the more and less coding theorem: under some conditions on the source frames and distortion, the more frames that need to be encoded and transmitted, the less data actually has to be sent after encoding. With the help of the algorithm, it is also shown by example that $R_{c}^*(D_1, \ldots,D_N)$ is in general much smaller than the total rate offered by the traditional greedy coding method. As a by-product, an extended Markov lemma is established for correlated ergodic sources.
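
The abstract does not spell out the iterative algorithm for $R_{c}^*(D_1, \ldots,D_N)$, so the sketch below is not that algorithm. For orientation only, it shows the classical Blahut-Arimoto iteration for the single-source rate distortion function $R(D)$, the simplest case of computing a rate distortion quantity by alternating minimization. The function name `blahut_arimoto`, the slope parameter `beta`, and the binary example are assumptions made for illustration and do not come from the paper.

```python
import math


def blahut_arimoto(p_x, dist, beta, iters=200):
    """Classical Blahut-Arimoto iteration for single-source R(D).

    p_x  : source probabilities p(x)
    dist : dist[x][y] is the distortion d(x, y)
    beta : positive slope parameter selecting one point on the R(D) curve
    Returns (rate in bits, expected distortion).
    """
    n_x, n_y = len(p_x), len(dist[0])
    q_y = [1.0 / n_y] * n_y  # start from a uniform reproduction distribution
    cond = []
    for _ in range(iters):
        # Step 1: best test channel Q(y|x) for the current marginal q(y).
        cond = []
        for x in range(n_x):
            w = [q_y[y] * math.exp(-beta * dist[x][y]) for y in range(n_y)]
            z = sum(w)
            cond.append([v / z for v in w])
        # Step 2: marginal q(y) induced by the source and that channel.
        q_y = [sum(p_x[x] * cond[x][y] for x in range(n_x)) for y in range(n_y)]

    distortion = sum(
        p_x[x] * cond[x][y] * dist[x][y] for x in range(n_x) for y in range(n_y)
    )
    # Mutual information I(X;Y) in bits; zero-probability terms contribute 0.
    rate = sum(
        p_x[x] * cond[x][y] * math.log2(cond[x][y] / q_y[y])
        for x in range(n_x)
        for y in range(n_y)
        if cond[x][y] > 0.0
    )
    return rate, distortion


# Uniform binary source with Hamming distortion.
rate, distortion = blahut_arimoto([0.5, 0.5], [[0.0, 1.0], [1.0, 0.0]], beta=2.0)
```

For a uniform binary source under Hamming distortion, the returned pair should lie on the known curve $R(D) = 1 - h(D)$, where $h$ is the binary entropy function. Sweeping `beta` traces out further points on the curve.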