Elsevier

Neurocomputing

Volume 57, March 2004, Pages 361-372
Neurocomputing

Layered video transmission based on genetic programming for lossy channels

https://doi.org/10.1016/j.neucom.2003.10.013Get rights and content

Abstract

This paper presents a novel robust layered video transmission design algorithm for noisy channels. In the algorithm, the 3D SPIHT coding technique is used to encode the video sequences for the transmission of each layer. A new error protection allocation scheme based on genetic programming is then employed to determine the degree of protection for each layer so that the average distortion of the reconstructed images after transmission can be minimized. Simulation results show that, subject to the same amount of redundancy bits for error protection, the new algorithm outperforms other existing algorithms where equal-protection schemes are adopted.

Introduction

In many applications, digital video sequences are required to be transmitted over wireless or computer networks. Users in the networks might have different demands on the quality of the video sequences. Because of the variety of requests in the networks, it might be difficult for a server to provide an encoded bit stream satisfying all the requirements. One solution to this problem is to use the simulcast technique where video sequences are encoded and stored independently for each specific request. This approach requires more resources to be used in the encoder in terms of disk space and management overhead.

To eliminate this drawback, a number of layered transmission design algorithms [2], [7], [5] has been proposed. In the schemes, an encoded bit stream is delivered in more than one layer. By decoding bit streams accumulated up to different upper layers from the base layer, we can reconstruct the transmitted images with different qualities. Users with various requirements therefore share the same system for efficient video transmission. Many techniques such as embedded zero-tree wavelet (EZW) [10], 2D set partitioning in hierarchical trees (2D SPIHT) [9] and JPEG2000 [11] can be employed for realizing layered systems. Nevertheless, these techniques are used only for still images. They may not be well suited for the video sequences having high temporal correlation. The 3D SPIHT [6], which is an extention of the 2D SPIHT for video coding, can be used to solve this problem. The algorithm uses 3D wavelet transform for exploiting both the spatial and temporal correlations. It also provides the embedded bitstreams for attaining layered transmission.

Although the 3D SPIHT is effective for the implementation of layered transmission systems, its performance can be severely degraded when the bitstreams are delivered over lossy channels. A proper error protection scheme therefore is necessary for robust video transmission. The usual equal error protection schemes might not be suited for the SPIHT-based system. This is because the significance of bit streams delivered in different layers in the systems are not equal. The bit streams in lower layers are more important since an error occurs in these bit streams results in failure for decoding bit streams in higher layers.

In light of the facts discussed above, the objective of this paper is to present a novel unequal error protection scheme for the layered video transmission systems realized by the 3D SPIHT. This unequal protection problem is a typical combinatorial optimization problem which seeks the minimum of a given objective function of many variables. The variables interact with each other in a very complicated way, and their relationship usually cannot be expressed analytically.

One way to solve the problem is to use the simulated annealing (SA) [3], [13] technique. It mimics the objective function associated with the combinatorial optimization problem as the energy associated with the physical system. By slowly reducing an appropriately defined effective temperature, it seeks the minimum energy state as the solution for the optimization problem. Although the SA has been found to be effective, one has to be careful with the annealing schedule, the rate at which the temperature is lowered. The schedule achieving global optimum [3] requires tremendous computational complexities, and is not realistic in many applications. Other schedules that accelerate the cooling process can reduce the computational complexities at the expense of possible trapping in a poor local optimum.

The genetic algorithm (GA) [8] is another method for solving the combinatorial problem [1], [4]. It consists of a set of genetic strings where each string is a possible solution for the optimization problem. The regeneration, crossover, and mutation operations are then applied to these strings to obtain a better set of genetic strings. The same process is repeated until the algorithm converges to a nearby global optimum. The performance of GA is insensitive to the selection of algorithm parameters. In addition, since the algorithm is simple to implement, the computational complexities of GA is lower than that of SA. Because of these advantages, we use GA to realize the optimal error protection for layered transmission system. Simulation results show that, as compared with the techniques based on the simple equal protection scheme, the novel algorithm is less sensitive to delivery errors. The GA-based protection scheme therefore is an effective alternative for the applications where both superior rate distortion performance and robust transmission are desired.

Section snippets

Layered transmission with error protection

This section presents the lossy layered transmission system considered in this paper. It consists of three subsections, which present the 3D SPIHT realization, error protection model and performance measures of the layered transmission systems, respectively.

Optimal error protection allocation

The objective of the optimal error protection allocation is to find Nl,l=1,…,L, minimizing the average distortion D subject to the constraint on the transmission rate RRT, where RT is the rate constraint, and can be prespecified before the allocation process. The transmission rate R and average distortion D are defined in , , respectively. That is, our goal is find Nl,l=1,…,L, attaining the operational rate-distortion function D̂(RT) defined byD̂(RT)=min(N1,…,NL:∑l=1LNl⩽RT)D.The Lagrangian

Simulation results

In our experiment, we use the QCIF (176×144) video sequence “foreman” as the source data. We extract 384 frames from the sequence for performance measurement. The frames are partitioned into 24 GOFs containing 16 frames per GOF. The SPIHT-encoded bitstream of each GOF is partitioned into 32 packets (L=32). The lth packet is assigned to the layer l,l=1,…,L. All the packets have identical size B=1000 bytes.

The (Nl,K) Reed–Solomon code will be used at layer l for error protection. To realize the

Concluding remarks

The GA-based allocation has been found to be effective for the design of layered transmission systems. It significantly outperforms the equal allocation algorithm for high packet loss probabilities. Its performance is insensitive to the selection of algorithm parameters. The algorithm is therefore well suited for the applications where high rate-distortion performance is desired for lossy channels.

Acknowledgements

The authors would like to thank the anonymous reviewers for their valuable comments and suggestions.

Wen-Jyi Hwang received his diploma in electronics engineering from National Taipei Institute of Technology, Taiwan, in 1987, and M.S.E.C.E. and Ph.D. degrees from the University of Massachusetts at Amherst in 1990 and 1993, respectively. From September 1993 until January 2003, he was with the Department of Electrical Engineering, Chung Yuan Christian University, Taiwan. In February 2003, he joined the Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal

References (13)

  • P. Andrey et al.

    Unsupervised image segmentation using a distributed genetic algorithm

    Pattern Recognition

    (1994)
  • F. Bosveld et al.

    Hierarchical coding

  • S. Geman et al.

    Stochastic relaxation, Gibbs distributions, and Bayesian restoration of images

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1984)
  • W.J. Hwang et al.

    Genetic entropy-constrained vector quantizer design algorithm

    Opt. Eng.

    (1999)
  • W.J. Hwang et al.

    Layered image transmission based on embedded zerotree wavelet coding

    Opt. Eng.

    (1999)
  • B.J. Kim et al.

    Low bit-rate scalable video coding with 3D set partitioning in Hierarchical trees (3D SPIHT)

    IEEE Trans. Circuits Systems Video Technol.

    (2000)
There are more references available in the full text version of this article.

Cited by (2)

Wen-Jyi Hwang received his diploma in electronics engineering from National Taipei Institute of Technology, Taiwan, in 1987, and M.S.E.C.E. and Ph.D. degrees from the University of Massachusetts at Amherst in 1990 and 1993, respectively. From September 1993 until January 2003, he was with the Department of Electrical Engineering, Chung Yuan Christian University, Taiwan. In February 2003, he joined the Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal University, where he is now a Full Professor. His research interests are centered on multimedia communications systems with particular emphasis on image/video transmission. Dr. Hwang is the recipient of the 2000 Outstanding Research Professor Award from Chung Yuan Christian University, 2002 Outstanding Young Researcher Award from the Asia-Pacific Board of the IEEE Communication Society, and 2002 Outstanding Young Electrical Engineer Award from the Chinese Institute of the Electrical Engineering.

Chien-Min Ou obtained his degree in electronics engineering from Chien-Shin Institute of Technology, in 1978, and M.S. degree in electrical engineering from Chung Yuan Christian University,Taiwan, in 2000. He is currently pursuing the Ph.D. degree at the same university. In 1978, he joined the staff at Ching-Yun Institute of Technology as an instructor of Electronics Engineering. His areas of interest are video signal processing and VLSI design.

Rui-Chuan Lin was born in Keelung, Taiwan, R.O.C., on January, 15, 1981. He received the B.S. degree in 2002 from Chung Yuan Christian University, where he is presently working toward the M.S. degree. His research interests include image coding and multimedia communications.

Wen-Wei Hu was born in I-Lan, Taiwan, R.O.C., on September 27, 1979. He received the B.S. degree in 2002 from Chung Yuan Christian University, where he is presently working toward the M.S. degree. His research interests include image coding and digital signal processing.

View full text