Layered video transmission based on genetic programming for lossy channels
Introduction
In many applications, digital video sequences are required to be transmitted over wireless or computer networks. Users in the networks might have different demands on the quality of the video sequences. Because of the variety of requests in the networks, it might be difficult for a server to provide an encoded bit stream satisfying all the requirements. One solution to this problem is to use the simulcast technique where video sequences are encoded and stored independently for each specific request. This approach requires more resources to be used in the encoder in terms of disk space and management overhead.
To eliminate this drawback, a number of layered transmission design algorithms [2], [7], [5] has been proposed. In the schemes, an encoded bit stream is delivered in more than one layer. By decoding bit streams accumulated up to different upper layers from the base layer, we can reconstruct the transmitted images with different qualities. Users with various requirements therefore share the same system for efficient video transmission. Many techniques such as embedded zero-tree wavelet (EZW) [10], 2D set partitioning in hierarchical trees (2D SPIHT) [9] and JPEG2000 [11] can be employed for realizing layered systems. Nevertheless, these techniques are used only for still images. They may not be well suited for the video sequences having high temporal correlation. The 3D SPIHT [6], which is an extention of the 2D SPIHT for video coding, can be used to solve this problem. The algorithm uses 3D wavelet transform for exploiting both the spatial and temporal correlations. It also provides the embedded bitstreams for attaining layered transmission.
Although the 3D SPIHT is effective for the implementation of layered transmission systems, its performance can be severely degraded when the bitstreams are delivered over lossy channels. A proper error protection scheme therefore is necessary for robust video transmission. The usual equal error protection schemes might not be suited for the SPIHT-based system. This is because the significance of bit streams delivered in different layers in the systems are not equal. The bit streams in lower layers are more important since an error occurs in these bit streams results in failure for decoding bit streams in higher layers.
In light of the facts discussed above, the objective of this paper is to present a novel unequal error protection scheme for the layered video transmission systems realized by the 3D SPIHT. This unequal protection problem is a typical combinatorial optimization problem which seeks the minimum of a given objective function of many variables. The variables interact with each other in a very complicated way, and their relationship usually cannot be expressed analytically.
One way to solve the problem is to use the simulated annealing (SA) [3], [13] technique. It mimics the objective function associated with the combinatorial optimization problem as the energy associated with the physical system. By slowly reducing an appropriately defined effective temperature, it seeks the minimum energy state as the solution for the optimization problem. Although the SA has been found to be effective, one has to be careful with the annealing schedule, the rate at which the temperature is lowered. The schedule achieving global optimum [3] requires tremendous computational complexities, and is not realistic in many applications. Other schedules that accelerate the cooling process can reduce the computational complexities at the expense of possible trapping in a poor local optimum.
The genetic algorithm (GA) [8] is another method for solving the combinatorial problem [1], [4]. It consists of a set of genetic strings where each string is a possible solution for the optimization problem. The regeneration, crossover, and mutation operations are then applied to these strings to obtain a better set of genetic strings. The same process is repeated until the algorithm converges to a nearby global optimum. The performance of GA is insensitive to the selection of algorithm parameters. In addition, since the algorithm is simple to implement, the computational complexities of GA is lower than that of SA. Because of these advantages, we use GA to realize the optimal error protection for layered transmission system. Simulation results show that, as compared with the techniques based on the simple equal protection scheme, the novel algorithm is less sensitive to delivery errors. The GA-based protection scheme therefore is an effective alternative for the applications where both superior rate distortion performance and robust transmission are desired.
Section snippets
Layered transmission with error protection
This section presents the lossy layered transmission system considered in this paper. It consists of three subsections, which present the 3D SPIHT realization, error protection model and performance measures of the layered transmission systems, respectively.
Optimal error protection allocation
The objective of the optimal error protection allocation is to find Nl,l=1,…,L, minimizing the average distortion D subject to the constraint on the transmission rate R⩽RT, where RT is the rate constraint, and can be prespecified before the allocation process. The transmission rate R and average distortion D are defined in , , respectively. That is, our goal is find Nl,l=1,…,L, attaining the operational rate-distortion function defined byThe Lagrangian
Simulation results
In our experiment, we use the QCIF (176×144) video sequence “foreman” as the source data. We extract 384 frames from the sequence for performance measurement. The frames are partitioned into 24 GOFs containing 16 frames per GOF. The SPIHT-encoded bitstream of each GOF is partitioned into 32 packets (L=32). The lth packet is assigned to the layer l,l=1,…,L. All the packets have identical size B=1000 bytes.
The (Nl,K) Reed–Solomon code will be used at layer l for error protection. To realize the
Concluding remarks
The GA-based allocation has been found to be effective for the design of layered transmission systems. It significantly outperforms the equal allocation algorithm for high packet loss probabilities. Its performance is insensitive to the selection of algorithm parameters. The algorithm is therefore well suited for the applications where high rate-distortion performance is desired for lossy channels.
Acknowledgements
The authors would like to thank the anonymous reviewers for their valuable comments and suggestions.
Wen-Jyi Hwang received his diploma in electronics engineering from National Taipei Institute of Technology, Taiwan, in 1987, and M.S.E.C.E. and Ph.D. degrees from the University of Massachusetts at Amherst in 1990 and 1993, respectively. From September 1993 until January 2003, he was with the Department of Electrical Engineering, Chung Yuan Christian University, Taiwan. In February 2003, he joined the Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal
References (13)
- et al.
Unsupervised image segmentation using a distributed genetic algorithm
Pattern Recognition
(1994) - et al.
Hierarchical coding
- et al.
Stochastic relaxation, Gibbs distributions, and Bayesian restoration of images
IEEE Trans. Pattern Anal. Mach. Intell.
(1984) - et al.
Genetic entropy-constrained vector quantizer design algorithm
Opt. Eng.
(1999) - et al.
Layered image transmission based on embedded zerotree wavelet coding
Opt. Eng.
(1999) - et al.
Low bit-rate scalable video coding with 3D set partitioning in Hierarchical trees (3D SPIHT)
IEEE Trans. Circuits Systems Video Technol.
(2000)
Cited by (2)
A tutorial and review on inter-layer fec coded layered video streaming
2015, IEEE Communications Surveys and TutorialsLow complexity adaptive error control for receiver-driven layered video multicast
2006, Journal of the Chinese Institute of Engineers, Transactions of the Chinese Institute of Engineers,Series A/Chung-kuo Kung Ch'eng Hsuch K'an
Wen-Jyi Hwang received his diploma in electronics engineering from National Taipei Institute of Technology, Taiwan, in 1987, and M.S.E.C.E. and Ph.D. degrees from the University of Massachusetts at Amherst in 1990 and 1993, respectively. From September 1993 until January 2003, he was with the Department of Electrical Engineering, Chung Yuan Christian University, Taiwan. In February 2003, he joined the Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal University, where he is now a Full Professor. His research interests are centered on multimedia communications systems with particular emphasis on image/video transmission. Dr. Hwang is the recipient of the 2000 Outstanding Research Professor Award from Chung Yuan Christian University, 2002 Outstanding Young Researcher Award from the Asia-Pacific Board of the IEEE Communication Society, and 2002 Outstanding Young Electrical Engineer Award from the Chinese Institute of the Electrical Engineering.
Chien-Min Ou obtained his degree in electronics engineering from Chien-Shin Institute of Technology, in 1978, and M.S. degree in electrical engineering from Chung Yuan Christian University,Taiwan, in 2000. He is currently pursuing the Ph.D. degree at the same university. In 1978, he joined the staff at Ching-Yun Institute of Technology as an instructor of Electronics Engineering. His areas of interest are video signal processing and VLSI design.
Rui-Chuan Lin was born in Keelung, Taiwan, R.O.C., on January, 15, 1981. He received the B.S. degree in 2002 from Chung Yuan Christian University, where he is presently working toward the M.S. degree. His research interests include image coding and multimedia communications.
Wen-Wei Hu was born in I-Lan, Taiwan, R.O.C., on September 27, 1979. He received the B.S. degree in 2002 from Chung Yuan Christian University, where he is presently working toward the M.S. degree. His research interests include image coding and digital signal processing.