Scalable Rate Control
Scalable Rate Control
6, SEPTEMBER 2000
QCIF to CCIR-601), various temporal resolutions (e.g., 7.5–30 our scheme, not only more accurate bit-rate control with buffer
fps), various coders (e.g., DCT and wavelet), and various regulation is achieved, but scalability is also preserved for all
granularities of video object (VO) (e.g., single VO to multiple test video sequences in various applications. For example, in the
VOs, frame-layer to macroblock (MB)-layer). object-based video coding of the emerging ISO MPEG-4 inter-
In developing a rate-control technique, there are two widely national standard, it is very important to appropriately allocate
used approaches: 1) an analytical model-based approach and bits among different VOs. In allocating bits among VOs, video
2) an operational rate-distortion (R-D) based approach. In the contents and coding complexity must be considered. Otherwise,
model-based approach, various distribution and characteristics over- and under-runing of the bit budget can occur. In general,
of signal source models with associated quantizers are consid- a rate-control scheme should spend more bits in the VO of user
ered. Based on the selected model, a closed-form solution is de- interests (e.g., foreground VO) than in other areas (e.g., back-
rived using optimization theory. Such a theoretical optimization ground VO). Without employing proper bit allocation, for ex-
solution cannot be implemented easily because there is only a fi- ample, the background VO could have excellent quality, while
nite discrete set of quantizers and the source signal model varies the foreground VO could suffer from lots of annoying distortion,
spatially. Alternatively, an operational R-D based approach is even though bits are evenly distributed. By considering video
used in practical coding environment. For example, to minimize contents and coding complexity in our quadratic R-D model,
the overall coding distortion subject to a total bit budget con- our rate-control scheme with joint buffer control can dynami-
straint, lots of techniques based on dynamic programming or cally and appropriately allocate the bits among VOs to meet the
Lagragian multiplier for optimization solutions have been de- overall bit-rate requirement with uniform video quality.
veloped [7]–[12]. These methods share the similar concept of Another unique merit of MPEG-4 VO-based coding from
data pre-analysis. By analyzing the R-D characteristics of fu- the other video coding standards like MPEG-1, MPEG-2,
ture frames, the bit-allocation strategy is determined afterwards. and H.263 is that an encoder can separately encode any
The Lagragian multiplier is a well-known technique for optimal VO from the rest and transmit these individual elementary
bit allocation in image and video coding, but with an assump- bitstreams independently. In this category of applications, our
tion that the source consists of statistically independent com- proposed joint-buffer rate-control scheme seems imposing
ponents. Thus, an inter-frame based coding may not find the fairly restrictive conditions that each VO shares the same buffer
Lagragian multiplier approach applicable because of the tem- management. As a matter of fact, the proposed rate-control
poral dependency. Although Ramchandran [9] takes frame de- scheme can quite easily handle this scenario. A simple way is
pendencies into account in bit-rate control, its potentially high to consider each individual VO as a “frame” so that each VO
complexity with increasing the operating R-D points make it can operate on its own (i.e., separate buffer control and separate
unsuitable for the applications requiring interactivity or low en- R-D model and so on). In this case, the frame-level rate control
coding delay. In [13], [14], Ding investigated a joint encoder and can be directly applied with minor modifications (e.g., initial
channel rate-control scheme for VBR video over ATM networks setting of the buffer fullness). Note that since there is no joint
and claimed that the rate-control scheme has to balance both is- buffer control in this case, before an encoder starts encoding
sues of consistent video quality in the encoder side and bitstream each VO, the application needs to specify the buffer condition
smoothness for statistical multiplexing gain in the network side. for each VO.
Tao et al. [15] proposed a parametric R-D model for MPEG en- With its precision of the R-D model and ease of implemen-
coders, especially for the picture-level rate control. Based on tation, our rate-control scheme with the following new con-
the bit-rate mquant model, the desired mquant is calculated and cepts and techniques has been adopted as part of the rate-con-
used for encoding every MB by combining with appropriate trol scheme in the International Standard of the emerging ISO
quantization matrix entry in a picture. A normalized parametric MPEG-4 standard:
R-D model based approach [16] has been also developed for
H.263-compatible video codecs. By providing good approxima- 1) a more accurate second-order R-D model for the target
tions of all 31 rate-distortion relations, the authors claim that the bit-rate estimation;
proposed model offers an efficient and less memory requirement 2) a sliding-window method for smoothing the impact of
approach to approximate the rate and distortion characteristics scene change;
for all QPs. Recently, Vetro and Sun [17], [18] and Ribas-Cor- 3) an adaptive selection criterion of data points for better
bera and Lei [19], [20] also proposed rate-control techniques for model updating process;
MPEG-4 object-level and MB-level video coding, respectively. 4) an adaptive threshold shape control for better use of bit
However, most of the aforementioned techniques only focus on budget;
a single coding environment, either frame level, object level, or 5) a dynamically bit-rate allocation among VOs with dif-
macro level. None of these techniques demonstrates its applica- ferent coding complexities.
bility to MPEG-4 video coding including the above three coding The proposed rate-control scheme provides a scalable solution,
granularities simultaneously. meaning that our rate-control technique offers a general frame-
In this paper, based on a newly revised quadratic R-D model, work for multiple layers of controls for objects, frames, and
our SRC proposes a single framework which is designed to MBs in various coding contexts.
meet both VBR without delay constraints and CBR with low- The rest of the paper is organized as follows. Section II re-
latency and buffer constraints. With this scalable framework views the theoretical foundation of the proposed rate control,
based on a new R-D model and several new concepts [21] in and characterizes new features for generalizing and improving
880 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 6, SEPTEMBER 2000
the R-D models. In Section III, detailed descriptions of our gen- A. Scalable Quadratic Rate Distortion Model
eralized rate control are presented. Some fundamental research To illustrate the rationale of quadratic R-D function mod-
problems for rate-control scheme in various coding granular- eling, we summarize the result derived in [22], [23]. Assuming
ities are also addressed. In Section IV, extensive experiments that the source statistics are Laplacian distributed
are conducted to evaluate the performance of the scheme. This
paper concludes with Section V.
where
II. FRAMEWORK OF SRC
the distortion measure is defined as , then
In this section, we describe the framework of the proposed
SRC scheme which provides an integrated approach with three there is a closed-form solution for the R-D functions as derived
in [24]
different coding granularities, including frame level, object
level, and MB level. The theoretical foundation behind the
proposed rate-control scheme is based on the R-D model, where
the distortion is measured in terms of quantization parameter.
The block diagram of the proposed SRC is depicted in Fig. 2, where
where the proposed rate control consists of four stages: 1)
initialization stage; 2) pre-encoding stage; 3) encoding stage;
and 4) post-encoding stage.
LEE et al.: SCALABLE RATE CONTROL FOR MPEG-4 VIDEO 881
(3) (4)
to prevent buffer overflow, is its major drawback. Note that since linear regression technique, the two model parameters and
our rate control is a GOP basis, the insertion of I-frame can can be obtained as
be viewed as the resume of another GOP round, thus our rate
control still applicable.
Step 2—Calculation of the Model Parameters and :
Based on these two values, the theoretical target bit rate can
be calculated for each data point within the sliding window ob- (5)
tained in step 1. For those selected data points, the encoder col-
lects quantization levels and actual bit-rate statistics. Using a
884 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 6, SEPTEMBER 2000
then the target bit rate is recalculated based on the new model count difference is depicted in Fig. 3, where the -axis denotes
parameters. By applying this statistical technique, more repre- the bit-count difference of a coded P frame from the average
sentative data are selected, and the final model parameters can coding bit count in a sequence or GOP, while the -axis denotes
be derived based on new data. Note that, to avoid the removal the number of P frames. In this example, the average bit count
of all data points, the latest data point is always selected in the for a P-frame is calculated as (24 kbits sec kbit)/100
data set. frames kbits, given 1 kbit is used for coding the first
I frame. If the target buffer level is in the middle level (i.e.,
C. Predictive Frame-Skipping Control 12 kbits), then the encoder has a upper or lower half of 6/2.39
Ideally, if the input bit rate to the buffer is equal to the channel frames for buffering any sudden bit count increasing, or de-
output rate, then the buffer fullness will keep its middle level, creasing, respectively. Considering the buffer-overflow case, if
which is the target buffer level. Otherwise, it changes its full- a threshold value is properly set, then there is only a slight
ness upwards or downwards which may potentially cause buffer chance of overflowing the buffer. In this case, if the threshold
overflow or underflow, respectively. To effectively prevent the is set as 80% of the buffer size (i.e., 4.8 kbits), the consecutive
buffer overflow without losing overall video quality due to con- frame skipping occurs very rarely from the statstical viewpoint.
tinuous frame skipping, we proposed a predictive frame-skip- This empirical assumption by setting a proper threshold demon-
ping control. By setting a safety threshold, if an encoder predicts strates although our predictive frame-skipping scheme cannot
to have a potential buffer overflow, then it skips next frame (ob- guarantee no consecutive frame skipping occurring; however,
ject) and subtracts the buffer fullness from the channel output there exist fairly high confidence that consecutive frame skip-
rate. The safety threshold is selected based on the distribution ping rarely occurs.
of bit-count difference of a predicted frame (object) from the The proposed frame-skipping mechanism is described as fol-
average bit count of coded frame (object) in a video sequence lows. Before encoding the next frame, the encoder first exam-
or GOP. For example, a containership video sequence with 300 ines the current buffer occupancy and the estimated target bit
qcif frames is coded as the first I-frame and subsequent P-frame rate for the next frame. If the current buffer fullness plus the es-
coding pattern at 10 fps and 24 kbits/s. Its distribution of the bit- timated frame bits for the next frame is larger than some pre-de-
886 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 6, SEPTEMBER 2000
(a) (b)
(c) (d)
(e) (f)
Fig. 7. Frame-level rate control: the buffer occupancy for various test conditions. SVORC: (a) silent qci f at 10 fps and 24 kbits/s; (b) hall qci f at 7.5 fps and
10 kbits/s; (c) mad qci f at 10 fps and 24 kbits/s; (d) madl qci f at 7.5 fps and 10 kbits/s; (e) news cif at 15 fps and 112 kbits/s. (f) news cif at 7.5 fps and
48 kbits/s.
LEE et al.: SCALABLE RATE CONTROL FOR MPEG-4 VIDEO 887
TABLE I
VBR EXPERIMENTAL RESULTS OF THE FRAME-LEVEL RATE CONTROL
TABLE II
CBR EXPERIMENTAL RESULTS OF THE FRAME-LEVEL RATE CONTROL
termined threshold, called the safety margin, for instance 80% However, this simple scheme suffers several serious problems.
of the buffer size, the next frame will be skipped. Note that the For example, the background VO may have bits left unused,
safety margin is used to reduce continuous frame skipping and while the foreground VO requires more. Therefore, we propose
can be adaptively changed based on the content, or a pre-deter- a new bit-allocation method to adaptively adjust the bit budget
mined constant for simplicity. An example, as shown in Fig. 4, for each VO. Based on the coding complexity and perceptual
demonstrates our frame-skipping control. Before encoding the importance, the distribution of the bit budget is proportional to
frame , the encoder uses the actual bit count used for the the square of MAD of a VO, which is obtained empirically [27].
frame to predict the next frame . Then the buffer As long as the target bit rate is known at the frame level, say ,
occupancy will exceed 80% of the buffer size, resulting in skip- the target bit rate for VO at time , , is
ping the frame and decreasing the buffer occupancy by the
channel output rate. The frame-skipping condition can be for- (7)
mulated as follows.
where MAD MAD MAD MAD and
WHILE (buffer fullness + are the number of VOs in the coding frame at time .
actual bitcounts for last frame 0 channel output rate In addition to the distribution of bits among VOs in the
buffer size3 skip_margin) f spatial domain, we also need consider the composition of
Skip the next frame; VOs in the temporal domain. Since each VO has different
buffer fullness = buffer fullness 0 channel output rate coding complexity (e.g., low or high motion), to obtain better
coding efficiency, it is fairly straightforward to encode the
VOs at different frame rates, then the decoder can display
D. Dynamic Target Bit-Rate Distribution Among VOs the composite video. However, our experimental results show
To estimate the target bit rate for each VO, a straightfor- that the significant quality deterioration is experienced in the
ward way is to allocate an equal number of bits to each VO “gluing” boundary of VOs. Thus, encoding the VOs at the same
without considering its complexity and perceptual importance. frame rate is a better alternative to yield better video quality.
888 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 6, SEPTEMBER 2000
TABLE III
EXPERIMENTAL RESULTS OF MULTIPLE VO’S RATE CONTROL
or (8) END
;
(a) (b)
(c) (d)
(e) (f)
Fig. 8. Object-level rate control: the buffer occupancy for various test conditions. MVORC: (a) akiyoq ci f at 10 fps and 24 kbits/s; (b) akiyo qci f at 15 fps and
64 kbits/s; (c) coastguardq ci f at 10 fps and 48 kbits/s; (d) news ci f at 15 fps and 192 kbits/s. (e) containerq ci f at 10 fps and 24 kbits/s. (f) container qci f
at 15 fps and 64 kbits/s.
LEE et al.: SCALABLE RATE CONTROL FOR MPEG-4 VIDEO 891
(a) (b)
(c) (d)
(e) (f)
Fig. 9. MB-level rate control: the buffer occupancy for various test conditions under the maximum accumulated delays are 500 ms (left column) and 250 ms
(right column). (a), (b): MBRC coastguard qci f at 10 fps and 24 kbits/s. (c), (d): MBRC container qci f at 10 fps and 24 kbits/s. (e), (f): mad qci f at 10 fps
and 24 kbits/s.
892 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 6, SEPTEMBER 2000
(g) (h)
(i) (j)
Fig. 9. (Continued). MB-level rate control: the buffer occupancy for various test conditions under the maximum accumulated delays are 500 ms (left column)
and 250 ms (right column). (g), (h): news ci f at 7.5 fps and 192 kbits/s. (i), (j): silent qci f at 10 fps and 24 kbits/s. (i), (j): silent qci f at 10 fps and 24 kbits/s.
TABLE IV
EXPERIMENTAL RESULTS OF MB-LAYER RATE CONTROL GIVEN THE MAXIMUM ACCUMULATED DELAYS ARE 500 AND 125 MS
LEE et al.: SCALABLE RATE CONTROL FOR MPEG-4 VIDEO 893
though there exist fairly large dB differences up to 5, 3, 7, 3, The proposed SRC has shown the following advantages: 1) low
12, and 11 dB between objects in the respective tests. It is ob- latency and the limited buffer constrains are satisfied for CBR
vious that the background object has pretty much static with applications; 2) the VBR quality is maintained; 3) both the target
very little difference from its previous frame. In this case, the bit rate and the target frame rate are obtained within a negligible
peak signal-to-noise-ratio (PSNR) value is not a sufficient in- error; and 4) easy extension to the multiple VOs and MB layer.
dicator for visual quality. Our experimental results show that The proposed SRC has been adopted in the International Stan-
equal or similar PSNR values in this case create more coding dard of the emerging MPEG4 standard.
artifacts in background objects and in a composite frame. So,
even the PSNR values are fairly different among VOs; however, REFERENCES
their composite video frame shows a better visual quality. [1] Coding of Moving Pictures and Associated Audio ISO/IEC 14 496-2,
Next the buffer occupancy is examined, and plotted in Fig. 8. MPEG-4 Committee Draft ISO/IEC JTC1/SC29/WG11, Oct. 1997.
[2] Coding of Moving Pictures and Associated Audio ISO/IEC 14 496-2,
In our experiments, a single joint buffer control is used and the MPEG-4 Committee Draft ISO/IEC JTC1/SC29/WG11, Oct. 1998.
maximum accumulated delay is set to be 500 ms for the en- [3] H. Zhang and D. Ferrari, “Rate-controlled static priority queueing,” in
coding process. As shown in Fig. 8, the buffer occupancy is Proc. IEEE INFOCOM’93, Apr. 1993, pp. 227–236.
maintained at around 50%–60% of the buffer size with a small [4] D. Ferrari and D. Verma, “A scheme for real-time channel establish-
ment in wide-area networks,” IEEE J. Select. Areas Commun., vol. 8,
variation, as expected, even for the case that there are up to six pp. 368–379, Apr. 1990.
VOs (e.g., container video sequence). It also demonstrates that [5] L. Zhang, “Virtual clock: A new traffic control algorithm for packet
with our rate-control scheme, the buffer overflow or underflow switching networks,” in Proc. ACM SIGCOMM’90, Sept. 1990, pp.
19–29.
does not occur for all test conditions of high- and low-bit rate [6] S. J. Golestani, “A stop-and-go queueing framework for congestion
coding contexts. management,” in Proc. ACM SIGCOMM’90, Sept. 1990, pp. 8–18.
[7] Y. Shoham and A. Gersho, “Efficient bit allocation for an arbitrary set
of quantizers,” IEEE Trans. Acoust., Speech, Signal Processing, vol. 36,
C. MB-Level Rate Control pp. 1445–1453, 1988.
[8] S. W. Wu and A. Gersho, “Rate-constrained optimal block-adaptive
Table IV reports the results of MB layer rate control under coding for digital tape recoding of HDTV,” IEEE Trans. Circuits Syst.
two maximum accumulated latencies which are set to be 500 Video Technol., vol. 1, Mar. 1991.
and 250 ms, respectively. Compared to the frame-level rate con- [9] K. Ramchandran, A. Ortega, and M. Vetterli, “Bit Allocation for depen-
dent quantization with applications to multiresolution and MPEG video
trol, the MB-level rate control generates more accurate target coders,” IEEE Trans. Image Processing, vol. 3, pp. 533–545, Sept. 1994.
bit estimation, resulting in the higher frame rate, and lower la- [10] A. Ortega, K. Ramchandran, and M. Vetterli, “Optimal trellis-based
tency delay at the expense of slightly quality loss. For the case of buffered compression and fast approximations,” IEEE Trans. Image
Processing, vol. 3, pp. 26–40, Jan. 1994.
the maximum accumulated delay equal to 250 ms, a larger—but [11] J. Choi and D. Park, “A stable feedback control of the buffer state using
within the tolerance—fluctuation of buffer occupancy is experi- the controlled Langrange multiplier method,” IEEE Trans. Image Pro-
enced, as expected, since a smaller buffering is used to smooth cessing, vol. 3, pp. 546–558, Sept. 1994.
[12] L. J. Lin and A. Ortega, “Bitrate control using Piecewise approximated
out the bit rate fluctuation of encoding process (see Fig. 9). rate-distortion characteristics,” IEEE Trans. Circuits Syst. Video
However, under this strict test condition, no buffer overflow oc- Technol., vol. 8, pp. 446–459, Aug. 1998.
curs, proving the robustness and stability of our MB rate-control [13] W. Ding, “Rate control of MPEG video coding and recording by rate-
quantization modeling,” IEEE Trans. Circuits Syst. Video Technol., vol.
scheme. The results show that our MB-level rate control can be 6, pp. 12–20, Feb. 1996.
applied for the various applications with higher bit rate and strict [14] , “Joint encoder and channel rate control of VBR video over ATM
buffer regulations. networks,” IEEE Trans. Circuits Syst. Video Technol., vol. 7, Aug. 1997.
[15] B. Tao, H. A. Peterson, and B. W. Dickinson, “A Rate-Quantization
model for MPEG Encoders,” in Proc. 1997 Int. Conf. Image Processing,
V. SUMMARY AND CONCLUSIONS Oct. 1997, pp. 338–341.
[16] K. H. Yang, A. Jacquin, and N. S. Jayant, “A normalized rate-distor-
In this paper, we proposed a single, integrated and SRC tion model for H.263-compatible codecs and its application to quantizer
scheme for different coding granularity including the frame selection,” in Proc. 1997 Int. Conf. Image Processing, Oct. 1997, pp.
41–44.
level, the object level, and the MB level. Our scheme not only [17] Joint rate control for multiple-VO, MPEG96/M1890 ISO/IEC
provides a framework for the very low bit-rate control, but also JTC/SC29/WG11, Apr. 1997.
comes up with several new techniques to improve the accuracy [18] A. Vetro, H. F. Sun, and Y. Wang, “MPEG-4 rate control for multiple
video objects,” IEEE Trans. Circuits Syst. Video Technol., vol. 9, pp.
of the R-D model. Those new methods or concepts are listed 186–199, Feb. 1999.
below: [19] J. Ribas-Corbera and S. M. Lei, “Contribution to rate control Q2 exper-
iment: A quantizer control tool for achieving target bitrates accurately,”
1) a more accurate R-D model which is scalable with MAD; Coding of Moving Pictures and Associated Audio MPEG96/M1812
2) a dynamically bit-rate allocation among VOs with various ISO/IEC JTC/SC29/WG11, Sevilla, Spain, Feb. 1997.
coding complexities; [20] , “Rate control in DCT video coding for low-delay communica-
tions,” IEEE Trans. Circuits Syst. Video Technol., vol. 9, pp. 172–185,
3) a sliding-window mechanism for smoothing the impact of Feb. 1999.
scene change; [21] H.-J. Lee, T. Chiang, and Y.-Q. Zhang, “Scalable rate control for very
4) an adaptive selection criterion of data points for the R-D low bitrate video,” in Proc. 1997 Int. Conf. Image Processing, vol. II,
Oct. 1997, pp. 768–771.
model update process; [22] T. Chiang, “A rate control scheme using a new rate-distortion model,”
5) an adaptive threshold setting for rate reduction in shape JCoding of Moving Pictures and Associated Audio MPEG95/0436
coding; TC1/SC29/WG11, Dallas, TX, Nov. 1995.
[23] T. Chiang and Y.-Q. Zhang, “A new rate control scheme using a new
6) an effective frame-skipping control for the prevention of rate-distortion model,” IEEE Trans. Circuits Syst. Video Technol., pp.
the potential buffer-overflow problem. 246–250, Feb. 1997.
894 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 6, SEPTEMBER 2000
[24] A. Viteribi and J. Omura, “A new rate control scheme using a new Tihao Chiang (S’90–M’95–SM’99) was born
rate-distortion model,” in Principle of Digital Communication and in Cha-Yi, Taiwan, R.O.C., in 1965. He received
Coding. New York: McGraw-Hill, 1979. the B.S. degree from National Taiwan University,
[25] Coding of Moving Pictures and Associated Audio MPEG96, MPEG-4 Taipei, Taiwan, in 1987, the M.S. degree from
video verification model V5.0 ISO/IEC JTC1/SC29/WG11, Nov. 1996. Columbia University in 1991, and the Ph.D. degree
[26] Coding of Moving Pictures and Associated Audio MPEG96, MPEG-4 from Columbia University, New York, in 1995, all
video verification model V8.0 ISO/IEC JTC1/SC29/WG11, July 1997. in electrical engineering.
[27] H.-J. Lee, T. Chiang, and Y.-Q. Zhang, “Multiple-VO Rate Control,” In 1995, he joined David Sarnoff Research Center,
ISO/IEC JTC/SC29/WG11 MPEG97/M2554, July 1997. Princeton, NJ, as a Member of Technical Staff. He
[28] T. Chiang, H.-J. Lee, and Y.-Q. Zhang, “Macroblock Layer Rate Con- was later promoted to Technology Leader and then
trol,” ISO/IEC JTC/SC29/WG11 MPEG97/M2555, July 1997. Program Manager. While at Sarnoff, he led a team
[29] S. Ryoo, J. Shin, and E. Jang, “Rate control tool: Based on human visual of world-class researchers and developed an optimized MPEG-2 software en-
sensitivity (HVS) for low bitrate coding,” ISO/IEC JTC/SC29/WG11 coder. For his work in the encoder and MPEG-4 areas, he received two Sarnoff
Coding of Moving Pictures and Associated Audio MPEG96/M0566, Achievement Awards and three Sarnoff team awards. In September 1999, he
Munich, Germany, Jan. 1996. joined the faculty at National Chiao-Tung University, Taiwan, R.O.C. Since
1992, he has actively participated in ISO’s MPEG digital video-coding stan-
dardization process, with particular focus on the scalability/compatibility issue.
He is currently the co-chair for encoder optimization on the MPEG-4 com-
mittee, has made more than 40 contributions to the MPEG committee over
the past eight years, and has co-authored the rate-control technology that was
adopted as part of the MPEG-4 International Standards in 1998. He holds two
U.S. patents and more than 10 pending patents. He published over 20 technical
journal and conference papers in the field of video and signal processing. His
main research interests include compatible/scalable video compression, stereo-
scopic video coding, and motion estimation.