The Joint Collaborative Team on Video Coding (jct-vc) of itu-t wp3/16 and iso/iec jtc 1/sc 29/wg 11 held its second meeting during 21-28 July, 2010 at the itu-t premises in Geneva, ch



Yüklə 402,98 Kb.
səhifə108/127
tarix09.01.2022
ölçüsü402,98 Kb.
#92585
1   ...   104   105   106   107   108   109   110   111   ...   127

5.12Quantization


JCTVC-B035 [X. Yu, D. He, E.-h. Yang (RIM)] Improved quantization for HEVC

This contribution proposed a quantization scheme for residual coding based on adaptive reconstruction levels, consisting of three steps:



  • First, a "hard decision" quantization is performed: quantization with a rounding offset being zero is conducted, and a reconstruction level is computed as the centroid for each quantization output.

  • Second, fixing the reconstruction level for each quantization output, an RDOQ algorithm is applied to re-calculate quantization outputs.

  • Third, given the quantization decision by RDOQ, the reconstruction levels are updated.

Simulation results reportedly show 0.2 to 0.5 dB gain, compared with RDOQ.

Non-uniform reconstruction was advocated. In this proposal, the encoder sends three values q1, q2, and q on a frame basis – separately for each block size (10 bits each). For i=0, the reconstruction is 0. For |i|=1, the reconstruction is sign(i)*q1. For |i|=2, the reconstruction is sign(i)*q2. For |i|>2, reconstruction is i*q.

This scheme was reportedly tested in the TMuC 0.2 context.

Intra MB usage was disabled for non-I frames, and the technique was not applied to I frames – this was due to the spatial prediction in I frames that causes the reconstruction to affect the prediction of the subsequent blocks.

A 2-15% bit rate improvement per sequence was reported for the non-I frames with this scheme (relative to intra-disabled reference). Only 8 frames were coded for each sequence.

It was remarked that a prior relevant contribution was JVT-P053.

It was remarked by a participant that both the distortion and bit rate change significantly relative to the reference. The number of bits was lower and the PSNR was lower when using the technique – which is somewhat like increasing the QP value.

The short sequence provides less opportunity for error propagation relative to the I frame. The gain seemed best for low-activity sequences – i.e., sequences that rely more on the I frame.

It was remarked that just having the encoder optimally choose a separate QP value for each transform block size might provide some gain.

A participant remarked that for rate control or perceptual reasons, an encoder would change the QP value within a picture – which would affect that ability to use this scheme.

A participant suggested adjusting lambda for a given QP.

A participant noted that the relationship between QP and lambda may be different in the TMuC than in prior designs.

The results seemed somewhat preliminary. There needs to be some way to deal with intra and spatially-adaptive QP selection. And it should be tested relative to using a larger QP in the reference to produce a more similar bit rate and PSNR operating point.

However the concept seems interesting and potentially promising in some form.

Further notes:

Design like rate-constrained Lloyd Max but also taking into account the necessary rate for side info for encoding the reconstruction table of a non-uniform quantizer by yet another lambda-times-rate term.

Proposal to signal only the two innermost reconstruction levels and the stepsize for the outer levels (each by 10 bits). This is done separately for each DCT size.

It is a two-pass encoding process: Encoding is done by uniform quantization first, computing centroids of two innermost reconstruction levels and the q value (for distance of the outer reconstruction levels). These are used for the non-uniform quantizer used in final quantization (RDOQ-like).

It is strange that in the reported results are lower in bitrate and in PSNR as compared to the TMuC results. Apparently, this corresponds to a larger QP value. This could have implications as it is similar to having a larger QP variation between I and P. (this explains relatively larger gain in low activity sequences such as Vidyo).

Only 8 frames were encoded per sequence, results reported for class C, D and E

The current method does not allow change of QP below slice level.

Intra disabled, currently only applied for P pictures, Rate gain reported counts only the P pictures.

Results are preliminary, but further investigation necessary on issues above to take any action.


Yüklə 402,98 Kb.

Dostları ilə paylaş:
1   ...   104   105   106   107   108   109   110   111   ...   127




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©muhaz.org 2024
rəhbərliyinə müraciət

gir | qeydiyyatdan keç
    Ana səhifə


yükləyin