Joint Video Exploration Team (jvet) of itu-t sg 6 wp and iso/iec jtc 1/sc 29/wg 11


EE5: Improvements on adaptive loop filter



Yüklə 0,73 Mb.
səhifə9/17
tarix15.09.2018
ölçüsü0,73 Mb.
#82166
1   ...   5   6   7   8   9   10   11   12   ...   17

5.5EE5: Improvements on adaptive loop filter


JVET-C0038 EE2.5: Improvements on adaptive loop filter [M. Karczewicz, L. Zhang, W.-J. Chien, X. Li (Qualcomm)]

In this contribution, results of the EE 2.5 testing several modifications to the adaptive loop filter (ALF) in HM16.6 JEM-2.0 are presented. In the modified ALF, the diagonal gradients are taken into consideration during classification, geometric transformations of filter coefficients and prediction from fixed filters are also employed. In addition, some cleanups of the ALF design and software are included. It is reported that 1.0%, 1.2%, 1.1% and 1.5% BD-rate reduction is achieved for AI, RA, LDB and LDP configurations, respectively.

Elements of the proposal:


  • Diagonal gradients taken into consideration during classification.

  • Geometric transformations of filter coefficients (diagonal, vertical flip and rotation) based on the classification results.

  • Prediction from set of the fixed filters.

  • Design and software clean ups, e.g.:

    • Chroma filter shape is aligned with the luma 5x5 filter shape,

    • Context coded bins used for ALF coefficients are replaced by by-pass coded bins with the exception of the on/off flag signalled on the CU level.

The gain is mainly due to the diagonal classification and filtering. Prediction from fixed filters gives a small additional gain

The additional diagonal classification increases complexity. Decoder runtime is increased by roughly 5%, which may also be partially due to the fact that ALF is used more frequently.

Cross-checks confirm results, where however the reported decoding time increase is higher in the reports JVET-C0057, JVET-C0074 and JVET-C0091.

Several experts expressed the opinion that the additional gain in compression performance is attractive and still a reasonable tradeoff versus the increased complexity.



Decision: Adopt all aspects of the proposal into JEM3 and CTC.

JVET-C0057 Cross-check of JVET-C0038 (GALF) [K. Choi, M. Park, E. Alshina, C. Kim (Samsung)] [late]
JVET-C0036 Crosscheck of JVET-C0038 (EE2.5 Improvements on adaptive loop filter) [R. Chernyak (Huawei)] [late]

Subjective testing was performed, and it is reported that mainly the quality is increased, whereas there are few cases where a decrease of quality is observed. This however applies when looking at single frames, whereas for moving video it is hardly recognizable.



JVET-C0074 Cross-check of JVET-C0038 (EE2.5: Improvements on adaptive loop filter) [Y. Yamamoto, T. Ikai (Sharp)] [late]
JVET-C0091 Crosscheck for EE2.5 Improvements on adaptive loop filter (JVET-C0038) [X. Cai, L. Xu, Y. Chiu (Intel)] [late]

5.6EE6: Modification of Merge candidate derivation


JVET-C0035 EE2.6: Modification of Merge candidate derivation: ATMVP simplification and Merge pruning [S. Lee, W.-J. Chien, L. Zhang, J. Chen, M. Karczewicz (Qualcomm)]

In this contribution, merge pruning and ATMVP simplification are proposed to achieve BD rate reduction without complexity increase of both encoder and decoder. With the proposed modifications, additional 0.06%, 0.13%, and 0.25% BD rate reduction over JEM2.0 is achieved with same encoder/decoder complexity for random access, low delay B, and low delay P configurations, respectively.

The first aspect (ATMVP) is a slight simplification, whereas the second aspect has a slight increase in complexity.

In general, the change is small and it gives small benefit for all classes, where however the gain is largest for class E and for specific sequences (e.g. rollercoaster). In very few cases, a small loss is observed.

Some support is expressed for the simplification of ATMVP. For the modification of the pruning in merge, no support is given by other experts.

Decision: Adopt the ATMVP simplification to JEM3.

JVET-C0085 Cross-check of JVET-C0035 (EE2.6: Modification of Merge candidate derivation: ATMVP simplification and Merge pruning) [H. Chen, H. Yang (Huawei)] [late]
JVET-C0060 Cross-check of JVET-C0035 (Modification of merge candidate derivation) [A. Tamse, E. Alshina, C. Kim (Samsung)] [late]
JVET-C0073 Cross-check of JVET-C0035 (EE2.6: Modification of Merge candidate derivation: ATMVP simplification and Merge pruning) [Y. Yasugi (Sharp)] [late]

5.7EE7: TU-level non-separable secondary transform


JVET-C0053 EE2.7: TU-level non-separable secondary transform [X. Zhao, A. Said, V. Seregin, M. Karczewicz, J. Chen, R. Joshi (Qualcomm)]

This contribution reports the results of Exploration Experiment (EE) 2.7 “TU-level non-separable secondary transform”. On top of JEM-2.0, the overall BD-rate gain for luma component is -0.46%, -0.21%, -0.14% and -0.04% for All Intra (AI), Random Access (RA), Low Delay B (LDB) and Low Delay P (LDP) configurations, respectively, with 17% overall encoder run-time saving for AI.



Powerpoint deck not uploaded in v3.

The main elements of the proposal are



  • Hypercubic Givens transform replacing the current secondary transform

  • signalling it at TU level instead of CU (which does mainly impact encoder complexity and only gives small benefit in compression) (note with QTBT CU and TU are identical anyway)

If the new transform is enabled at CU level and can be used in combination with PDPC, the BR reduction reaches 0.62%, but encoder runtime is increased by 50%.

It is claimed that the memory consumption of the new transform is less than current NSST (80%) when it is implemented in butterfly structure. With direct implementation via matrix multiply, it would be increased 3x compared to NSST.

C0053 also contains results “test 1b” which show the replacement of NSST by HyGT in combination with QTBT. In this case, the encoder runtime increases by approx. 25%, giving BR reduction of 0.4-0.5% luma (class A results not available yet). Chroma shows small loss.

The TU level decision does not consider chroma in the decision, the same is the case with the current QTBT combination.

Continue EE with the goal of further investigating the HyGT in combination with QTBT


  • investigate the impact of conditional usage in combination with PDPC (encoder runtime, compression performance)

  • further investigate the chroma loss that is observed.

The results of test 5 indicate that usage of HyGT is useless when TS or LM are on. The same could be the case for the current NSST (note that C0045 investigates that).

JVET-C0058 Cross-check of JVET-C0053 (TU-NSST) [K. Choi, M. Park, E. Alshina, C. Kim (Samsung)] [late]
JVET-C0076 Cross-check of JVET-C0053 (TU-NSST) [V. Lorcy (bcom), P. Philippe (Orange), T. Biatek (TDF)] [late]
JVET-C0086 Cross-check of EE2.7: TU-level non-separable secondary transform [S.-H. Kim (Sharp)] [miss] [late]


Yüklə 0,73 Mb.

Dostları ilə paylaş:
1   ...   5   6   7   8   9   10   11   12   ...   17




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©muhaz.org 2024
rəhbərliyinə müraciət

gir | qeydiyyatdan keç
    Ana səhifə


yükləyin