Joint Collaborative Team on Video Coding (jct-vc)


Non-normative: Encoder optimization, decoder speed improvement and cleanup, post filtering, loss concealment, rate control (1)



Yüklə 1,12 Mb.
səhifə17/20
tarix15.09.2018
ölçüsü1,12 Mb.
#82383
1   ...   12   13   14   15   16   17   18   19   20

6.6Non-normative: Encoder optimization, decoder speed improvement and cleanup, post filtering, loss concealment, rate control (1)




6.6.1Rate control




6.6.2Encoder optimization


JCTVC-P0059 Unifying HM and RExt Inter-Prediction Search [K. Sharman, N. Saunders, J. Gamei (Sony)]

Notes in RExt BoG?

6.6.3Software development




6.7Allocation unclear, withdrawn (10)


JCTVC-P0072 Withdrawn
JCTVC-P0103 Withdrawn
JCTVC-P0167 Withdrawn
JCTVC-P0170 Withdrawn
JCTVC-P0206 Withdrawn
JCTVC-P0253 Withdrawn
JCTVC-P0258 Withdrawn
JCTVC-P0259 Withdrawn
JCTVC-P0263 Withdrawn
JCTVC-P0264 Withdrawn
JCTVC-P0314 Withdrawn

7Plenary Discussions and BoG Reports




7.1Project development


The VCEG and MPEG parent bodies held joint meetings with JCT-VC at 1600–1800 Monday, 1600–1800 Wednesday, and 1400–1500 Thursday.

Topics discussed in the joint discussion particularly included the following:



  • Liaison letter from SG 16 to WG 11 (m31807)

    • 4:4:4 8b and 4:4:4 generally

    • Screen content coding CfP

    • FPA versioning

    • extended gamut / high dynamic range

    • interlaced source content

    • medical applications

    • general

Prior-identified issues needing parent-body attention:

  • RExt profiles

  • SEI / VUI / uses of auxiliary pictures development methodology

  • SHVC with non-HEVC base layer (so-called "hybrid" scalability)

  • SHVC colour gamut and bit depth scalability

  • Field/frame issues in regard to scalability (for either AVC or HEVC base layers)

  • Frame packing SEI messages – versioning scheme agreed

New issues highlighted for parent-body attention:

  • Field/frame in regard to scalability (for either AVC or HEVC base layers)

  • New Call for Proposals on Screen Content Coding – issued jointly; technical work to be done in JCT-VC.

  • HEVC version 1 errata

    • CPB removal delay

    • Electro-optical transfer characteristics – esp. for BT.2020

    • MinCR for high tier of Main and Main 10 profiles

    • Desire for high visibility and rapid processing of corrections

    • plan to merge errata into Amd.1

  • 3D-HEVC status (outside the scope of JCT-VC, moved forward for standardization)

  • Interlaced single-layer coding (esp. in re technical changes specifically for that) – study work ongoing in WG 11 parent body with preparation of draft Requirements and possible Call for Proposals

  • Extended colour-gamut / high dynamic range video coding – study work ongoing in parent bodies

  • Colour-related SEI

  • Overlay auxiliary pictures (type code, carriage)

    • Discussion 01-16 Rule for Aux pic type code: 5 unaffiliated proponents and interop needed and achieved

  • Whether functionality of decoding 4:4:4 content with v1 decoders is relevant

    • Discussion 01-16 Not currently of interest

  • Depth view packing SEI message

    • Deferred to the next meeting

  • MFC with depth (similar approach to MVC+D, only texture uses MFC, proposal JCT3V-G0115)

    • Deferred to further study

  • Noted that there is a problem with the coding efficiency of the lossless coding of 16 b 4:4:4 in the design as of 2013-11.

[Add description of resolution outcome on SEI/VUI]

[Add description of on 4:4:4 8b profile]

[Add description of status of 4:4:4 profiles, SG 16 view expressed in m31807, VCEG continues to hold its previous position on 4:4:4 profiles]

[Add detail on hybrid scalability discussion – it was planned to establish a profile for this, but the design was insufficiently mature for current specification]
Regarding 4:4:4 profiles of RExt


  • Decision: Regarding 8 b 4:4:4 profile, the output draft output will say this is TBD, and not include such a profile definition.

  • Decision: Add 16 b 4:4:4 all-intra profile with 3 tiers

  • Wait until March/April to decide on what is in 4:4:4 profiles. The contention is primarily regarding intra block copy.

  • Decision: The output draft will identify all (~5?) identified features [add detail] as TBD in the new profile, and whatever it said before for other 4:4:4 profiles

Ballot results



m32217 Summary of Voting on ISO/IEC 23008-2:201X/PDAM 3 [SC 29 Secretariat]
Regarding colour gamut / bit depth scalability – it was agreed for there to be such a profile specified in the output draft, with a maximum of 10 bit support with the bit depth of the enhancement layer required to be greater than or equal to the bit depth of the base layer.


7.2BoGs


JCTVC-P0288 BoG report on Range Extensions [C. Rosewarne, K. Sharman]

Discussed in JCT-VC plenary Sunday 01-12 am (JRO & GJS).

Further discussed in JCT-VC Wed 01-15 (GJS).

A break out group was held between Thursday the 9th and MonSaturday the 131th of January 2014 to review input documents relating to the Range Extensions. Specifically, topics on RCE1 on entropy coder throughput, RCE2 on Rice parameter initialization and RCE3 on intra block refinement and other technical contributions were discussed.

In addition, contributions indicated that RExt is giving significant compression benefits over AVC High 4:4:4, with up to 8:1 compression improvement reported for screen content (see P0200 and P0213), and around 30% for AI, and 40% to 50% for inter (based on PSNR).

This report conveys a number of recommendations from the BoG. These recommendations are summarised in section 1. Throughout the report, the expression "Recommendation:" was used to express the consensus in the group when a positive action was recommended by the AHG.

(Note: RCE4 was not addressed in the BoG.)

First pass on contributions had been completed.

The current status was reported by the BoG and discussed by JCT-VC:

RCE1:


  • (updated 01-15 and 01-16) If there is a 16-bit profile with a "super tier", using "Option B3a" (setting CABAC to raw bin coding if any coeff_abs_level_remaining codeword is present)Revisit pending analysis of the worst case in light of RawCtuBits limits. An SPS level flag would indicate this usage, and would be required to be 1 in such "super tier" bitstreams – and this requires consideration of whether "super tier" decoders would also be required to decode lower-tier bitstreams. It was remarked that the lower-tier decoding capability requirement might not provide the decoder with the intended complexity reduction, and that this might mean that some profile might be needed that would not be required to decode bitstreams that don't use this feature (e.g., an all-intra profile that has a non-onion-shell relationship with other all-intra profiles). Decision: Agreed (after parent-level review regarding profiling about the need for such a profile, which left the ). Add control flag and low-level specification, prohibit usage in non-16 bit profiles, identify as TBD in 16 bit profiles, identify 16 bit 4:4:4 as TBD.

RCE2:

  • (updated 01-15 and 01-16) Rice parameter initialization – high bit-depth lossless coding had not originally been considered (there was interaction of tested approaches with new AHG18 lossless test results)Revisit pending simulation result of the anchor with the maximum Rice parameter set to 10. After consideration of this issue, it was suggested to focus on the A1 scheme of P0199 with unlimited Rice parameter. Decision: Adopted as described.

RCE3:

  • Padding approaches for intra block copy are not necessary.

  • Initialization of the intra block copy vector predictor is left unchanged. In follow-up discussion about initialization in the JCT-VC plenary Sunday a.m., a contributing participant requested re-consideration of the initialization issue (which currently initializes to zero, which cannot be a valid displacement). As a result of this follow-up discussion, an initialization discussed in JCTVC-P0111 (initialization to displacement (-−W,0)) was adopted. Decision: Initialize to (−W, 0).

(Test results for this initialization value are provided in P0217-v2.)

  • The allowed range of vectors for intra block copy was not recommended to be decreased.

  • Clarification was requested regarding all discrepancies between the intra block copy software and the text. In the JCT-VC review, this was agreed as follows:

    • The intended behaviour of the BV range for different LCU sizes (i.e., always 64 wide versus the current CTB width). Decision: Always 64 wide for luma.

    • Reconstruction process (TU or CU based). The current text already has a restriction that would allow the decoder to use CU-based prediction.

  • (updated 01-15) RevisitAdoption of either of: “NxN only” or “NxN/2NxN/Nx2N” partitioning for intra block copy (with PU-level overlap prohibited). Text for “NxN/2NxN/Nx2N” was provided in an update to P0180. Between these two, the “NxN/2NxN/Nx2N” variation was preferred in plenary discussion. NxN and 2NxN/Nx2N proposals of RCE3 pending resolution of the following: No concerns were raised regarding adoption of this.

    JCT-VC discussion chaired by C. Rosewarne: Decision: Adopted.



    • The earlier discussion was opened for further comments and no new concerns were raised.

    • Chroma block sizes for non-4:4:4 chroma formats (e.g., in a 4:4:4 profile), which can generate 2x2 or 2x4 PBs as tested.

    • Reconstruction process (TU or PU based).

    • An RCE is established if NxN intra block is adopted.

  • (updated 01-15) To establish an RCE if NxN intra block is adopted (i.e. either of the above options) to examine block vector coding and binarization, to include P0149. To be further discussed to decide whether to define an RCE.

Non-CE:

  • Review the remaining 2 contributions under “Source video test material”.

  • To discuss establishing a CE to study worst case code lengths for coeff_abs_level_remaining for high bit-depths and complexity aspects. [P0061]

  • There were 4 remaining contributions to be reviewed.

  • Revisit P0061 on reduction in the worst case escape code length

  • Decision: Adopt P0066 on bit depth correction for CCP.

  • Plenary-level discussion for SEI for FPA. See notes elsewhere on this topic.

  • Add lossless to the AHG18 test conditions and/or establish a CE relating to this

  • (updated 01-16) Decision: Adopt change to SAO for high bit depths (P0222) to send shift values for luma and chroma for the offset value in PPS (no change to classification, with shift value in the range of 0 to Max{BitDepth – 10, 0})remove quantization of factors, but revisit to determine whether a flag or variable shift is necessary.

  • To discuss establishing a CE Form a CE/AHG for motion vector precision [CE still needs to be properly defined]

  • To discuss establishing a CE Form a CE/AHG to examine encoder configuration for CCP, regarding two methods (predict Cr from Cb, and use of a programmable LUT) [CE still needs to be properly defined]

  • Decision: Adopt an additional context for CCP alpha signalling.

  • Further study is recommended on encoder RQT searching following previous changes for CCP.

  • See other notes regarding P0044.

  • (updated 01-16) The group considered moving the rotation and single context enable from the SPS to the PPS conditional on transquant bypass or transform skip being enabled [P0166], but thought that having these aspects switching on a picture-by-picture basis did not seem reasonable, so no action was taken on this.

Further discussion in closing plenary (JRO):

  • Integer MVs for inter prediction gives benefit for screen that was likely rendered with integer precision. Requires changing the interpretation of decoded motion vector (conditional shift), and conditional rounding of predictor. No change of parsing. This appears as a slight increase of worst case complexity, whereas for content types where it is used the average complexity would decrease. Agreed to start a CE. Part of the reported gain could likely be achieved by encoder decision, this should also be investigated in the CE.

  • TU or PU overlap IBC - More study required on complexity impact. No CE.


JCTVC-P0298 BoG report on Palette mode [C. Rosewarne]

Discussed in JCT-VC plenary Sunday 01-12 am (JRO & GJS).

Candidate software and reference configuration(s) for TE or AHG testing were discussed.

The following recommendations are made by the BoG:

Options recommended are either to have one or two basis softwares.

Two basis software option:

New test 1: P0108 also with previous coded palette reuse [P0153].

New test 2: P0231 + previous CU palette prediction [P0114] and the copy-from-above syntax [P0116, P0095] and removal of the copy-from-above for first line [P0116, P0179].

One basis software option:

P0198 less prediction of escape pixel with throughput issue solution [ aspect of P0231] + previous CU palette prediction [P0114, P0153] and the copy-from-above syntax [P0116, P0095] and removal of the copy-from-above for first line [P0116, P0179] + adaptive bit lengths to code the quantized escape value [aspect of P0231] + escape pixel index signaling [aspect of P108].

No consensus on selecting whether to have one or two basis softwares.

As a conclusion, it was discussed in the plenary that the investigation of palette mode coding shall be further conducted in an “AHG on investigation of palette mode coding tools (chairs t.b.d.)” with mandates to perform further investigation of P0108, P0198 and related technology, define test conditions for investigating the compression performance, investigate the complexity.


JCTVC-P0292 BoG Report on Color Gamut Scalability (CGS) [A. Duenas]

Discussed in JCT-VC plenary Sunday 01-12 am (JRO & GJS).

This report summarizes the activities of the BoG on color gamut scalability during the 16th JCT-VC meeting. Break out group sessions were held during Friday the 10th and Saturday 11th of January.

This report conveys a number of recommendations from the BoG.

The BoG recommended testing 1x and 2x spatial scalability cases.

From discussion of JCTVC-P0127 two use cases were identified as valuable for technical study of CGS techniques:



  • HEVC HD (1080p50/60) with Rec. ITU-R BT.709 and 8 bits to HEVC UHD-1 (2160p50/60) with Rec. ITU-R BT.2020 and 10 bits.

  • HEVC UHD-1 (2160p50/60) with Rec. ITU-R BT.709 and 10 bits to HEVC UHD-1 (2160p50/60) with Rec. ITU-R BT.2020 and 10 bits. (The resolutions, bit depth and frame rate are the same in both layers. The only differences between the two layers will be just the color representation. In this second use case, the base layer and the enhancement layer will be 10 bits or above.)

The BoG recommended further reviewing contribution JCTVC-P0127 On a CGS profile for SHVC, as it relates to profiling and use cases.

The BoG recommended continuing using



  • The current 1080p (BT.709 and BT.2020) sequences for 1x tests

  • Using 1080p downsampled (with SHVC downsampling) versions of 2160p BT.709 version for the 2x case, with an enhancement layer that is BT.2020.

The source test sequences were generated in the P3 domain as 2160p and then "color graded" to BT.709 and BT.2020 (by Technicolor, see N0163).

Further discussion of some aspects of test conditions was needed.

It was noted that it is important that the content needs to be available for use by all participants for developing, analyzing and reporting results of technical approaches. A new version of the Technolor terms was later provided (see P0292). It was asserted by a Technicolor representative that these terms allow development of technology considered for contribution as well as evaluation of actual contributions.

Based on the review of JCTVC-L0440 it was noted that the following items are important to take into account to evaluate complexity of color gamut scalability. The BoG recommended to use the following data when analyzing the algorithmic complexity of each of the techniques:



  • Consider the number of multipliers and if they are 8 bits or 16 bits (or any other type). As we are now considering 10 Bits input we should consider the different cost for different types of multipliers. We need to count the number of multipliers and type. It was noted that they may cases where we have a mixed type of operation and this may be affecting some implementations. The BoG recommends that when we do worse case analysis we should consider the different types of operations and those should be reported independently.

  • Reporting the potential sizes of LUT in number of table entries contained on the LUT.

  • Reporting the number of stages and a short summary of each stage (reporting how many passes of the data or pipeline stages are needed). This would capture aspects such as 2D spatial filters applied as part of the colour transformation.

  • Reporting if re-sampling is used when reporting the number of multiplications.

  • Reporting if cross color dependency is being used.

The BoG did not conclude that it was necessary to report the memory access for each of the proposals, although some participants suggested that this should be done.

One example raised in the discussion was whether the transformation would apply before or after an upsampling process, which does not seem to be accounted for in the above.

The BoG recommended that proposals should include descriptions of the encoder optimizations being used.

Other agreed aspects of CE plans were also included in the BoG report.



Further BoG discussion was plannedheld and additionally discussed 01-16 (GJS).
JCTVC-P0290 Joint BoG report on High Level Syntax [J. Boyce]

Discussed in JCT-VC plenary Sunday 01-12 am (JRO & GJS).



The suggested plan for publication for ISO/IEC was described as follows:

  • Edited DAM or FDAM considered issued in April for RExt and MV-HEVC, but not balloted to enable preparation of new FDIS.

  • Edited DAM or FDAM considered issued in July for SHVC, but not balloted.

  • FDIS of new edition issued in July with all three amendments integrated, and balloted.

(Consent in July of full text new edition.)

Decision: The BoG recommended, and the JCT-VC, endorsed, the following actions:

  • Remove profile_ref_minus1 from the VPS extension, from JCTVC-P0048

  • Move video signal information syntax structure earlier in the VPS VUI, from JCTVC-P0076

  • Not signal the sps_max_num_reorder_pics[], sps_max_latency_increase_plus1[], and sps_max_dec_pic_buffering_minus1[] syntax elements in the SPS when nuh_layer_id > 0, from JCTVC-P0155.

  • Add PPS extension type flags for conditional presence of syntax extensions per extension type, aligned with the SPS extension type flags, from JCTVC-P0166. Further align the SPS extension type flags syntax between RExt and MV-HEVC/SHVC.

  • Modification of derivation of variable NumActiveRefLayerPics, subject to review of the revised text by interested experts, from JCTVC-P0079 (confirmed 01-16).

  • Require that end of bitstream NAL unit shall have nuh_layer_id equal to 0, from JCTVC-P0130. Decoders shall allow an end of bitstream NAL unit with nuh_layer_id > 0 to be present, and shall ignore the NAL unit.

  • Add constraint restricting pictures marked as discardable from being present in the temporal or inter-layer RPS, from JCTVC-P0130.

The BoG recommended the following activities take place:

  • To further discuss JCTVC-P0110 in the track to select between two options to enable no default output layer sets (See notes for P0110).

  • Revisit Further discussion of JCTVC-P0262 in the track – see notes elsewhere

  • Side activity launched was requested to consider modifications to the VPS extension to remove unnecessary syntax elements and change syntax elements to ue(v) coding, consistent with abandoning a design goal of avoiding ue(v) decoding in the VPS extension. This was later resolved in the BoG.

  • Side activity launched was requested to classify VPS extension syntax elements per extension(s), to consider per-extension type syntax, including reordering syntax elements to cluster per extension type. This was later resolved in the BoG.

Decision (Ed.): The BoG recommended, and the JCT-VC endorsed, the following suggestions to the editors:

  • Improve or add definitions in the MV-HEVC and SHVC specifications for layer sets, target output layers, output layer sets, and consider adding explanatory notes

  • Delegate to the editors aspects raised by the following contributions: JCTVC-P0052, JCTVC-P0078, JCTVC-P0155, JCTVC-P0181, JCTVC-P0130

Further BoG activity was planned.

Further review of BoG status was held 01-16 (GJS).

The BoG also met 13 Jan.

The BoG recommended the following:



  • Add a flag in VUI to indicate that all IRAP pictures are IDRs and that all layer pictures in an AU are IDR aligned, from JCTVC-P0068 proposal 1. Decision: Adopted.

  • Several minor modifications to the VPS syntax, consistent with eliminating the previous intention to avoid ue(v) parsing in the VPS, as represented in JCTVC-P0306. Decision: Agreed.

  • Several additional questions were suggested to be discussed in the track:

    • Is the VPS VUI extension offset necessary? Should there be a mechanism for additional extension of the VPS extension before the VPS VUI? The syntax provided by Y.-K. Wang in P0307. Decision: Adopt modification in P0307.

    • Is the VPS extension offset necessary in the VPS? If so, how to address the start code emulation issue raised in JCTVC-P0125? Decision: Keep it as a reserved FFFF value.

The BoG also met 14 Jan.

The BoG recommended the following:



  • Add alpha channel information SEI message, from JCTVC-P0123. Decision: Adopt. Constrain the bit depth indicated to be equal to the coded bit depth of the aux picture.

  • Add sub-bitstream property SEI message, from JCTVC-P0204. Decision: Adopt.

  • Change alt output layer flag to be signalled within the loop of output layer sets, from JCTVC-P0300-v2. Decision: Adopt.

The BoG recommended the following:

  • Further discussion of JCTVC-P0133, on recovery point and region refresh information SEI messages. Discussed 01-16 (GJS). Decision: Adopt change to recover point semantics only (-v3). Further study was requested regarding region refresh information SEI message.

  • The HLS BoG requested further work on the text to add further clarification.

  • Review JCTVC-P0261, on pic_struct,which was not reviewed in BoG, since the presenter was unavailable


JCTVC-P0302 BoG report on performance analysis and demos [J. Samuelsson]

A break out group was held on Sunday the 12th of January 2014 to review the documents assigned to categories 3.4.2-3.4.5 in version 2 of the draft JCT-VC meeting report. P0261 was noted to be a technical change proposal, so it was not reviewed in the BoG.

Four contributions were reviewed in the BoG.

P0169 reported on performance benefits obtained using a parallel-processing implementation for SHVC decoding.

P0082 was tutorial information about using the pic_struct syntax coding of interlaced

P0083 contained an analysis comparison of different techniques for coding interlaced material with HEVC (version 1). It contained a comparison against picture-adaptive frame/field coding (which is something not supported in version 1) and sequence-adaptive frame/field coding (which is supported in version 1).

P0158 reported informal subjective video quality comparison results for HEVC versus AVC implemenation.
JCTVC-P0312 BoG report on phase adjustment in SHVC re-sampling process [E. Alshina]

P0XXX Phase alignment (E. Alshina)

Discussed 01-15 (GJS).

It was clarified that the enhancement layer is not envisioned to be switching between frame and field referencing to the base layer on a picture-by-picture basis within a CVS. So the scalability resampling ratio is fixed within a CVS. (At least if the referenced "picture" array is supplied by external means, this does not constrain how that array was coded before it was presented to the enhancement layer for referencing.)

Only the 2:1 case has been tested.

Currently the draft has a cross-layer phase alignment flag at the VPS level to control vertical phase.

The BoG was considering (but had not concluded on discussion of) a four-flag scheme



  • A VPS VUI constraint indicator applying to all layers

  • The cross-layer phase alignment flag

  • A presence flag at the SPS level

  • When present, a vertical phase position flag at the slice header level

Alternatives discussed included having 4 bits for luma and 4 bits for chroma in the PPS (with some gating flag(s)).

For horizontal phase, the same possibilities exist, but there was less interest in having additional flexibility horizontally.

For upsampling ratios other than 2:1, the scheme would not necessarily provide optimal phase behaviour.

Further BoG discussion was plannedheld.

BoG report (r2) reviewed.



Decision: Add the proposed signaling (3 added bits and the previously drafted bit) and its mechanism for phase adjustment described above in to the SHVC specification draft text and next release of SHM s/w. Do not constrain the use of the flags to particular scalability ratios.

It was noted that the vertical aligment of chroma relative to luma built into the scheme corresponds to that used for interlaced fields in the 2:1 case. However, the chroma phase alignment does not seem critical.

Further study was requested to determine whether constraints should be imposed or different syntax should be used.


Yüklə 1,12 Mb.

Dostları ilə paylaş:
1   ...   12   13   14   15   16   17   18   19   20




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©muhaz.org 2024
rəhbərliyinə müraciət

gir | qeydiyyatdan keç
    Ana səhifə


yükləyin