International Organisation for Standardisation / Organisation Internationale de Normalisation




Test Model Development


4.2.0.1.1.1.1.1.58 m28125 CDVS: Test Model code maintenance [Massimo Balestri, Gianluca Francini, Skjalg Lepsoy, Andrea Varesio]

(Chaired and recorded by Danilo Pau)



The contribution examined TM4 in terms of portability and performance criteria. A problem was reported in the annotations for 1b and 1c, and the execution-speed measurements for extraction and retrieval were not finished. The performance achieved thus far was presented:

  • For extraction, one sort method is dropped in S mode so that only sort method 1 is used, which speeds up extraction.

  • PW for some queries was slow. Time had been spent inside the find-overlap function, which computes the Jaccard index (ratio) of the projected bbox and the annotated bbox. The Jaccard index was computed pixel by pixel, iterating over every pixel whether it was inside or outside a bbox, and about 80% of the time was spent inside this function. In addition, the projected bbox, and therefore the union of the annotated and projected bboxes, can lie outside the query image, so the pixel-based computation produced only an approximated Jaccard index. The pixel-by-pixel computation was replaced with a more efficient implementation: the union and intersection are now computed from the areas of the bboxes, and the intersection formula was simplified. The results were obtained in a fraction of the time and were more accurate.

  • Localization accuracy computed with TM4 was reported, showing a drop in 1a, 1b and 1c. It was initially not clear why the drop differed among them. Some annotations (in 1a and 1b) had been wrongly corrected for scaling; with this fixed, the localization results are more reasonable and show the same drop in 1a, 1b and 1c. The drop was due to wrong annotations. The contribution proposes corrected annotations.

  • Portability was an issue on the Windows OS. The CDVS exceptions class was derived from the std exception class so that both std and CDVS-specific exceptions can be caught. This is more portable on Windows. Another problem relates to porting the CDVS code to Android because of the Yale lib (a C library that depends on LAPACK and BLAS): LAPACK is written in Fortran and is poorly supported on Android. The contribution replaced Yale with Eigen, which is much easier to port, and reimplemented SCVFIndex. This gave the same results and was much easier to port to Android.

  • Memory is an issue in the arithmetic coding class. Two methods were used to allocate memory, and in one of the two cases allocation may fail. The contributor decided to use a new, more correct approach: copy constructors in DBlist.

  • Other minor issues: a small arithmetic coding test, needed to pass basic unit tests.

  • Perl scripts were provided, but they are inconsistent: some print what they do and some do not.

  • In some parts, admin privileges are required.

Conclusion: Create a maintenance branch without affecting the main trunk and adopt it into TM5.0 without changing results. Provide a switch to compute either the approximated or the actual Jaccard index, and correct the annotations for 1a, 1b and 1c. It was proposed to insert these changes first during TM5 integration. The group agreed to incorporate the proposed changes.
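The area-based Jaccard computation discussed for m28125 can be illustrated with a minimal sketch. It assumes that the annotated and projected bboxes are convex quadrilaterals given as lists of corner points, and it uses Sutherland-Hodgman clipping plus the shoelace formula. These choices and all names (`jaccard_index`, `clip_convex`, `polygon_area`) are illustrative assumptions, not the TM code. Because only corner coordinates are used, a bbox that extends outside the query image needs no special handling.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def signed_area(poly: List[Point]) -> float:
    """Shoelace formula; positive for counter-clockwise vertex order."""
    total = 0.0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        total += x1 * y2 - x2 * y1
    return total / 2.0


def polygon_area(poly: List[Point]) -> float:
    return abs(signed_area(poly))


def clip_convex(subject: List[Point], clipper: List[Point]) -> List[Point]:
    """Sutherland-Hodgman: clip `subject` against a convex `clipper`."""
    if signed_area(clipper) < 0:          # normalise to counter-clockwise
        clipper = clipper[::-1]

    def inside(p: Point, a: Point, b: Point) -> bool:
        # True if p lies on or to the left of the directed edge a -> b.
        return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0]) >= 0

    def intersect(p: Point, q: Point, a: Point, b: Point) -> Point:
        # Intersection of line p-q with line a-b (never parallel when called).
        d1 = (q[0] - p[0], q[1] - p[1])
        d2 = (b[0] - a[0], b[1] - a[1])
        denom = d1[0] * d2[1] - d1[1] * d2[0]
        t = ((a[0] - p[0]) * d2[1] - (a[1] - p[1]) * d2[0]) / denom
        return (p[0] + t * d1[0], p[1] + t * d1[1])

    output = list(subject)
    for i in range(len(clipper)):
        a, b = clipper[i], clipper[(i + 1) % len(clipper)]
        current, output = output, []
        if not current:
            break
        s = current[-1]
        for e in current:
            if inside(e, a, b):
                if not inside(s, a, b):
                    output.append(intersect(s, e, a, b))
                output.append(e)
            elif inside(s, a, b):
                output.append(intersect(s, e, a, b))
            s = e
    return output


def jaccard_index(box_a: List[Point], box_b: List[Point]) -> float:
    """Intersection area divided by union area of two convex boxes."""
    inter = polygon_area(clip_convex(box_a, box_b))
    union = polygon_area(box_a) + polygon_area(box_b) - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    annotated = [(0, 0), (2, 0), (2, 2), (0, 2)]
    projected = [(-1, -1), (1, -1), (1, 1), (-1, 1)]   # partly outside the image
    print(jaccard_index(annotated, projected))
```

The cost depends only on the number of corners, not on the image size, which is the reason an area-based formulation can be much faster than a per-pixel loop.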

    1. TM Implementation


4.2.0.1.1.1.1.1.59 m28296 Stripe Based DoG Hardware: performance evaluation in the TM4.0 [Mario Vigillar, Gian Domenico Licciardo, Danilo Pau, Arcangelo Bruna, Claudio Parella]

This contribution proposed a stripe-based DoG that avoids full-frame buffering. The total buffer size is 512 kBytes at 14 bpp, in a 28 nm (low leakage) design running at 90 fps @ 333 MHz. Performance was good, with only a very slight (insignificant) drop.
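The idea of stripe-based processing can be sketched in software. The example below is an assumption-laden illustration, not the hardware design of m28296: it computes a difference of Gaussians on horizontal stripes with a halo of `radius` rows above and below each stripe, and shows that this matches the full-frame result. The function names, the sigma values, the edge-clamping rule and the stripe height are all hypothetical.

```python
import math
from typing import List

Image = List[List[float]]


def gaussian_kernel(sigma: float, radius: int) -> List[float]:
    """Normalised 1-D Gaussian taps for offsets -radius..radius."""
    taps = [math.exp(-(i * i) / (2.0 * sigma * sigma)) for i in range(-radius, radius + 1)]
    total = sum(taps)
    return [t / total for t in taps]


def blur(image: Image, kernel: List[float]) -> Image:
    """Separable Gaussian blur with edge clamping (horizontal, then vertical)."""
    r = len(kernel) // 2
    h, w = len(image), len(image[0])
    tmp = [[sum(kernel[k + r] * row[min(max(x + k, 0), w - 1)] for k in range(-r, r + 1))
            for x in range(w)] for row in image]
    return [[sum(kernel[k + r] * tmp[min(max(y + k, 0), h - 1)][x] for k in range(-r, r + 1))
             for x in range(w)] for y in range(h)]


def dog(image: Image, sigma1: float, sigma2: float, radius: int) -> Image:
    """Difference of two Gaussian-blurred copies of the image."""
    a = blur(image, gaussian_kernel(sigma1, radius))
    b = blur(image, gaussian_kernel(sigma2, radius))
    return [[b[y][x] - a[y][x] for x in range(len(image[0]))] for y in range(len(image))]


def dog_striped(image: Image, sigma1: float, sigma2: float,
                radius: int, stripe_rows: int) -> Image:
    """Same DoG, but only stripe_rows + 2 * radius rows are held at a time."""
    h = len(image)
    out: Image = []
    for top in range(0, h, stripe_rows):
        bottom = min(top + stripe_rows, h)
        lo = max(top - radius, 0)        # halo rows above the stripe
        hi = min(bottom + radius, h)     # halo rows below the stripe
        stripe_result = dog(image[lo:hi], sigma1, sigma2, radius)
        out.extend(stripe_result[top - lo: top - lo + (bottom - top)])
    return out


if __name__ == "__main__":
    img = [[float((x * 31 + y * 17) % 23) for x in range(7)] for y in range(10)]
    full = dog(img, 1.0, 1.6, 3)
    striped = dog_striped(img, 1.0, 1.6, 3, stripe_rows=4)
    worst = max(abs(f - s) for fr, sr in zip(full, striped) for f, s in zip(fr, sr))
    print("max abs difference:", worst)
```

In this sketch the working buffer scales with the stripe height plus the filter halo rather than with the frame height, which is the property the contribution exploits.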


TM Software Integration Plan

  1. CE1- Global Descriptor and CE6 – retrieval code optimization – PKU – by 30 Jan

  2. CE2 & CE3 – by 5 Feb

  3. Test results (set 1) generated: Linux (VA) + Windows (PKU)

  4. Code cleanup (main path) & bug fixes – TI – by 12 Feb.

  5. Test results generated (Set 2): Linux (TI,VA) + Windows (PKU,VA).

TM and anchors were planned to be released on 15 February 2013.

    1. CDVS Development and Requirements


4.2.0.1.1.1.1.1.60 m28232 Additional requirements on CDVS in Video Applications [Sabino Metta, Alberto Messina]

This proposal presented possible requirements derived from extending CDVS use cases to video content.

All such requirements (combined) should be submitted to the Requirements group for consideration.

    1. WD and TM Development


New documents: WD 3 (N13334) and TM5 (N13335)

Changes:


  • Global descriptor tool revised – second order Fisher kernels added for 4 kB and above

  • Only one mode (transform domain representation and scalar quant – VQ removed)

  • one minor change in location coding

  • simplification in arithmetic coding (minor change)

Code optimization (only software change)

Update to accuracy measure of localization (slightly worse results, but believed to be more reliable)

Software and anchor results were to be available Feb. 15


    1. CEs


The following set of CEs was decided to be performed by the next meeting (see N13336):

  • CE1: Global descriptors (continuing)

  • CE2: Key point detection (previous CE4)
  1. Internet Video Coding


A breakout group on IVC was active during the week to perform the actions regarding the following:

  • Assessment of tools proposed on top of ITM3. All proposed tools were recommended to be included in ITM4 (N13353)

  • Performance of ITM (with new tools) versus AVC anchor and newly proposed alternative (VP8) in terms of SNR & visual performance.

Regarding the alternative method, which had not originally been proposed in the CfP, it was suggested that a more detailed analysis should be performed for one more meeting cycle, with a decision for actions to be made at the next meeting. Besides the performance, the general status in terms of licensing, conditions of transfer to MPEG, software, and copyright requires clarification and better understanding.

To achieve the appropriate results in the comparison of ITM4 versus VP8 in an open process during the subsequent AHG period, CE1 (N13354) was established, which includes plans for the codec settings, timelines, etc.

Another CE, identified as CE2 (N13355), was set up to investigate technology for potential further improvements of ITM4 (not to be used in CE1).

As part of the breakout activity, an expert viewing was performed on Thursday afternoon. By viewing VP8 and AVC anchor (from the CfP) results, it was confirmed that the data rate points planned for CE1 would be suitable for the formal subjective tests that are planned to be performed prior to the last meeting. Because no decoded sequences were available for either ITM3 or ITM3 with the tools newly integrated in ITM4, no preliminary comparison involving the ITM codec was possible.

Some further information regarding the licensing situation was provided during the video closing plenary (see under section 12)

4.2.0.1.1.1.1.1.61 m27964 Adaptive Non-Reference P Optimization for Internet Video Coding [Xianguo Zhang, Long Zhao, Hao Lv, Ronggang Wang, Siwei Ma, Tiejun Huang, Wen Gao]

From BoG:


  • Presented on Monday. From the abstract: “The Internet Video Codec ITM 3.0 described in N13162 is further optimized. We propose to optimize the non-reference P coding in form of more accurately and adaptively detecting which frames should encoded as a non-reference P frame.” An improvement of 0.8% was reported, but no complexity-related data was provided. It was recommended to adopt this into the encoder.

  • We need a common test conditions document first before deciding on encoder optimizations. A draft has been developed (in the room).

In the Video plenary, it was asked whether subjective quality had been checked. With too drastic a change of QP over the levels, there could be subjective degradation.

4.2.0.1.1.1.1.1.62 m28161 cross-check of m27964 [Weiran Li]


4.2.0.1.1.1.1.1.63 m27965 A direction and distance based motion vector prediction method for internet video coding [Hao Lv, Ronggang Wang, Lei Chen, Xianguo Zhang, Siwei Ma, Tiejun Huang, Wen Gao]

From BoG:



  • Different motion vector estimation, increased number of blocks used.

  • There was an opinion from the room that this seems to be similar to what is used in VC-1.

  • No information regarding performance was provided when multiple slices are used, but there are no test conditions defined that include slices – thus, we should define such a testing case for the common test conditions.

It was suggested that delegates who bring questions regarding IPR on a proposal should provide the basis for that question (as in some evidence that leads them to question the IPR statement made by the proponents).

It was suggested to draft a resolution requesting review of the IVC ITM for IPR ownership of the technology used.

The BoG recommended to adopt this into the ITM after proposed text has been reviewed.
From video plenary:

It was confirmed that the prediction is used for coding, but there is no signalling of candidates (rather, rules are established about selecting two candidates for computing an average).

One suggestion was given that the operation suggested to compute the prediction (based on comparing candidates against each other) might be similar to median computation, the latter being potentially critical w.r.t. IPR.
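To make the comparison in the remark above concrete, the following sketch contrasts a component-wise median of three motion vector candidates with a rule that averages the two candidates closest to each other. The actual selection rules of m27965 are not given in these notes, so `closest_pair_average` is a hypothetical stand-in; the candidate values and the L1 distance are assumptions made only for illustration.

```python
from itertools import combinations
from typing import List, Tuple

MV = Tuple[int, int]


def median_predictor(cands: List[MV]) -> MV:
    """Component-wise median of exactly three candidate motion vectors."""
    assert len(cands) == 3
    xs = sorted(c[0] for c in cands)
    ys = sorted(c[1] for c in cands)
    return (xs[1], ys[1])


def closest_pair_average(cands: List[MV]) -> MV:
    """Hypothetical rule: average the two candidates with the smallest L1 distance."""
    a, b = min(
        combinations(cands, 2),
        key=lambda pair: abs(pair[0][0] - pair[1][0]) + abs(pair[0][1] - pair[1][1]),
    )
    # Floor division keeps the result on the integer grid (rounds toward -inf).
    return ((a[0] + b[0]) // 2, (a[1] + b[1]) // 2)


if __name__ == "__main__":
    candidates = [(4, 0), (6, 2), (20, -8)]   # made-up neighbouring block vectors
    print(median_predictor(candidates))
    print(closest_pair_average(candidates))
```

For three scalar values, the closest pair always contains the median value, which is one way such a comparison-based rule can resemble a median. This is an observation about the sketch only, not a statement about the proposal or its IPR status.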

4.2.0.1.1.1.1.1.64 m28162 cross-check of m27965 [Weiran Li]


4.2.0.1.1.1.1.1.65 m27966 New B frame encoding technology for internet video coding [Lei Chen, Ronggang Wang, Hao Lv, Xianguo Zhang, Siwei Ma, Tiejun Huang, Wen Gao]

From BoG:

This was a proposal to remove B_direct_16x16 and replace B_direct_8x8 with B_skip_8x8. It has a small gain of 0.3%.

Discussion: this is done because other modes seem to cover the test sequences sufficiently. Reducing the modes reduces complexity of the codec.

The BoG recommended to adopt into the ITM after proposed text has been reviewed.
4.2.0.1.1.1.1.1.66 m28163 cross-check of m27966 [Weiran Li]
4.2.0.1.1.1.1.1.67 m27968 Transform and quantization method for internet video coding [Ronggang Wang, Lei Chen, Hao Lv, Xianguo Zhang, Siwei Ma, Tiejun Huang, Wen Gao]

From BoG:

Different DCT formulation, based on Chen IDCT.

There was an objection to adopting this proposal, with a request to require the frequency response of the proposed transform before adoption (dropped).

Proponents report that the motivation for this is that the ITM 3.0 transform has patent issues, so they went back to ITM 2.0 transform. There was an agreement to have informal viewing, but it was suggested that we probably have to adopt because of the patent issues in any case. Experts reported that the ITM 2.0 transform produced visual artefacts and that needs to be checked again. The BoG recommended to adopt this conditioned on the informal viewing results (no visual artefacts).
4.2.0.1.1.1.1.1.68 m28164 cross-check of m27968 [Weiran Li]
4.2.0.1.1.1.1.1.69 m28187 Proposed Internet Video Coding Test Model (ITM) Version 4.0 (VP8 compatible) [Mohamad Raad, Lazar Bivolarski, Harald Alvestrand]

Discussed in conjunction with m28182, see the notes in section below.


4.2.0.1.1.1.1.1.70 m28182 Comparison of the VP8 video codec and the current IVC Test Model [Lazar Bivolarsky, Mohammed Raad, Harald Alvestrand]

Notes from the BoG (chaired and recorded by R. Wang):

a. Comparison of VP8 to current IVC test model.

b. Compares VP8 encoder version 0.9.6 to ITM 3.0.

c. Discusses the reportedly wide use and adoption of VP8 in the industry.

d. Reports that VP8 (IPPP) has approx. 24% improvement over IVC ITM 3.0 (IPPP).

e. Reports that VP8 (IPPP) has approx. 6% improvement over IVC ITM when used with IBBP.

f. Recommends replacing the current ITM with VP8 (as described in m28187).

i. Conditioned on the above, the authors recommended moving IVC to WD stage.

ii. Prof. Yun He raised a concern regarding the patent situation of VP8.

1. The contribution states that the proponent is willing to license the proposed technology under Type 1 licensing.

iii. There was interest from the group to look at the test scripts and visual results.

Discussion was chaired by Ronggang Wang. Notes from Ronggang follow:

Questions from the experts and Answers from the proponents:

Q: Is there cross-checking for the performance of VP8?

A: No cross-checking now.

Q: Is there any text guidance for the VP8 encoder?

A: No


Q: Where's the performance gain from? Please try to identify the key technologies.

A: Entropy coding, segment organization, ...

Q: If adopted, can it be changed in the future?

Q: If it can't be changed, VP8 should go to ISO to be stamped as a standard, rather than being contributed to IVC.

Q: Are there any other third-party techniques in VP8?

A: Not sure.

Q: If adopted and some modules are changed in the future, will Google still license its IP for free?

A: Not sure; Google's lawyers need to be asked.

Q: IPR analyses are required.

Q: Verification of the performance gains needs to be done before the decision of changing platform.

Q: What’s the motion vector prediction method in VP8?

Q: Detailed experiment results?

Q: Is there rate control in the testing?

A: Yes, VBR rate-control is used.


Comments from experts:

(1) The proponents shall provide a cross-checking input document from a third party.

(2) A VP8 encoder description document is required to help test participants understand the source code of VP8.

(3) Rate control shall not be used for performance testing.

(4) Technical details of the VP8 codec should be presented by the proponents, rather than just providing a specification of VP8.

(5) The IPRs in VP8 not owned by Google should be carefully checked.


In further review, the points were not agreed upon by other experts.
4.2.0.1.1.1.1.1.71 m28184 New IVC Test Sequences and AVC anchors [Lazar Bivolarsky]

Notes from BoG:

New sequences were proposed.

i. We will need a common test set document.

ii. Bitstream generation was to be checked during the week.

iii. It was recommended to have this done by the end of the week.

It was recommended to wait for IVC experts to check the new sequences during the week. It was recommended to adopt the new sequences if no issues were raised during this week.

In later review, it was agreed, after subjective viewing, that the new sequences as well as the rate points of the previous 720p / videoconferencing-type sequences are suitable. Therefore, they are included in CE1.


4.2.0.1.1.1.1.1.72 m28107 Update on MPEG-IETF Coordination on Royalty Free Coding [Gerard Fernando, Rob Glidden]

Not presented (as it was not obvious that it should be handled by the BoG).


Other BoG activity (Monday and Tuesday):

The BoG suggested a set of test conditions to use when testing VP8 vs. the current test model. The BoG was in agreement to plan for testing two different models and that this test will probably take place between now and the next meeting. Proponents for either technology had been asked to provide the configurations for each codec. Since we have the source code available to all members, all members have been invited to generate bitstreams for both codecs (to ensure that there is consistency).

An informal expert viewing test was being organized with the help of the test chair (Vittorio) so that we could collect more performance-related information at this meeting, but it was understood that we may not get the time to fully complete this. A formal subjective test performed by Vittorio between the meetings was understood to potentially be feasible.

The informal viewing took place in room T072 from 14:00 on Thursday the 24th of January. It was agreed that the new class D sequences provided in m28184 should be used instead of the class D sequences used in the IVC CfP.


There was discussion in IVC breakout Thursday morning on further progressing (chaired and recorded by Jens-Rainer Ohm), as follows:

Establishment of CE1: Investigation of overall compression performance

Description shall include:


  • Set of sequences, bit rates, additional constraints to be obeyed

  • Encoder settings of ITM4 and VP8 (if further investigation on QP ranges is necessary, this should be clearly expressed, and a date by which these parameters are to be determined must be provided)

  • ITM4: ITM3 + 4 new adopted proposals (27964, 27965, 27966, 27968)

  • VP8: Refer to 28187

  • (*) To be decided: Formal subjective tests ahead of meeting or expert viewing tests during meeting

  • Depending on (*): timeline

(It was later confirmed together with the Test SG chair that it is possible to run formal subjective tests prior to the next meeting. This is reflected in the CE1 plan and in the AHG mandates.)

Some further discussion took place on the usage of alternative reference pictures in VP8. The contributors of m28187 confirmed that disabling multi-pass coding also disables usage of that feature. One other independent expert who had studied m28187 also confirmed that this is the case.

There is also the “golden frame” feature, which is a long-term reference picture. It was remarked by some experts that this may violate the random access constraint. It was, however, confirmed by the contributors of m28187 that this “GF” is reset whenever a RA position (keyframe) appears.

The contributors of m28187 also announced that a more precise description of the encoder will be provided before the next meeting.

CE2 (improvements of ITM) was reviewed. It will contain sub-experiments on:


  • Intra prediction,

  • Transform / quantization,

  • Entropy coding,

  • B frame coding,

  • Inter P prediction,

  • In/out loop filtering/processing.



