Organisation internationale de normalisation


Received National Body Comments and Liaison matters



Yüklə 7,54 Mb.
səhifə148/166
tarix03.01.2022
ölçüsü7,54 Mb.
#33742
1   ...   144   145   146   147   148   149   150   151   ...   166

24.7Received National Body Comments and Liaison matters


There were none.

24.8Plenary Discussion


It was the consensus of the Audio Subgroup to affirm the recommendations of the AhG, as recorded in Section 3.1.

25Record of AhG meetings

25.1AhG Meeting on USAC -- Sunday 1000-1800


Takeshi Norimatsu, Panasonic, presented

m19245

Panasonic cross check report on PVC for SBR

Takeshi Norimatsu, Tomokazu Ishikawa, Zhong Haishan, Zhao Dan

USAC-PVC

The contribution reports the results of a listening test. At the operating point of 8 kb/s and differential MOS score analysis, the results showed 3 items better and mean score better at the 95% level of significance.

The document also reports that Panasonic verified the CE decoder software, in that the CE bitstreams decoded exactly to the CE waveforms.

Heiko Purnhagen, Dolby, presented

m19267

Dolby listening test results for CE on PVC for SBR in USAC

Heiko Purnhagen, Kristofer Kjörling

USAC-PVC

The contribution reports the results of a listening test. At the operating point of 8 kb/s and differential MOS score analysis, the results showed 2 items better and mean score better at the 95% level of significance.

Max Neuendorf, FhG, presented



m19316

FhG listening test report on PVC for SBR envelope coding in USAC

Stephan Wilde

USAC-PVC

The contribution reports the results of a listening test. At the operating point of 8 kb/s and differential MOS score analysis, the results showed 4 items better, 2 worse and mean score better at the 95% level of significance.

Toru Chinen, Sony, presented



m19256

Finalization of CE on PVC for SBR

Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka, Hiroyuki Honma, Masayuki Nishiguchi

USAC-PVC

Predictive Vector Coding technology adds an alternave coding scheme for SBR envelope scale factors, one that has greatest performance at the lowest bit rates (i.e. 8 kb/s and 12kb/s).

The technology used in this CE listening test uses adaptive factors in the encoder and decoder, e.g. having the SBR oversampling rate as a parameter rather than fixed as 2. In addition, PVC adds bitstream syntax elements to support SBR sinusoidal coding.

Complexity of PVC typically leads to a .45 WMOPS increase in decoder complexity, or a 9% increase in decoder WMOPS. PVC requires 2070 bytes of ROM, or 2% increase in of decoder ROM.

The contribution reports the results of a Sony listening test. At the operating point of 8 kb/s and absolute MOS score analysis, the results showed that the mean is better. At the operating point of 8 kb/s and differential MOS score analysis, the results showed 7 items better and mean score better.

The contribution reports the results of pooling all listening test. At the operating point of 8 kb/s and differential MOS score analysis, the results showed 10 items better and mean score better. The highest differential scores were 4 MUSHRA points. When comparing consistency across listening test sites, 5 items showed consistent improvement for at least 2 out of the 4 test sites.

Werner Oomen, Philips, requested time to check the pooled data, for example looking at pooling without considering the Sony data.



It was agreed to bring this matter up for further discussion Tuesday after lunch.

Taejin Lee, ETRI, presented



m19214

ETRI listening test result for USAC CE on "Increased structural flexibility in SBR"

Seungkwon Beack, Taejin Lee, Kyeongok Kang

USAC-SBR-SF

The contribution reports the results of a listening test. At the operating point of 8 kb/s mono and absolute MOS score analysis (WD7 – WD7+CE), the results showed 1 item better and mean score better. At the operating point of 8 kb/s mono and differential MOS score analysis (WD7 – WD7+CE), the results showed 10 items better and mean score better.

Eunmi Oh, Samsung, presented



m19235

Crosscheck report on increased structural flexibility in SBR

Miyoung KimEunmi Oh

USAC-SBR-SF

The contribution reports the results of a listening test. At the operating point of 8 kb/s mono and differential MOS score analysis (WD7 – WD7+CE), the results showed no statistically significant differences. Comment from listeners indicated that the CE technology provided an increased bandwidth but with increased distortion.

Samsung also verified CE decoder executable: the CE bitstreams decoded exactly to the CE waveforms used in the listening test.

Max Neuendorf, FhG, presented

m19328

Finalization of CE on increased structural flexibility in SBR

Stephan Wilde, Markus Multrus, Max Neuendorf, Sascha Disch, Frederik Nagel, Kristofer Kjörling, Heiko Purnhagen

USAC-SBR-SF

The contribution reviews the CE technology, which is to permit a 4:1 SBR up-sampling by re-using existing SBR tools to the greatest extent possible. The presenter notes that such a 4:1 system exists in the DRM system. In the 4:1 system the core coder only codes the signal from the 16 lowest subband filters (as opposed to the lowest 32 for the 2:1 system).

Two complexity comparisons were presented. The first assumes that the core coder runs at the same sampling rate. In this case, at 8 kb/s mono, the complexity of the 4:1 system as compared to the current 2:1 system is 142%, which is due to running the SBR synthesis filterbank at double the sampling rate and also running the FFT harmonic at a higher rate. The second assumes that the SBR synthesis filterbank runs at the same sampling rate. In this case, at 8 kb/s mono, the complexity of the 4:1 system as compared to the current 2:1 system is 73%, which is due to running the core coder and the FFT harmonic transposer at half the sampling rate.

It reports the results of a test at the FhG site, at the operating point of 8 kb/s mono and differential MOS score analysis (WD7 – WD7+CE), the results showed 6 items better and mean better. The results of a test at Dolby site, at the operating point of 8 kb/s mono and differential MOS score analysis (WD7 – WD7+CE), the results showed 5 items better and mean better.

It also reports results for all data pooled, in which 9 items are better and mean better. Three items are better at the level of 6 MUSHRA points. When looking at consistency across all test sites, 8 items are reported better for at least 2 of the 4 sites and also show improvement in the mean for the pooled data.

There was some discussion, and it was noted that the two complexity comparisons show that the 4:1 SBR tool provides a “middle ground” between 2:1 system at low output sampling rate and hence low audio bandwidth (but reasonable audio quality at 8 kb/s) and the 2:1 system at high output sampling rate and hence high audio bandwidth(but most likely very poor audio quality at 8 kb/s).

David Virette, Huawei, asked whether the consistency of just cross-check sites could be presented. Heiko Purnhagen, Dolby, had done those calculations and presented that data as both a table of “items better” and as MUSHRA differential score plots. The Chair noted that some items show improvement of 6 or even 8 MUSHRA points.



It was the consensus of the AhG to recommend to the Audio Subgroup to incorporate this CE technology into the USAC DIS.

Takeshi Norimatsu, Panasonic, presented



m19248

Panasonic listening test report on increased structural flexibility in SBR with PVC

Takeshi Norimatsu, Tomokazu Ishikawa, Zhong Haishan, Zhao Dan,

USAC-PVC-SF

The contribution reports the results of a listening test. At the operating point of 8 kb/s mono and differential MOS score analysis (WD7+2:1SBR+PCV – WD7+4:1SBR+PVC), the results showed differences at the 95% level of significance. It also reported on the number of listeners who preferred one system to the other (or vice versa) for each item. This preference was nearly evenly split amongst the listeners for every item, suggesting that the true preference distribution may be bi-modal due to strong individual preferences.

Toru Chinen, Sony, presented



m19329

Combination of the USAC Core Experiments on PVC for SBR Envelope Coding and Increased Structural Flexibility in SBR

Stephan Wilde, Markus Multrus, Max Neuendorf, Kristofer Kjörling, Heiko Purnhagen, Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka, Hiroyuki Honma, Masayuki Nishiguchi

USAC-PVC-SF

The contribution reviews the motiviation for the CE: that it should be checked that PVC also works with the proposed 4:1 SBR configuration. Note that for this CE, the parameterization of PVC factors is used (e.g. RATE). A favourable outcome would indicate that PVC can be used in combination with 4:1 SBR without any negative interaction.

It noted that there is negligible increase in complexity for combining PVC with the 4:1 SBR system. Any increase in complexity is due to running PVC over 48 rather than 32 subbands in the high frequency generation.

The contribution reports the results of pooling the Sony, FhG and Dolby listening test data. At the operating point of 8 kb/s mono and absolute MOS score analysis (WD7+2:1SBR+PCV – WD7+4:1SBR+PVC), the results showed the mean better at the 95% level of significance At the operating point of 8 kb/s mono and differential MOS score analysis (WD7+2:1SBR+PCV – WD7+4:1SBR+PVC), the results showed 14 items better and mean better at the 95% level of significance.

When Sony, FhG, Dolby and Panasonic listening test data are pooled, the results show, for absolute score analysis, the mean is better and for differential score analysis, 8 items better and mean better.

David Virette, Huawei, presented

m19263

Report on cross-check listening test for the CE on Lower-Complexity Decorrelator for USAC

David Virette

USAC-LC-DCOR

The contribution reports the results of a listening test. At the operating point of 32 kb/s stereo and differential MOS score analysis, the results showed no differences at the 95% level of significance.

Philippe Gournay, VoiceAge, presented



m19346

VoiceAge Report on Cross-check Listening Test and Software Verification for the Core Experiment on Low-Complexity Decorrelator

Philippe Gournay, Roch Lefebvre

USAC-LC-DCOR

The contribution reports the results of a listening test. At the operating point of 16 kb/s stereo and differential MOS score analysis, the results showed 1 item better at the 95% level of significance. At the operating point of 32 kb/s stereo and differential MOS score analysis, the results showed no differences at the 95% level of significance.

In addition, it verifies that the CE decoder executable was able to decode the CE bitstreams and produce the CE waveforms to within +/- 1 lsb.

Julien Robilliard, FhG, presented

m19314

Finalization of CE proposal on lower-complexity decorrelator in USAC

Julien Robilliard, Matthias Neusinger, Johannes Hilpert, Erik Schuijers, Bert den Brinker, Werner Oomen

USAC-LC-DCOR

The contribution reviewed the CE technology: The decorrelator is comprised of three stages:


  • Pre-delays

  • Lattice all-pass filter

  • Energy adjustment

The CE proposes the following changes:

  • reduce the length of the all-pass filter by a factor of 2

  • compensate the delay accordingly

  • eliminate the fractional delay in the filter (which requires complex filter coefficients)

The CE complexity is


Yüklə 7,54 Mb.

Dostları ilə paylaş:
1   ...   144   145   146   147   148   149   150   151   ...   166




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©muhaz.org 2024
rəhbərliyinə müraciət

gir | qeydiyyatdan keç
    Ana səhifə


yükləyin