Annex A-V: Audio Task Groups -
MPEG Audio FAQ/Web Page - Thom
-
13818-7 AAC Corrigendum - Johnston
-
MPEG Audio - Preparation of press statement - Meares
-
MPEG-4 Audio/Systems Issues - Paley
-
MPEG-2 AAC Conformance - Paley
-
MPEG-2 AAC Technical Report - Paley
-
MPEG-4 Verification Tests - Kim/Contin/Edler
-
MPEG-4 FCD study - Edler, Grill, Lee, Nishiguchi, Lee, Vaananen
-
MPEG-4 Reference Software Study - Purnhagen
-
MPEG-4 Error resilience - Dietz
-
MPEG-4 Conformance - Spille
-
MPEG-4 Profiles and levels - Brandenburg/Edler
-
TTSI/FBA timing and bitstream definition - Lee
-
MPEG-4 SA issues & MMA alignment - Ray
-
Subjective alignment of speech items - Ojala
-
Review of MPEG-4 Overview - Thom
Annex A-VI: Input/Output Documentation Contributed documents
The following documents were contributed to the Audio Subgroup and were considered during this meeting:
Number
|
Source
|
Title
|
3513
|
Pete Schirling
|
Document Register for 44th Meeting in Dublin, Ireland
|
3515
|
ITU-T SG 12 via the SC 29 Secretariat
|
Liaison Statement from ITU-T SG 12 on MPEG-4 Audio Test (SC 29 N 2501)
|
3518
|
SC 29 Secretariat
|
Summary of Voting on ISO/IEC 13818-4/FPDAM 1 SC 29 N 2521
|
3529
|
JTC 1 Secretariat
|
Summary of Voting on ISO/IEC TR 13818-5/DAM 1: Information technology -- Generic coding of moving pictures and associated audio information -- Part 5: Software simulation, AMENDMENT 1: Advanced Audio Coding (AAC) SC 29 N 2557, JTC 1 N 5291
|
3562
|
NB of Japan via the SC 29 Secretariat
|
Late Comments on ISO/IEC 13818-7/DCOR 1 SC 29 N 2620
|
3563
|
NB of Japan via the SC 29 Secretariat
|
Late Comments on ISO/IEC TR 13818-5/DAM 1 SC 29 N 2621
|
3565
|
David Meares
|
Bibliography of WG11 Input Documents: m0551 to m3511, December 1995 to March 1998
|
3566
|
David Meares
|
Bibliography of WG11 Output Documents: N0965 to N2247, July 1995 to March 1998
|
3578
|
Peter G. Schreiner III
|
Report of the Ad-Hoc Group on AAC Dynamic Range Control
|
3584
|
Hiroyuki Fukuchi
|
Brief Report of AAC Conformance Testing (SSR profile)
|
3589
|
David Thom, Heiko Purnhagen
|
Ad-Hoc group report on Audio Web Page activity
|
3598
|
Laura Contin
|
List of audio, video and audiovisual excerpts that have been released to MPEG
|
3601
|
Giorgio Zoia
|
Proposed revisions to FCD 14496-3 Subpart 5
|
3602
|
Giorgio Zoia
|
A method for complexity measurements in Structured Audio
|
3605
|
Jens Spille
|
Report of Ad-Hoc Group on MPEG-4 Audio Tools Complexity
|
3606
|
Jens Spille
|
Report of Ad Hoc Group on MPEG-4 Audio Conformance
|
3609
|
Structured Audio AHG, MIDI Manufacturers Association
|
Revised Structured Audio Sample Bank Format
|
3610
|
Eric Scheirer, Lee Ray
|
Report of AHG on Structured Audio
|
3611
|
Eric Scheirer
|
A method different than Giorgio's for complexity measurements in SA
|
3626
|
Youngjik Lee, Jeorn Ostermann
|
AHG Report on TTS/FBA Convergence
|
3627
|
Youngjik Lee, Jung-Chul Lee, Hang-Seop Lee
|
MPEG-4 Audio Markup TTS
|
3668
|
Mike Coleman, Eric Scheirer, Carsten Herpel
|
Report of the Ad-hoc group on MPEG-4 Audio/Systems issues
|
3669
|
Mike Coleman, Chuck Lueck, Mark Paley, David Thom
|
Report of Ad-Hoc group on AAC technical report & conformance
|
3680
|
Young-Kwon Lim, Youngjik Lee
|
On the MPEG-4 TTS application with FA
|
3684
|
The National Body of Japan
|
Comments on MPEG-4 Audio
|
3686
|
Takehiro Moriya, Akio Jin, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa
|
Reports on the AAC-TwinVQ convergence work
|
3692
|
The National Body of Korea
|
A study on the FCD 14496-3 Audio and FCD 14496-5 Software
|
3698
|
Yuji Maeda, Masayuki Nishiguchi
|
EP tool version 0.5 with parametric speech coder
|
3700
|
Yuji Maeda, Masayuki Nishiguchi, Akira Inoue
|
Listening test results of CELP coder
|
3713
|
Toshiro Kawahara, Sanae Hotani, Takashi Suzuki, Toshio Miki
|
Core Experiment of Common EP tool for MPEG-4 Audio error resilience
|
3716
|
Giorgio Zoia, Ulrich Horbach
|
Proposal for new SAOL core opcodes for high quality equalizing and dynamic processing
|
3730
|
The National Body of Japan
|
On the wording of the copyright disclaimer
|
3741
|
Swedish National Body
|
Comments on the use of software in the normative parts of MPEG4
|
3744
|
Naoya Tanaka
|
A proposal to handle PICOLA speed change tool in the Audio profiles
|
3745
|
Heiko Purnhagen, Bernd Edler
|
A Study of Parametric Audio FCD and HILN Scalability
|
3750
|
Naoya Tanaka
|
Listening test results of optimized MPEG-4 CELP coders
|
3754
|
Akihiko Sugiyama
|
Report of the Ad-Hoc group on MPEG-4 Audio CELP optimization
|
3756
|
Toshiyuki Nomura, Masahiro Iwadare
|
Listening test results of MPEG-4 Audio CELP
|
3758
|
Gael RICHARD, Ariane LEDORE, Philip LOCKWOOD
|
Comparative test results for speech coders (MPEG4 CELPs, G723.1, Scalable coder based on G723.1)
|
3759
|
Akira Inoue, Masayuki Nishiguchi
|
Proposed Conformance Testing Procedures on Noise Component Generator for Parametric Speech Coder(HVXC)
|
3761
|
Bernhard Grill, Heiko Purnhagen
|
Report of the AHG on MPEG-4 Audio FCD and Reference Software FCD progression
|
3763
|
Ralf Funken, Werner Oomen, Frans de Bont
|
Results of an informal listening test assessing the quality of MPEG-4 Wideband CELP with an optimized VQ w.r.t. the MPEG-4 Audio VM
|
3766
|
Ralf Funken, Werner Oomen, Frans de Bont
|
Results of an informal listening test assessing the quality of a modified MPEG-4 Narrowband CELP codec w.r.t. the MPEG-4 Audio VM
|
3783
|
C. Sibade, S. Weisse, A. Ledore, G. Richard
|
MPEG4 Audio demonstrator
|
3795
|
Martin Dietz, Laura Contin, Jean-Bernard Rault
|
Report of the Ad-Hoc Group on MPEG-4 narrowband audio broadcasting verification tests
|
3796
|
Catherine Colomes, Caroline Jacobson, Eric Scheirer, Laura Contin, Jean-Bernard Rault, Martin Dietz
|
Report on the NADIB verification tests
|
3797
|
Martin Dietz, Toshio Miki
|
Ad-hoc group on MPEG-4 Audio error resilience for Version 2
|
3798
|
Martin Dietz, Roland Bitto
|
Proposal for correction of AAC conformance test procedure
|
3820
|
Sang-Wook Kim, Bernd Edler
|
Report of the Ad-Hoc group on MPEG-4 Audio verification tests
|
3823
|
Joern Ostermann, Yao Wang
|
Bookmarks for TTS-FBA Synchronization
|
3825
|
Sang-Wook Kim, Bernd Edler
|
Report of the Ad-Hoc Group on MPEG-4 Audio Verification tests
|
3828
|
Sang-Wook Kim,, Bernd Edler
|
List of selected items for the MPEG-4 Audio verification test: Music on Internet
|
3830
|
S.-W. Kim (Samsung),, M. Lindqvist (Ericsson),, M. Nishiguchi (Sony)
|
List of selected items for the MPEG-4 Audio Speech verification test
|
3831
|
Sang-Wook Kim
|
Perfect AM coding results check for verification test on MPEG-4 narrowband Audio broadcasting
|
3841
|
US NB via the SC 29 Secretariat
|
Late Comments on ISO/IEC 13818-7/DCOR 1 (SC 29 N 2637)
|
3847
|
Y. Takamizawa, M. Iwadare
|
Source Codes for MPEG-2/AAC pulse coding
|
3855
|
H. Fukuchi, M. Iwadare
|
Bug Fix in MPEG-2/AAC TR
|
Output Documents
The following output documents were produced in whole or part by the Audio Subgroup. Those shown in Italics were approved for public release.
number
|
Title
|
|
Responses to National Body papers
|
2323
|
MPEG-4 Version 1 Overview (contributions to)
|
2324
|
MPEG-4 Version 2 Overview (contributions to)
|
2257
|
DoC on Conformance 13818-4/FPDAM 1
|
2258
|
Conformance 13818-4/FDAM 1 (covering 13818-7 AAC)
|
2261
|
DoC on Technical Report 13818-5/DAM 1
|
2262
|
Technical Report 13818-5/AMD 1 (covering both 13818-3 Second Edition and 13818-7 AAC)
|
2265
|
DoC on MPEG-2 AAC 13818-7 DCOR 1
|
2266
|
MPEG-2 AAC 13818-7 COR 1
|
2270
|
Workplan for MPEG-2 AAC 13818-7 Dynamic Range Control
|
2271
|
Study on ISO/IEC 14496-3 FDIS
|
2272
|
Study on DoC on MPEG-4 Audio Final Committee Draft 14496-3
|
2273
|
ISO/IEC 14496-4 WD 3 Conformance Testing of the MPEG-4
|
2274
|
Study on MPEG-4 Audio Reference Software FDIS 14496-5
|
2275
|
Study on DoC on MPEG-4 Audio Reference Software Final Committee Draft 14496-5
|
2276
|
MPEG-4 Audio verification test results: narrowband audio broadcasting
|
2277
|
Plan for MPEG-4 Audio verification tests: speech codecs
|
2278
|
Plan for MPEG-4 Audio verification tests: music on Internet
|
2279
|
Prescreening results on MPEG-4 Audio verification test excerpts - Music on Internet
|
2280
|
Information on MPEG-4 Audio systems issues
|
2281
|
Harmonisation of TTS and FBA
|
2282
|
SA complexity tool
|
2283
|
MPEG-4 Audio Version 2 WD
|
2284
|
MPEG-4 Audio error resilience workplan (update of the 44th meeting)
|
2285
|
Identification of source files for watermarking evaluations
|
2286
|
Markup TTS
|
2287
|
MPEG Audio FAQs version 8
|
2288
|
Proposals for the MPEG Audio web site content
|
2289
|
Bibliography of WG11 Input Documents m0551 to m3836, December 1995 to June 1998
|
2290
|
Bibliography of WG11 Output Documents N0965 to N2247, July 1995 to March 1998
|
2298
|
Workplan for AAC Conformance and Technical Report Software
|
2332
|
Report of AAC/Twin VQ convergence work
|
Annex 8
SNHC meeting report
Source: Peter Doenges (Evans & Sutherland), Chair
SNHC Meeting Objectives
The main SNHC objectives for the Dublin meeting were review of the completeness and quality of Version 1 FCD work for Study of FCD, and initial development of conformance contributions to Part 4 based on the improving stability of profile/level work. The main objectives for Version 2 were to move technologies forward in preparation for CD, to continue work on FBA calibration connected with Version 2 profiling, and to advance CGD work in the development of content metrics for the bitstream. Qualified technologies were to be advanced to Visual WD (notably 3D model connectivity coding and body animation) with related VM promotions if justified. Development of experiments for CGD was targeted (along with identifying 3D platforms, data sets, etc. needed to conduct the experiments) to verify a model for estimation of terminal loading during 3D rendering from content metrics in the bitstream.
Detail Dublin objectives are listed below:
Version 1
-
Audit of FCD comments & contribution to Study of FCD
– Review/action on meeting contributions about FCD
– Correct omissions, errors, editorial updates of Visual, Systems specs
-
Conformance
– Development of criteria for testing with full-envelop bitstreams, approach
to verification, link to profile/level data, augmenting data sets
-
Profiling
– Contribute changes driven by Requirements and FCD contributions
-
IM1 software status
– Any revision to work plan to ensure completion
-
Review & update SNHC software plan from Tokyo
– Reference software and plan brought up to date
-
Double-check Systems BIFS (incl. index face set 2D - problems with 2D Mesh)
– Nodes for FBA, 2D mesh, VDS, scene composition
-
Still Texture Coding
– Resolve any final issues to enable progressive, MIP texture
Version 2
1. Assessment of outstanding work items
a. 3D Model Coding
i. Decision - retention of 3D regular gridded mesh vs. efficiency?
ii. M1: 3D mesh connectivity - efficiency, generality, footprint
iii. M2: Geometry coding including progressive shape
iv. M3: Progressive connectivity coding (and links to M1, M2)
v. M4: Properties coding (normals, color, texture coordinates)
b. Body Animation
i. BA2: BAP Compression
ii. BA3: Hand BAP Interpretation
iii. BA6: BAP Quantization Step Sizes
iv. BA7: BDP Interpretation
c. Face Animation
i. FA1: Face Calibration (feature points, mesh texture & shape)
ii. FA2: FAP Coding with Facial Action Basis Functions vs. FIT?
iii. FA3: Error Resilience of FAP Bitstreams
2. VM 9.0 contribution with recommended changes from AHGs
3. Critical review of pre-Dublin CEs compared to interim AHG results
4. VM update per CEs if significant change for technology promotion
5. Verification of 1st bitstream exchanges, selection of WD technologies
6. If technology changes in VM, critical plan to finish WD on CD schedule
7. Quality/effectiveness of documentation/software for 2nd implementers
8. Profiling: FA calibration studies vs. Calibration, Predictable profiles
9. ISG CGD contributions and SNHC experiment design/contributors
a. Adaptive decoding for terminal resources vs. media complexity
b. Basic capability for scalable content with no backchannel
c. Server adaptation of content with backchannel terminal metrics
i. Key applications and requirements
ii. Impact on, supported by content developers
iii. Assistance to Systems for real solutions
iv. Level of server/terminal negotiation
v. Effect on profiling
Most of these objectives were achieved. However for Version 2, profiling discussions were tabled in deference to Version 1 work, and no time was spent on backchannel work associated with CGD and server adaptation of content. For Version 1, significant improvements to the FCD that involve lagging editorial work with NB backing, and final conformance work with supporting test data, must still be achieved by Atlantic City. Key individuals are committed to closure on these points. For Version 2, the schedule imperatives associated with fulfilling required process steps to qualify technology fully for CD now requires another interim AHG meeting for 3D Model Coding before Atlantic City. Promotion to WD of remaining elements of 3D Model Coding is expected, and is needed to produce an integrated tool set for static and progressive compression of connectivity, geometry, and properties.
SNHC Contributions & Related Review
The following SNHC-related contributions were reviewed during the meeting:
FCD for Version 1
3635
|
Systems
|
Comments on FCD 14496-1 (Systems)
|
The National Body of Japan
|
3672
|
Systems
|
Comments on Systems FCD
|
Zvi Lifshitz
|
3689
|
Video
|
Comments on FCD 14496-2 (Visual)
|
The National Body of Japan
|
3690
|
Video
|
Editorial comment on FCD 14496-2 ANNEX
|
The National Body of Korea, ksc29@kisi.or.kr, ahnc@etri.re.kr
|
3691
|
Video
|
Comments on BBM and FCD 14496-2 Visual
|
The National Body of Korea, ksc29@kisi.or.kr, ahnc@etri.re.kr
|
3715
|
Video
|
UK NB comments on FCD 14496-2 (visual)
|
UK National Body
|
3723
|
AHG Report
|
Report of the ad hoc group on video VM and visual WD/FCD editing
|
Touradj Ebrahimi, Caspar Horne, Euee Jang
|
3784
|
Video
|
Study of the FCD Visual Texture Coding
|
Jie Liang, Raj Talluri
|
3807
|
General
|
Comments on FCDs
|
D.Curet
|
3821
|
Video
|
Implementation of Visual Texture Coding in Microsoft FCD software and IM1
|
Iraj Sodagar, Hung-Ju Lee, Bing-Bing Chai, Paul Hatrack, Shipeng Li, B.S. Srinivas
|
3824
|
Video
|
Editorial and minor technical changes of Visual Texture Coding ("proposed study of FCD")
|
Iraj Sodagar, Hung-Ju Lee, Paul Hatrack, Shipeng Li, Bing-Bing Chai
|
2D Mesh for Version 1
3644
|
SNHC
|
Input for Study of MPEG-4 Visual FCD
|
P. van Beek
|
Face & Body Animation for Version 2
3590
|
SNHC
|
Donation to ISO of Hand Animation Software
|
Francoise Preteux,, Marius Preda, and Gerard Mozelle
|
3592
|
SNHC
|
Hand Animation and BAPs Extraction: Reports on Core Experiment 3
|
Francoise Preteux,, Marius Preda, and Gerard Mozelle
|
3661
|
SNHC
|
Model-Based Face Tracking and 3D Pose Estimation
|
Francoise Preteux, and Marius Malciu
|
3755
|
SNHC
|
Report of ad hog group on Face and Body Animation
|
Eric Petajan, Tolga Capin
|
3770
|
SNHC
(no)
|
Status of Body Animation Quantization Core Experiment
|
Damian Lyons
|
3772
|
SNHC
|
Results of Body Animation Core Experiments BAP2, BAP3, BAP6, BAP7
|
Tolga K. Capin, Joaquim Esmerado
|
3773
|
SNHC
|
Report on Local Processing Scalability for Body Animation
|
Tolga K. Capin, Joaquim Esmerado
|
3777
|
SNHC
|
Proposal for Update to Body Animation Specification
|
Tolga K. Capin, Joaquim Esmerado
|
3D Model Coding for Version 2
3530
|
SNHC
|
Mesh Connectivity Coding by Dual Graph Approach
|
Jiankun Li, C.-C. Jay Kuo, Homer Chen
|
3591
|
SNHC
|
Geometry and Topology Compression of 3D Meshes: Results of Core Experiment M1 and M2
|
Francoise Preteux,, Mircea Curila, Sorin Curila, Jose Paumard, and Gerard Mozelle
|
3652
|
SNHC
|
Progressive Mesh Coding by Independent Vertex Split
|
Jiankun Li, C.-C. Jay Kuo, Homer Chen
|
3688
|
SNHC
|
On the development of 3-D mesh coding tools
|
The National Body of Korea, ksc29@kisi.or.kr, ahnc@etri.re.kr
|
3724
|
AHG Report
|
Report of the ad hoc group on 3D model coding
|
Touradj Ebrahimi
|
3735
|
SNHC
|
Results of core experiment M2: Geometry coding using PRVQ
|
Jin Soo Choi, Myoung Ho Lee, Chieteuk Ahn
|
3751
|
SNHC
|
Geometry Compression of 3D Meshes using Optimal Quantization for Prediction Errors
|
Yo-Sung Ho, Jeong-Hwan Ahn
|
3752
|
SNHC
|
Adaptive Quantization Method for 3D Mesh Representation using the Spherical Coordinate System
|
Yo-Sung Ho, Jeong-Hwan Ahn
|
3753
|
SNHC
|
Results of Core Experiment M2 on 3D Model Coding
|
Jeong-Hwan Ahn, Yo-Sung Ho
|
3768
|
SNHC
|
Experimental Results on Mesh Connectivity Coding based on Looping Triangle Strip (M1)
|
Mun-Sub Song, Mahn-Jin Han, Euee S. Jang, Y.S. Seo(SAIT),, Hyungin Choi(SNU)
|
3769
|
SNHC
|
Experimental Results on Geometry Compression for Mesh Coding using Optimal Quantization for Prediction Errors (M2)
|
Mahn-Jin Han, Mun-Sub Song, Euee S. Jang
|
3793
|
SNHC
|
Description of core experiments on 3D model coding
|
Frank Bossen (editor)
|
3794
|
SNHC
|
Results of core experiments on 3D model coding
|
Frank Bossen
|
3801
|
SNHC
|
Progressive 3D mesh coding with subdivision surfaces
|
Francisco Moran
|
3810
|
SNHC
|
Report on Results of Core Experiment M1 on 3D Model Coding
|
Gabriel Taubin, Claudio Silva, Andre Gueziec, Bill Horn
|
3811
|
SNHC
|
Report on Results of Core Experiment M3 on 3D Model Coding
|
Gabriel Taubin, Claudio Silva, Andre Gueziec, Bill Horn
|
SNHC VM for Version 2
3809
|
SNHC
|
SNHC Verification Model 9.0
|
Gabriel Taubin
|
3812
|
SNHC
|
SNHC VM 9.0 Source Code for TS and PFS Connectivity Encoding and Decoding
|
Gabriel Taubin, Claudio Silva, Andre Gueziec, Bill Horn
|
Visual Still Texture Coding
3621
|
Video
|
Scalable Shape Coding for Still Texture
|
Yoshihiro Ueda, Zhixiong Wu
|
3803
|
Video
|
Mini experiment on scanning for low complexity wavelet texture coding
|
Iole Moccagatta, Osama Alshaykh, Homer Chen
|
3804
|
Video
|
Core experiment on error resilience for still texture using a packet approach
|
Iole Moccagatta, Osama Alshaykh, Homer Chen
|
3822
|
Video
|
Bitstream exchange result for visual texture coding
|
Iraj Sodagar, Hung-Ju Lee, Paul Hatrack, Shipeng Li, Bing-Bing Chai, Bing-Bing Chai
|
3826
|
Video
|
Report of results on CE-E16: Error Resilient Still Texture using a Packet Approach
|
Iraj Sodagar, Bing-Bing Chai, B.S. Srinivas
|
3827
|
Video
|
Verification of result on CE-F1: Tiling function for visual texture
|
Hung-Ju Lee, Iraj Sodagar
|
3829
|
Video
|
Report of the ad-hoc group on Visual Texture Coding
|
Iraj Sodagar, Iole Moccagatta
|
Visual WD for Version 2
3553
|
Video
|
MPEG-4 Version 2 Visual Working Draft Rev. 3.1
|
Euee S. Jang, Visual CD Editors
|
Computational Graceful Degradation & Quality of Service
3567
|
ISG
|
Computational Graceful Degradation Analysis in SNHC
|
Gauthier Lafruit, Lode Nachtergaele, Andy Scherpenberg, Tom Huybrechts, Jan Bormans
|
3616
|
ISG
|
Report of the ad-hoc group on decoder QoS
|
Marco Mattavelli
|
3679
|
ISG
|
Report of the Ad-Hoc Group on Computational Graceful Degradation
|
Jan Bormans, Marco Mattavelli
|
Complexity Analysis
3551
|
ISG
|
A complexity analysis tool: iprof (version 0.41)
|
Peter Kuhn
|
3568
|
ISG
|
Complexity Analysis of FCD still texture coding
|
Gauthier Lafruit, Mercedes Peon, Bart Vanhoof, Jan Bormans
|
3631
|
ISG
|
Computation Complexity Profiling of the IM-1 Player
|
Mercedes Peon, Lode Nachtergaele, Gauthier Lafruit, Peter Vos, Jan Bormans
|
3645
|
ISG
|
Complexity analysis and guidelines for profile definition of Still Texture Coding
|
Gauthier Lafruit, Mercedes Peon, Bart Vanhoof, Jan Bormans
|
3786
|
ISG
|
Report on Core Experiment Results of Encoder Complexity Reduction Based on Intelligent Pre-Quantizaton
|
Wei Wu, Homer Chen
| Output Document Editors
Editors or coordinators responsible for SNHC elements of output documents were:
Study of FCD Contributions
|
|
Face Animation
|
Eric Petajan
|
2D Animated Mesh
|
Murat Tekalp
|
View-Dependant Scalable Texture
|
Homer Chen
|
Cross Review of Still Texture
|
Iraj Sodagar
|
Cross Review of Systems BIFS
|
Claudio Lande, Julien Signes
|
SNHC Conformance Working Draft
|
Michael Frater (Thomas Sikora), Eric Petajan, Murat Tekalp, Pete Doenges
|
SNHC/ISG CGD Experiment
|
Gauthier Lafruit, Marco Mattavelli, Jan Bormans, Eric Petajan, Pete Doenges, Claudio Lande
|
SNHC to Visual WD for Version 2
|
Yuchiro Nakaya, Mr. Shin (Thomas Sikora), Gabriel Taubin, Tolga Capin
|
SNHC Profiles/Levels Contributions
|
Murat Tekalp
|
SNHC Software Work Plan
|
Jiankun Li
|
SNHC Verification Model 9.0
|
Gabriel Taubin
|
SNHC Core Experiments
|
Frank Bossen
|
Press Release
|
Pete Doenges
| AHG Meetings and Reports
The following AHG meetings were held on Sunday before the WG11 meeting and reports discussed:
3755
|
SNHC
|
Report of ad hog group on Face and Body Animation
|
Eric Petajan, Tolga Capin
|
3724
|
SNHC
|
Report of the ad hoc group on 3D model coding
|
Touradj Ebrahimi
|
3829
|
Video
|
Report of the Ad Hoc Group on Visual Texture Coding
|
Iraj Sodagar, Iole Moccagatta
|
3679
|
ISG
|
Report of the Ad-Hoc Group on Computational Graceful Degradation
|
Jan Bormans, Marco Mattavelli
|
See those reports for details. An important development emerged from the meetings on 3D Model Coding. While there has been tremendous work by various proponents and partners as 2nd implementers to bring the work to the maturity of bitstream exchanges, these were not consistently achieved across all the M1-M4 experiments. Thus VM and WD promotions were not as aggressive as originally hoped. The Korean NB requested a response to their concern on this, and another AHG meeting is planned in Korea.
Meeting Work and Results Study of FCD for Version 1
Little of the NB comments on FCD applied to SNHC-related functionalities except for Still Texture Coding. The responsible working group in Video continues to handle the DoC and Study inputs for Still Texture Coding with cross-review by SNHC. Iraj Sodagar presented the intended actions for this work along with the AHG report to the SNHC group. Reported inconsistencies in bitstream decoding for different implementations of Still Texture and other problems reported at and after Tokyo are receiving needed attention. The group expected pertinent items to be corrected in the Study of FCD and subsequent NB follow-up for Atlantic City.
After Tokyo, a number of edits in the FCD were discovered missing after the publishing date in areas of FA and 2D Mesh as previously driven by NB comments including references back into CD and Study of CD. The reasons are important to understand (editor overloads, lack of incremental review before publishing, lack of adequate proponent participation at the right times). The results are that FCD can not transition to DIS with reliability without a thorough assessment of the FA and 2D Mesh areas of the FCD. This is obviously not the state that the work should be in nor is it the intent of the process. The process demands a very high level of quality in the text at this point. People involved have lost a bit of confidence in the process, but have pledged to close editing gaps by rebuilding what should have been in FCD from the known NB trail. NBs should take a careful look at this situation in NB meetings before Atlantic City, collaborate with other interested NBs, and make NB contributions to Atlantic City supported by the necessary unanimity of proponent organizations.
The 2D Mesh area was covered by an individual contribution (M3644) in Dublin that provided corrections to the FCD in the necessary details reflecting previous NB requests. The FA area required working time in the Dublin meeting to audit the FCD for missing parts and give priority to the process of recovering the trail by which to incorporate needed changes driven by NB comments going back into Tokyo. This situation was complicated by the necessity to edit the edits of the CD. Individuals in the FA area have committed to "rebuild" a valid NB-driven version of FCD in the affected areas before Atlantic City to gain NB approval and contributions to the meeting. They have also volunteered to join the main Visual and Systems editors wherever necessary to ensure progressive verification and cross-checking of DIS editing after Atlantic City.
Conformance Work for Version 1
An outline for the Conformance strategy and content of Part 4 was initially developed in the Dublin meeting, along with a review of the existing state of Part 4. Then several discussions were held with proponents (specifically FA and 2D Mesh) to characterize the methods of conformance, exercising necessary modalities in the decoder, and augmenting current bitstreams to achieve adequate verification of proper function and performance as driven by profile/level results. An initial version of the Conformance additions to Part 4 with some embedding of current Part 4 language to help with convergence of consistent results was issued as N2299. Key individuals for FA and 2DM will expand this document after Dublin, and discuss its refinement as well as the development of data sets using appropriate reflectors. No AHG was formed, but the mandate is very clear, and proponents must supply an integrated contribution with NB support by Atlantic City.
Still Image Texture Coding in Version 1
In addition to the FCD work, several delegates discussed the need to verify wavelet texture coding in its application to 3D. Specifically, wavelet texture coding should be subjected to an end-to-end (non-normative) verification test (not a CE) relative to the resolution scalability and quality of still texture when used in 3D rendering. The test would include conversion of a downloaded wavelet image pyramid from MPEG-4 Part 5 decoding into MIP map texture for varied viewing of textured 3D models on a PC graphics accelerator or workstation. OpenGL hardware with bilinear and trilinear filtering of MIP map texture could be used with dynamic viewing to verify that wavelet texture supports the intended MPEG-4 functionality. E&S volunteered rendering on an OpenGL PC 3D graphics accelerator and Lucent Bell Labs volunteered rendering an SGI OpenGL workstation to observe wavelet-based texture in 3D.
Textured 3D models would be run through the complete MPEG-4 still image texture pipeline. Texture maps would be extracted from 3D models, the textures processed through the wavelet encoder, the complete wavelet bitstream decoded while accumulating the supplied image resolution layers (11 peak), results translated to MIP map texture, and the reintegrated texture models rendered in 3D. Demos would be staged under viewing conditions of varied model orientation, distance, and perspective distortion to show the effectiveness of resolution scalability. If possible, videos would be prepared for Atlantic City. This effort will be pursued following the Dublin meeting. Sarnoff had anticipated such a demo with some work now applied in this direction. Rockwell and AT&T Research may be able to help. IM1 now includes the tools, and may be a platform for testing. Sarnoff confirmed that a schedule of tasks and deadlines before Atlantic City and spreading of this collaborative work over partners should be detailed.
CGD for SNHC Functionalities in Version 2
The ISG group has done a productive job of laying the basis for using CGD to support SNHC/Systems BIFS functionalities that depend on the display of models at useful rates and fidelities while driven by MPEG-4 decoders producing 2D, 3D, or mixed media streams. MPEG-4 compliant decoders should achieve adequate performance in decoding rates due to the profile/level specifications and associated conformance definitions. However, there is presently no normative profiling of model or rendering complexity for specific 2D/3D content whose terminal loading beyond the decoder can be difficult to predict from the content. Overload of terminal rendering is possible with compliant decoding.
The goal of current CGD work is to provide bitstream parameterization of content complexity and structure that could be used by the terminal to select scalable versions of the content related to the 2D/3D geometry and rendering capacities of the terminal beyond the decoder. The correct model for estimating geometry and rendering complexity will ultimately be hardware/software-specific and can depend on 3D content organization, graphics architecture, viewpoint location/look angle and associated culling, scene occlusion, the context of state in the graphics pipeline, bus and memory limitations, etc.
ISG has done previous work including some benchmarking and analysis of specific systems to characterize basic content metrics that are likely to produce varied effects on video composting and 3D rendering accelerators. M3567 for Dublin from ISG members provided a more substantial proposal with functional analysis of model transformations and polygon/texel rendering into the pixels of an output image, along with some approximations that predict rendering load for component pieces of a 2D/3D media stream. This estimating is simplified compared to hardware-specific models, but a useful start.
During Dublin several joint working sessions and informal discussions were held between ISG and SNHC about how to design experiments validating the utility of estimating metrics in M3567. These discussions also dissected what kinds of 3D models and variations in viewing conditions should be used to help validate specific performance estimators. Some time was also spent educating and clarifying specific points about the context-dependent aspects of rendering that are not adequately accounted for now by the ISG CGD model for SNHC. Specific vendors undoubtedly have and might supply more details on the total performance prediction model for estimating rendering load. However, the agreed scope of the current work is to identify hardware architectural abstractions that provide more promise for achieving a normative methodology for hardware, software, and content developers to adapt MPEG-4 content in relation to predicted loading, and to provide scalability controls for content in the bitstream.
The output document N2317 provides the results of this work. Without going into details here, several organizations (EPFL, IBM, France Telecomm, E&S, etc.) have agreed to provide models with and without texture for testing. Some organizations have agreed to supply platform assistance (e.g. E&S has shipped a high-performance PC 3D accelerator to IMEC), and may provide more performance estimation insights if this helps broaden the applicability of ISG results to a larger set of hardware and software renderers. On-going discussions should be on the ISG reflector. Some members have agreed to post other academic and industrial references on benchmarking and performance prediction to the reflector.
BIFS for SNHC Functionalities
Two joint meetings were held with Systems BIFS. One session discussed remaining issues on Version 1 system integrity. A separate session examined new functionalities in Version 2 to support SNHC decoding and scene composition. The Version 1 discussion dealt with specific node and synchronization issues. Version 2 discussions compared Advanced BIFS with SNHC directions including these points:
Scripts & more sophisticated animation/interaction
Customizing BAP streams by some extension mechanism
- Adding feature control points, clothing, jewelry
Dealing with progressive & incremental 3D models
Harmonizing aural environment modeling
- Material properties & room geometry
- Consistency with 3D Model Coding
These discussions were fruitful, and no basic problems or duplications were identified. The mechanisms for gradually expanding model detail or for adding to the model in chunks for error resilience while always having an instantaneous model ready for scene composition and rendering need further case study.
Face Animation
Some encouraging work in model-based face tracking and 3D pose estimation was reviewed. This tends to strengthen the case for the developing availability of technology on the encoder side to instrument live faces for interactive applications of face animation at very low bandwidth. Discussions continued on face calibration for Version 2 and the FCD problems for Version 1 cited earlier. The FAP default values in arithmetic coder limits needed to be fixed for Version 1. Facial Action Basis Functions, as a means to new functionality with simultaneous high coding efficiency and low decoding lag (compared with the current frame-based arithmetic coding or DCT coding of FAPs), was dropped for lack of adequate results on-time. This decision removes from consideration any new technologies that might have challenged backward compatibility with Version 1 or the principle of one function, one tool. The area of face calibration still needs more work to enable closure on useful profiling results with Requirements in Version 2 for Calibration and Predictable FA. Due to the many Version 1 priorities at the Dublin meeting, no effort was joined with Requirements to move this further along. A meeting was held with Requirements to assess the packaging of Simple FA in Version 1 profiling relative to Main Profile; results should be reviewed in the output of Requirements from Dublin.
2D Mesh
The Study of FCD contribution (M3644) was made available to the Visual editors as a reflection of prior NB support. A joint session with Requirements was held to clarify level points previously recommended.
A new CE (M6) was added to the 3D Model Coding work associated with a slight change in header bitstream flag semantics of 2D Mesh that would access 3D model connectivity coding now in the Visual WD for Version 2. This means that the same Version 2 tool set would provide a form of generalized 2D topological coding in addition to the 2D regular and Delaunay mesh coding now covered by the header for 2D mesh coding in Version 1. Thus no new technology is introduced, and applications that need general 2D mesh topological coding for MPEG-4 Version 2 could achieve this by invoking the 3D mesh connectivity tool now headed for CD. This would also provide stronger support for efficient coding of 2D model primitives now supported in the BIFS 2D nodes and profiling work. Due to the meeting steps remaining in Version 2 work, we must achieve bitstream testing and exchange by the interim 3D Model Coding AHG meeting before Atlantic City. Then the M6 initiative must gain the approval of the Video and SNHC groups showing that adequate discipline has been followed to incorporate M6 results in the CD when the currently envisioned detail (however minor) is not in the VM. This coupling of 2D mesh and 3D mesh connectivity coding was rehearsed and agreed a year ago in Stockholm. The necessary work before Dublin was not completed on the agreed VM/WD/CD schedule.
Body Animation in Version 2
Much of the work was concentrated on verifying that all conditions have been met for promoting body animation to WD including bitstream exchange and on improving the quality of the text for this purpose. INT generously donated the hand animation software reviewed in the last few meetings, and reported on successful BAP extraction for hand animation. Some valuable results were reported on body animation core experiments. Analysis and demonstrations of animated models illuminated a better understanding of bitrate vs. quantization tradeoffs. Dynamic animations helped illustrate where BAP quantization levels become unacceptable for useful body motion granularity, signing intelligibility, stable dependent motions of body parts, etc. Specific hand signing was demonstrated for alphabet examples, and VRML Consortium Humanoid Animation models were integrated with MPEG-4 body animation to observe the composite behavior of this unification of models and BAP decoding.
The Dublin meeting work actively incorporated informal agreements made with the VRMLC H-Anim working group in the joint meeting on June 88, 1998, to harmonize the work of the two groups. Body animation faced one dilemma about normative specification referencing. The nomenclature used in MPEG-4 and H-Anim specifications should be the same. However the H-Anim work will likely result in an informative annex to the VRML specification without the force of normative standardization in an ISO framework. Thus MPEG-4 body animation will not make normative reference to H-Anim, but will use the same nomenclature to enhance the prospects for ready development of applications when H-Anim and MPEG-4 are combined. If there is any copyright problem, an indexing system will have to be adopted.
3D Model Coding in Version 2
The baseline work on 3D Model Coding for the interim AHG meeting at IBM Research on May 18-19, 1998, and for the Dublin meeting has included:
M1: 3D mesh connectivity - lossless compression
_ Topological Surgery with enhanced arithmetic coding
_ Dual Graph topology with adaptive context-based arithmetic coding
_ Loop topology coding with islands and bridges
M2: Geometry coding including progressive shape
_ Predictive Residual Vector Quantization (with Lattice VQ)
_ Scalar Quantization (VRML-CBF geometry compression)
_ Successive Quantization (with embedded arithmetic coding)
M3: Progressive connectivity coding
_ Progressive forest split compression
_ Progressive vertex split/edge collapse
M4: Properties coding (normals, color, texture coordinates)
_ Compressed representation of properties attached to a model,
with all bindings defined in VRML (per vertex, per face, per corner)
The experimentation with 3D models has been extensive and the technologies analyzed increasingly sophisticated as well as cutting-edge. As mentioned before, a considerable effort has been applied to 3D model coding, and care has been taken to exercise the discipline of N2073 as well as requiring that results to be produced by deadlines to meet the CD targets. The efforts at the meeting have included bitstream exchanges achieved during the meeting. Extensive data on bitrate vs. geometric distortion have been rerun to isolate factors for more valid comparisons in selecting technologies. In one CE review, even interactive end-to-end model processing into views on a VRML browser was accomplished while participants watched to verify visually (as well as with data plots) the geometric error dispersion of encoding and decoding. In some cases 2nd implementations and bitstream exchange did not finish in time.
The process reviews and results are sufficiently complex, that a viewgraph summary of the status of each area was developed and maintained during the meeting and will be posted on the snhc-obj reflector. At a summary level, the relative movement of technologies into the WD or VM is shown below:
Version 2 technologies to Visual/Systems WDs
_ 2D/3D Mesh Connectivity: Topological Surgery/IBM
Prior run in VRML community, much work & maturity
Excellent competing technologies from USC, Samsung
_ Body Animation: Fusion of FA adaptation & H-Anim
“Loose” specification linkage with shared nomenclature
Version 2 SNHC Verification Model
_ 3DMC Connectivity: promoted out of VM ??WD
_ Body Animation: promoted out of VM ??WD
_ 3DMC Geometry: Successive Quantization ??VM (from CE)
_ 3DMC Progressive: Forest Split already in VM
_ 3DMC Properties: VQ ??SQ? evaluating alternatives
This resulted in the continuance of M2-M4 for bitstream exchanges and in some cases for the achievement of other required process steps. Partners were enlisted to enhance the verification of the various competing technologies to strengthen the viability of the primary candidates for CD promotion. The AHG meeting in Korea in late August will be pivotal in providing sufficient evidence that the remainder of the 3D Model Coding tool set can and should be advanced to CD.
The Korean NB expressed due concern about the adequacy of resources to finish the process in a manner that justifies standardization. After much open discussion of the status of the work and the further volunteering of participation, the following response to the Korean NB was given:
"WG11 thanks the Korean National Body for its comments on the progress of SNHC and in particular the 3D Model Coding tools for Version 2 MPEG-4. In response, two additional 3D Model Coding Ad Hoc Group meetings have been scheduled before the WG11 meeting in Atlantic City to ensure adequate attention to this important priority. Moreover, necessary steps have been taken to enlist additional resources to confirm the superiority of the technologies already selected for advancement to Working Draft in Atlantic City. A request for additional test models will be issued, and the 3D Model Coding Verification Model software will be made available for public evaluation, at this meeting."
The group also recommended to WG11 a call for data sets to expand the 3D models used in testing and to publish the VM/WD software to the industry to encourage experimentation and feedback on the robustness of the tools. A special header for the software (expected to be issued after the Korean AHG meeting) was drafted toward the end of the WG11 meeting and approved. Owners of the M1-M4 software should review this header language for acceptability (N2397) before issuing the software.
Starting with the AHG meeting at IBM Research, Samsung has proposed a variant of strip-oriented 3D mesh coding, in addition to the Dual Graph (DG) and Topological Surgery (TS) technologies entered earlier, that performs well for looping or ring mesh topologies. Examination of this coding method revealed that it is compatible with incremental transmission of the 3D model for a given level of detail. This opened consideration of how M1 WD technology (TS) and others offered so far could be modified to support error resilience. Error resilience would be supported by any coding method that preserves high lossless coding efficiency for the base mesh, but also allows ready partitioning of the coding into free-standing chunks of the model while yielding only a small loss in coding efficiency. M5, Partitioning of Data for 3D Model Coding, was devised and supported by partners to rapidly investigate this functionality by building on the technologies already under consideration. A meeting with Requirements was held to establish that this functionality is useful, and the MPEG-4 Requirements were updated.
2D/3D Mesh Unification, M6, was described earlier to access 3D connectivity coding to serve as a general topological coding method for 2D meshes as well without new technology.
MPEG-4 Overview
Late in the Dublin meeting, an update of SNHC functional language and partitioning of tools into Version 1 and 2 matching the current status of work was supplied to Requirements for the MPEG-4 Overview. The submission was too late to affect output documents of Dublin, but should be refined as soon as Requirements has time to incorporate and critique the changes.
Output Documents
-
SNHC Conformance Plan N2299
-
SNHC Reference Software & Work Plan N2300
-
SNHC Verification Model 9.0 N2301
-
Core Experiments in Face Body Animation N2302
-
Core Experiments in 3D Model Coding N2303
-
SNHC Public Request for Domain-Specific
Data Sets for 3D Model Coding N2304
-
SNHC Release of Verification Model Tools
in 3D Model Coding for Public Evaluation N2305
-
SNHC FAQs N2309
-
Header for Release of MPEG-4 3D Model
Coding VM Software N2397
Core Experiments
The following Core Experiments were formulated for Atlantic City:
3D Model Coding N2303
_ M2 Mesh Geometry/Vertex Coding
Predictive Residual Vector Quantization (with Lattice VQ)
Scalar Quantization (IBM VRML-CBF geometry compression)
Successive Quantization (with embedded arithmetic coding)
_ M3 Progressive/Scalable 3D Mesh Coding
Progressive forest split compression
Progressive vertex split/edge collapse
_ M4 Attribute Coding and Tool Integration
Compressed representation of properties attached to a model,
with all bindings defined in VRML (per vertex, per face, per corner)
_ M5 Partitioning of Data for 3D Model Coding
Error resilience potential vs. efficiency loss
_ M6 2D/3D Mesh Unification
Face Body Animation N2302
_ FA1 Face Model Mesh Calibration
_ BA2 BAP Coding
_ BA6 BAP Quantization Step Size
_ BAT BAT Interpolation
Other Output Documents
The following output documents were generated:
-
SNHC Reference Software & Work Plan N2300
-
Body Animation, Topological Surgery removed, Successive Quantization added
-
Pre-Atlantic City AHG meetings aim to recommend further technology promotion per N2073
-
SNHC Public Request for Domain-Specific
Data Sets for 3D Model Coding N2304
-
SNHC Release of Verification Model Tools
in 3D Model Coding for Public Evaluation N2305
-
SNHC FAQs N2309
Ad Hoc Groups for Dublin
The following groups were established to coordinate core experiments and documents:
3D Model Coding N2306
_ Big workload, carefully designed priorities to move VM technologies to WD
_ Key work with much expanded partnering on 2nd implementations & verification - THANKS!
_ Heading toward very robust 3DMC tools
_ Crucial meetings for success
30-31 August 1998, Seoul Korea
11 October 1998, Atlantic City
Face Body Animation N2307
_ 11 October 1998, Atlantic City
SNHC VM Editing N2308
Dostları ilə paylaş: |