《ISO-14496-10-2008.pdf》由会员分享,可在线阅读,更多相关《ISO-14496-10-2008.pdf(586页珍藏版)》请在三一文库上搜索。
1、 Reference number ISO/IEC 14496-10:2008(E) ISO/IEC 2008 INTERNATIONAL STANDARD ISO/IEC 14496-10 Fourth edition 2008-09-15 Information technology Coding of audio-visual objects Part 10: Advanced video coding Technologies de linformation Codage des objets audiovisuels Partie 10: Codage visuel avanc IS
2、O/IEC 14496-10:2008(E) PDF disclaimer This PDF file may contain embedded typefaces. In accordance with Adobes licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In
3、downloading this file, parties accept therein the responsibility of not infringing Adobes licensing policy. The ISO Central Secretariat accepts no liability in this area. Adobe is a trademark of Adobe Systems Incorporated. Details of the software products used to create this PDF file can be found in
4、 the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the addr
5、ess given below. COPYRIGHT PROTECTED DOCUMENT ISO/IEC 2008 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either
6、 ISO at the address below or ISOs member body in the country of the requester. ISO copyright office Case postale 56 CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 E-mail copyrightiso.org Web www.iso.org Published in Switzerland ii ISO/IEC 2008 All rights reserved ISO/IEC 14496-10:200
7、8(E) ISO/IEC 2008 All rights reserved iii Contents Page Foreword.xiii 0 Introduction.xv 0.1 Prologuexv 0.2 Purpose.xv 0.3 Applications xv 0.4 Publication and versions of this specification.xv 0.5 Profiles and levelsxvi 0.6 Overview of the design characteristics xvi 0.6.1 Predictive coding.xvii 0.6.2
8、 Coding of progressive and interlaced videoxvii 0.6.3 Picture partitioning into macroblocks and smaller partitionsxvii 0.6.4 Spatial redundancy reduction.xviii 0.7 How to read this specificationxviii 1 Scope.1 2 Normative references1 3 Terms and definitions.1 4 Abbreviations 9 5 Conventions.10 5.1 A
9、rithmetic operators .10 5.2 Logical operators10 5.3 Relational operators11 5.4 Bit-wise operators.11 5.5 Assignment operators11 5.6 Range notation11 5.7 Mathematical functions.12 5.8 Variables, syntax elements, and tables13 5.9 Text description of logical operations.13 5.10 Processes.14 6 Source, co
10、ded, decoded and output data formats, scanning processes, and neighbouring relationships15 6.1 Bitstream formats15 6.2 Source, decoded, and output picture formats15 6.3 Spatial subdivision of pictures and slices20 6.4 Inverse scanning processes and derivation processes for neighbours21 6.4.1 Inverse
11、 macroblock scanning process.21 6.4.2 Inverse macroblock partition and sub-macroblock partition scanning process.22 6.4.3 Inverse 4x4 luma block scanning process.23 6.4.4 Inverse 4x4 Cb or Cr block scanning process for ChromaArrayType equal to 323 6.4.5 Inverse 8x8 luma block scanning process.24 6.4
12、.6 Inverse 8x8 Cb or Cr block scanning process for ChromaArrayType equal to 324 6.4.7 Derivation process of the availability for macroblock addresses24 6.4.8 Derivation process for neighbouring macroblock addresses and their availability.24 6.4.9 Derivation process for neighbouring macroblock addres
13、ses and their availability in MBAFF frames.25 6.4.10 Derivation processes for neighbouring macroblocks, blocks, and partitions 26 6.4.11 Derivation process for neighbouring locations31 7 Syntax and semantics34 7.1 Method of specifying syntax in tabular form.34 7.2 Specification of syntax functions,
14、categories, and descriptors35 7.3 Syntax in tabular form.36 7.3.1 NAL unit syntax36 7.3.2 Raw byte sequence payloads and RBSP trailing bits syntax.37 7.3.3 Slice header syntax46 ISO/IEC 14496-10:2008(E) iv ISO/IEC 2008 All rights reserved 7.3.4 Slice data syntax. 50 7.3.5 Macroblock layer syntax. 51
15、 7.4 Semantics58 7.4.1 NAL unit semantics 58 7.4.2 Raw byte sequence payloads and RBSP trailing bits semantics. 67 7.4.3 Slice header semantics 81 7.4.4 Slice data semantics 91 7.4.5 Macroblock layer semantics . 92 8 Decoding process. 104 8.1 NAL unit decoding process.105 8.2 Slice decoding process1
16、06 8.2.1 Decoding process for picture order count. 106 8.2.2 Decoding process for macroblock to slice group map 110 8.2.3 Decoding process for slice data partitioning. 114 8.2.4 Decoding process for reference picture lists construction 114 8.2.5 Decoded reference picture marking process. 121 8.3 Int
17、ra prediction process125 8.3.1 Intra_4x4 prediction process for luma samples 126 8.3.2 Intra_8x8 prediction process for luma samples 132 8.3.3 Intra_16x16 prediction process for luma samples 140 8.3.4 Intra prediction process for chroma samples 142 8.3.5 Sample construction process for I_PCM macrobl
18、ocks. 147 8.4 Inter prediction process147 8.4.1 Derivation process for motion vector components and reference indices. 150 8.4.2 Decoding process for Inter prediction samples. 163 8.4.3 Derivation process for prediction weights 172 8.5 Transform coefficient decoding process and picture construction
19、process prior to deblocking filter process 174 8.5.1 Specification of transform decoding process for 4x4 luma residual blocks 174 8.5.2 Specification of transform decoding process for luma samples of Intra_16x16 macroblock prediction mode 175 8.5.3 Specification of transform decoding process for 8x8
20、 luma residual blocks 176 8.5.4 Specification of transform decoding process for chroma samples 177 8.5.5 Specification of transform decoding process for chroma samples with ChromaArrayType equal to 3 179 8.5.6 Inverse scanning process for transform coefficients. 179 8.5.7 Inverse scanning process fo
21、r 8x8 transform coefficients 180 8.5.8 Derivation process for the chroma quantisation parameters and scaling function 182 8.5.9 Scaling and transformation process for DC transform coefficients for Intra_16x16 macroblock type. 184 8.5.10 Scaling and transformation process for chroma DC transform coef
22、ficients . 185 8.5.11 Scaling and transformation process for residual 4x4 blocks. 186 8.5.12 Scaling and transformation process for residual 8x8 blocks. 189 8.5.13 Picture construction process prior to deblocking filter process 193 8.5.14 Intra residual transform-bypass decoding process 194 8.6 Deco
23、ding process for P macroblocks in SP slices or SI macroblocks195 8.6.1 SP decoding process for non-switching pictures 195 8.6.2 SP and SI slice decoding process for switching pictures 198 8.7 Deblocking filter process200 8.7.1 Filtering process for block edges 204 8.7.2 Filtering process for a set o
24、f samples across a horizontal or vertical block edge . 206 9 Parsing process . 212 9.1 Parsing process for Exp-Golomb codes .212 9.1.1 Mapping process for signed Exp-Golomb codes 214 9.1.2 Mapping process for coded block pattern. 214 9.2 CAVLC parsing process for transform coefficient levels217 9.2.
25、1 Parsing process for total number of transform coefficient levels and trailing ones 217 9.2.2 Parsing process for level information. 221 9.2.3 Parsing process for run information 222 9.2.4 Combining level and run information. 225 9.3 CABAC parsing process for slice data.225 9.3.1 Initialisation pro
26、cess. 227 ISO/IEC 14496-10:2008(E) ISO/IEC 2008 All rights reserved v 9.3.2 Binarization process249 9.3.3 Decoding process flow257 9.3.4 Arithmetic encoding process (informative).278 Annex A Profiles and levels.285 A.1 Requirements on video decoder capability .285 A.2 Profiles285 A.2.1 Baseline prof
27、ile .285 A.2.2 Main profile.286 A.2.3 Extended profile286 A.2.4 High profile.286 A.2.5 High 10 profile287 A.2.6 High 4:2:2 profile287 A.2.7 High 4:4:4 Predictive profile.288 A.2.8 High 10 Intra profile288 A.2.9 High 4:2:2 Intra profile .289 A.2.10 High 4:4:4 Intra profile .289 A.2.11 CAVLC 4:4:4 Int
28、ra profile290 A.3 Levels 290 A.3.1 Level limits common to the Baseline, Main, and Extended profiles.290 A.3.2 Level limits common to the High, High 10, High 4:2:2, High 4:4:4 Predictive, High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra, and CAVLC 4:4:4 Intra profiles 292 A.3.3 Profile-specific leve
29、l limits .293 A.3.4 Effect of level limits on frame rate (informative)298 Annex B Byte stream format.301 B.1 Byte stream NAL unit syntax and semantics301 B.1.1 Byte stream NAL unit syntax301 B.1.2 Byte stream NAL unit semantics.301 B.2 Byte stream NAL unit decoding process302 B.3 Decoder byte-alignm
30、ent recovery (informative).302 Annex C Hypothetical reference decoder 304 C.1 Operation of coded picture buffer (CPB)306 C.1.1 Timing of bitstream arrival .306 C.1.2 Timing of coded picture removal308 C.2 Operation of the decoded picture buffer (DPB)308 C.2.1 Decoding of gaps in frame_num and storag
31、e of “non-existing“ frames308 C.2.2 Picture decoding and output309 C.2.3 Removal of pictures from the DPB before possible insertion of the current picture.309 C.2.4 Current decoded picture marking and storage.310 C.3 Bitstream conformance .310 C.4 Decoder conformance.312 C.4.1 Operation of the outpu
32、t order DPB .313 C.4.2 Decoding of gaps in frame_num and storage of “non-existing“ pictures313 C.4.3 Picture decoding313 C.4.4 Removal of pictures from the DPB before possible insertion of the current picture.313 C.4.5 Current decoded picture marking and storage.314 Annex D Supplemental enhancement
33、information .316 D.1 SEI payload syntax316 D.1.1 Buffering period SEI message syntax.318 D.1.2 Picture timing SEI message syntax.318 D.1.3 Pan-scan rectangle SEI message syntax319 D.1.4 Filler payload SEI message syntax319 D.1.5 User data registered by ITU-T Rec. T.35 SEI message syntax.320 D.1.6 Us
34、er data unregistered SEI message syntax320 D.1.7 Recovery point SEI message syntax .320 D.1.8 Decoded reference picture marking repetition SEI message syntax321 D.1.9 Spare picture SEI message syntax.321 D.1.10 Scene information SEI message syntax.322 D.1.11 Sub-sequence information SEI message synt
35、ax322 D.1.12 Sub-sequence layer characteristics SEI message syntax.322 ISO/IEC 14496-10:2008(E) vi ISO/IEC 2008 All rights reserved D.1.13 Sub-sequence characteristics SEI message syntax 323 D.1.14 Full-frame freeze SEI message syntax 323 D.1.15 Full-frame freeze release SEI message syntax 323 D.1.1
36、6 Full-frame snapshot SEI message syntax 323 D.1.17 Progressive refinement segment start SEI message syntax. 324 D.1.18 Progressive refinement segment end SEI message syntax 324 D.1.19 Motion-constrained slice group set SEI message syntax 324 D.1.20 Film grain characteristics SEI message syntax. 325
37、 D.1.21 Deblocking filter display preference SEI message syntax 325 D.1.22 Stereo video information SEI message syntax 326 D.1.23 Post-filter hint SEI message syntax 326 D.1.24 Tone mapping information SEI message syntax. 327 D.1.25 Reserved SEI message syntax. 327 D.2 SEI payload semantics327 D.2.1
38、 Buffering period SEI message semantics 327 D.2.2 Picture timing SEI message semantics . 328 D.2.3 Pan-scan rectangle SEI message semantics 332 D.2.4 Filler payload SEI message semantics 333 D.2.5 User data registered by ITU-T Rec. T.35 SEI message semantics 333 D.2.6 User data unregistered SEI mess
39、age semantics 333 D.2.7 Recovery point SEI message semantics 333 D.2.8 Decoded reference picture marking repetition SEI message semantics 335 D.2.9 Spare picture SEI message semantics. 335 D.2.10 Scene information SEI message semantics. 337 D.2.11 Sub-sequence information SEI message semantics 339 D
40、.2.12 Sub-sequence layer characteristics SEI message semantics . 340 D.2.13 Sub-sequence characteristics SEI message semantics 341 D.2.14 Full-frame freeze SEI message semantics 342 D.2.15 Full-frame freeze release SEI message semantics 343 D.2.16 Full-frame snapshot SEI message semantics 343 D.2.17
41、 Progressive refinement segment start SEI message semantics . 343 D.2.18 Progressive refinement segment end SEI message semantics 344 D.2.19 Motion-constrained slice group set SEI message semantics. 344 D.2.20 Film grain characteristics SEI message semantics 345 D.2.21 Deblocking filter display pref
42、erence SEI message semantics. 350 D.2.22 Stereo video information SEI message semantics 352 D.2.23 Post-filter hint SEI message semantics. 353 D.2.24 Tone mapping information SEI message semantics . 354 D.2.25 Reserved SEI message semantics . 355 Annex E Video usability information 356 E.1 VUI synta
43、x357 E.1.1 VUI parameters syntax. 357 E.1.2 HRD parameters syntax 358 E.2 VUI semantics.358 E.2.1 VUI parameters semantics 358 E.2.2 HRD parameters semantics. 371 Annex F Patent Rights 373 Annex G Scalable video coding 375 G.1 Scope.375 G.2 Normative References.375 G.3 Definitions 375 G.4 Abbreviati
44、ons379 G.5 Conventions379 G.6 Source, coded, decoded and output data formats, scanning processes, neighbouring and reference layer relationships.379 G.6.1 Derivation process for reference layer macroblocks. 379 G.6.2 Derivation process for reference layer partitions 383 G.6.3 Derivation process for
45、reference layer sample locations in resampling 384 G.7 Syntax and semantics386 ISO/IEC 14496-10:2008(E) ISO/IEC 2008 All rights reserved vii G.7.1 Method of specifying syntax in tabular form386 G.7.2 Specification of syntax functions, categories, and descriptors386 G.7.3 Syntax in tabular form.386 G
46、.7.4 Semantics397 G.7.4.1 NAL unit semantics.398 G.8 SVC decoding process.423 G.8.1 SVC initialization and decoding processes .424 G.8.2 SVC reference picture lists construction and decoded reference picture marking process .441 G.8.3 SVC intra decoding processes.450 G.8.4 SVC Inter prediction proce
47、ss 459 G.8.5 SVC transform coefficient decoding and sample array construction processes470 G.8.6 Resampling processes for prediction data, intra samples, and residual samples.488 G.8.7 SVC deblocking filter processes .514 G.8.8 Specification of bitstream subsets.525 G.9 Parsing process.526 G.9.1 Alt
48、ernative parsing process for coded block pattern.526 G.9.2 Alternative CAVLC parsing process for transform coefficient levels 527 G.9.3 Alternative CABAC parsing process for slice data in scalable extension.531 G.10 Profiles and levels.534 G.10.1 Profiles534 G.10.2 Levels537 G.11 Byte stream format5
49、41 G.12 Hypothetical reference decoder541 G.13 Supplemental enhancement information.541 G.13.1 SEI payload syntax542 G.13.2 SEI payload semantics 549 G.14 SVC video usability information extension565 G.14.1 SVC VUI parameters extension syntax.565 G.14.2 SVC VUI parameters extension semantics565 LIST OF FIGURES Figure 6-1 Nominal vertical and horizontal locations of 4:2:0 luma and chroma samples in a frame.16 Figure 6-2 Nominal vertical and horizontal sampling locations of 4:2:0 samples in top and bottom fields17 Figure 6-3 Nominal vertical and
链接地址:https://www.31doc.com/p-3774735.html