ISO IEC 23000-11:2009 pdf download - Information technology — Multimedia application format (MPEG-A) — Part 11: Stereoscopic video application format

ISO IEC 23000-11:2009 pdf download – Information technology — Multimedia application format (MPEG-A) — Part 11: Stereoscopic video application format

ISO IEC 23000-11:2009 pdf download – Information technology — Multimedia application format (MPEG-A) — Part 11: Stereoscopic video application format
1 Scope
This part of ISO/IEC 23000 specifies a file format which is capable of storage,interchange,management,editing, and presentation of stereoscopic video contents based on the ISO base media file format. The fileformat provides the overall structure for storing stereoscopic video contents with the related stereoscopicinformation in mobile environments.
2Normative references
The following referenced documents are indispensable for the application of this document.For datedreferences,only the edition cited applies. For undated references,the latest edition of the referenceddocument (including any amendments) applies.
ISO/IEC 10918-1:1994,Information technology — Digital compression and coding of continuous-tone stillimages: Requirements and guidelines
ISOIEC 14496-2, Information technology — Coding of audio-visual objects — Part 2: VisualISO/IEC 14496-3, Information technology —Coding of audio-visual objects — Part 3:Audio
ISOIEC 14496-10,Information technology —Coding of audio-visual objects — Part 10: Advanced VideoCoding
ISO/IEC 14496-12, Information technology — Coding of audio-visual objects — Part 12: ISO base media fleformat
ISO/IEC 14496-20,Information technology — Coding of audio-visual objects – Part 20: LightweightApplication Scene Representation (LASeR) and Simple Aggregation Format(SAF)
ISO/IEC 15948:2004,Information technology – Computer graphics and image processing — PortableNetwork Graphics (PNG): Functional specification
3GPP TS 26.071,Mandatory speech CODEc speech processing functions; AMR speech Codec; Generaldescription
TIA/EIA/IS-127,Enhanced Variable Rate Codec(EVRC)
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
3.1
baseline
line between origins of the respective cameras
3.2
convergence distance
distance between a convergence point and a midpoint of baseline
3.3
convergence point
point at which two optical axes of left and right cameras intersect
3.4
disparity
horizontal difference between corresponding points in stereoscopic view
3.5
focal length
distance from a surface of a lens (optical center) or mirror to its focal point (image plane)
3.6
frame
one of the many still images which compose the complete moving picture
NOTE A frame contains an array of luma samples and two corresponding arrays of chroma samples. A frame consists of two fields: a top field and a bottom field.
3.7
lenticular
array of magnifying lenses designed so that, when viewed from slightly different angles, different images are magnified
NOTE A lenticular sheet is placed on a normal display panel to show two or more different views simply by changing the angle of light direction. It can make left and right views display on left and right eyes, respectively, creating a sense of depth.
3.8
max of disparity
maximum disparity value within a stereoscopic fragment
3.9
monoscopic fragment
set of successive samples which represents only monoscopic sequence
3.10
min of disparity
minimum disparity value within the stereoscopic fragment
4 Abbreviated terms
3D Three Dimensional
AAC Advanced Audio Coding
AF Application Format
AMR Adaptive Multirate
AVC Advanced Video Coding
CDMA Code Division Multiple Access
EVRC Enhanced Variable Rate Codec
GSM Global Systems for Mobile communications
HE-AAC High Efficiency AAC
JPEG Joint Photographic Experts Group
LASeR Lightweight Application Scene Representation
PNG Portable Network Graphics
PMP Portable Multimedia Player
UMTS Universal Mobile Telecommunications System
5 Overview
5.1 Overall procedure of stereoscopic contents
The overall procedure for stereoscopic contents can be explained as follows. Both left and right view sequences are acquired from a stereoscopic camera for stereoscopic video sequences, and are composited into a video sequence or two video sequences according to the composition types specified in 5.3.
This composited video sequence is encoded and then stored into an AF. A file generator for Stereoscopic Video AF is to accept the stereoscopic contents with video, audio and LASeR streams. The file satisfying the Stereoscopic Video AF is parsed, decoded and then rendered for a stereoscopic display device.
5.2 Acquisition of the stereoscopic contents Stereoscopic video sequences are acquired from two cameras, left and right view. As described in Figure 1, camera parameters shall be needed for specifying spatial relationship between two cameras. The stereoscopic contents can be rendered in the display device more precisely by using these camera parameters. The camera parameters shall be described in the ‘ scdi’ box, which will be specified in 8.5.

Leave a Reply

Your email address will not be published. Required fields are marked *