首页> 外文学位 >Multiplexing/de-multiplexing Dirac video with AAC audio bit stream.
【24h】

Multiplexing/de-multiplexing Dirac video with AAC audio bit stream.

机译:将Dirac视频与AAC音频比特流复用/解复用。

获取原文
获取原文并翻译 | 示例

摘要

With the inception of High Definition Television (HDTV) for broadcasting digital multimedia, enormous demand for video streaming over internet and Internet Protocol Television (IPTV) applications, the choice of a good compression scheme is vital. A good compression scheme assists in exploiting the limited storage capacity and efficient use of bandwidth required for broadcasting.;Dirac [31] is a state-of-the-art video codec aimed at applications from HDTV to web streaming [1]. Dirac was developed by the British Broadcasting Corporation (BBC) and is an open technology which does not involve any licensing fees. Studies have shown that the performance of Dirac compares well to the H.264 video codec [3]. At low bitrates, the quality of video deteriorates due to distortion for the Dirac video codec, while H.264 outperforms [2]. Performance of Dirac for HD media is similar to H.264, due to absence of large and intolerable variations between the two codecs [2]. Hence, Dirac is chosen as the video codec in this thesis. The right choice of audio codec is also necessary. Advanced Audio Coding (AAC) [4] is one of the audio digital codec standards defined in Moving Picture Experts Group (MPEG-2) and MPEG-4 with a few modifications [4]. The audio sampling frequency ranges from 8 kHz--96 kHz [5]. The performance of AAC is superior at bitrates greater than 64 Kbps and also at lower bitrates (16 Kbps), and hence it is adopted in this thesis [22]. The raw video and audio data is encoded using the Dirac video and the AAC audio codec respectively. The video and audio bit-streams obtained need to be multiplexed as a single stream in order to be transmitted over the network. The objective of this thesis is to multiplex the video and audio bit-streams for transmission, de-multiplex audio and video bit-streams at the receiver's end while maintaining lip synchronization during the playback. The MPEG-2 [19] system is adopted in this thesis to achieve the multiplexing process. The bit-streams of audio and video correspond to the respective frame data. This data is packetized as Packetized Elementary Streams (PES) which is of variable lengths. This is further packetized as Transport Stream (TS) packets of fixed length and 188 bytes long [9]. The fixed size packet length facilitates the transmission process. The timestamp information is encapsulated into the PES header in the form of frame numbers which help in achieving lip synchronization during playback. The presentation time of video and audio is used as a reference in multiplexing the audio and the video TS packets which aid in ensuring the buffer fullness (i.e. prevents buffer overflow or underflow) at the de-multiplexer end. Sequence Parameter Sets (SPS) and Picture Parameter Sets (PPS) present in the video bit-stream are also transmitted in the form of packets to assist the decoder in decoding the video data. The header information included helps in a faster and efficient demultiplexing process. The algorithm for multiplexing and de-multiplexing was implemented while maintaining lip sync during playback. Advanced Television Systems Committee - Mobile/ Handheld (ATSC - M/H) has an allocated bandwidth requirement of 19.6 Mbps [13], whereas the transport stream bitrates obtained using the multiplexing algorithm implemented for the inter coding of sequences used are 102.13 kbps and 96.72 kbps which can be easily and efficiently accommodated. Encoding video using Dirac and audio based on AAC, multiplexing the two coded bit-streams, packetization, de-multiplexing the two coded bit-streams, decoding the video (Dirac) and audio (AAC) while maintaining the lip sync are the highlights of this thesis. Advantages and limitations of the method proposed are discussed in detail.
机译:随着用于广播数字多媒体的高清晰度电视(HDTV)的出现,对互联网和Internet协议电视(IPTV)应用程序上视频流的巨大需求,选择良好的压缩方案至关重要。良好的压缩方案有助于利用广播所需的有限存储容量和带宽的有效利用。; Dirac [31]是针对从HDTV到Web流[1]的应用程序的最新视频编解码器。狄拉克(Dirac)由英国广播公司(BBC)开发,是一种开放技术,不涉及任何许可费用。研究表明,Dirac的性能与H.264视频编解码器相当[3]。在低比特率下,由于Dirac视频编解码器的失真,视频质量会下降,而H.264的性能优于[2]。由于两个编解码器之间没有较大且无法忍受的变化,因此Dirac在HD媒体上的性能类似于H.264 [2]。因此,本文选择Dirac作为视频编解码器。正确选择音频编解码器也是必要的。高级音频编码(AAC)[4]是运动图像专家组(MPEG-2)和MPEG-4中定义的音频数字编解码器标准之一,并做了一些修改[4]。音频采样频率范围为8 kHz--96 kHz [5]。在大于64 Kbps的比特率和较低的比特率(16 Kbps)上,AAC的性能都很高,因此在本文中被采用[22]。原始视频和音频数据分别使用Dirac视频和AAC音频编解码器进行编码。为了通过网络传输,需要将获得的视频和音频比特流复用为单个流。本文的目的是对视频和音频比特流进行多路传输,在接收端解复用音频和视频比特流,同时在播放过程中保持口型同步。本文采用MPEG-2 [19]系统来实现复用过程。音频和视频的比特流分别对应于帧数据。该数据被打包为长度可变的打包基本流(PES)。这被进一步打包为固定长度和188字节长的传输流(TS)数据包[9]。固定大小的数据包长度有助于传输过程。时间戳信息以帧号的形式封装到PES报头中,有助于在回放过程中实现口型同步。视频和音频的显示时间在多路复用音频和视频TS数据包时用作参考,有助于确保多路分解器端的缓冲区充满(即,防止缓冲区溢出或下溢)。视频比特流中存在的序列参数集(SPS)和图片参数集(PPS)也以数据包的形式发送,以帮助解码器解码视频数据。包含的标头信息有助于更快,更有效的多路分解过程。实现了复用和解复用的算法,同时在播放过程中保持了口形同步。高级电视系统委员会-移动/手持(ATSC-M / H)分配的带宽要求为19.6 Mbps [13],而使用为帧间编码实现的复用算法获得的传输流比特率分别为102.13 kbps和96.72。可以轻松有效地容纳的kbps。使用Dirac和基于AAC的音频对视频进行编码,对两个编码的比特流进行复用,打包,对两个编码的比特流进行解复用,对视频(Dirac)和音频(AAC)进行解码,同时保持口型同步是这些产品的亮点。这个论文。详细讨论了所提出方法的优点和局限性。

著录项

  • 作者

    Urs, Ashwini.;

  • 作者单位

    The University of Texas at Arlington.;

  • 授予单位 The University of Texas at Arlington.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 M.S.
  • 年度 2011
  • 页码 65 p.
  • 总页数 65
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号