首页> 外国专利> REAL-TIME LOSSLESS COMPRESSION METHOD OF BINARY DATA ENCODED IN GENERAL UTF-8 TYPE

REAL-TIME LOSSLESS COMPRESSION METHOD OF BINARY DATA ENCODED IN GENERAL UTF-8 TYPE

机译:通用UTF-8类型的二进制数据实时无损压缩方法

摘要

In the present invention, provided is a universal compression method regarding to a UTF-8 encoded text. A UTF-8 code is invented by Ken Thompson and Rob Pike, wherein a UTF-8 is one of variable length character encoding schemes for Unicode. The UTF-8 is an abbreviation of universal coded character set + transformation format 8-bit, and is originally proposed with a name of a file system safe UCS/Unicode transformation format (FSS-UTF). The UTF-8 encoding is used with 1 to 4 bytes in order to represent one Unicode character. The UTF-8 is defined by other methods in various standard documents, but a general structure thereof is the same. Bits indicating a Unicode code point are divided into several parts to be included in lower bits of the bites represented by the UTF-8. The character up to U+007F are displayed in the same manner as 7 bits ASCII characters, and the subsequent characters are displayed by a bit pattern up to 4 bytes as follows. The most significant bit of all bytes is 1 not to be confused with the 7 bit ASCII characters. As a result, a high compression efficiency is exhibited in the case of a country in which a native language takes overwhelmingly great importance in communications such as Korea, Japan, China, etc., which are non-English-speaking countries in a multi-lingual system, and compression is not performed even in an English-speaking country so data dose not increase.;COPYRIGHT KIPO 2018
机译:在本发明中,提供了一种关于UTF-8编码文本的通用压缩方法。 Ken Thompson和Rob Pike发明了UTF-8代码,其中UTF-8是Unicode的可变长度字符编码方案之一。 UTF-8是8位通用编码字符集+转换格式的缩写,最初是用文件系统安全UCS / Unicode转换格式(FSS-UTF)的名称提出的。 UTF-8编码使用1到4个字节,以表示一个Unicode字符。 UTF-8在各种标准文档中通过其他方法定义,但是其通用结构相同。指示Unicode代码点的位被分为几部分,以包含在UTF-8表示的位的低位中。直到U + 007F为止的字符都以与7位ASCII字符相同的方式显示,随后的字符以最多4个字节的位模式显示,如下所示。所有字节的最高有效位是1,请勿与7位ASCII字符混淆。结果,在以韩语,日本,中国等为母语的国家/地区中,英语是多语言国家/地区的非英语国家/地区,在这种语言中,母语极为重要的国家/地区表现出很高的压缩效率。语言系统,即使在英语国家也不进行压缩,因此数据不会增加。; COPYRIGHT KIPO 2018

著录项

  • 公开/公告号KR20180004410A

    专利类型

  • 公开/公告日2018-01-12

    原文格式PDF

  • 申请/专利权人 KIM JEONG HUN;

    申请/专利号KR20160083902

  • 发明设计人 KIM JEONG HUNKR;

    申请日2016-07-04

  • 分类号H03M7/30;

  • 国家 KR

  • 入库时间 2022-08-21 12:41:18

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号