首页> 外国专利> METHOD FOR FOCUSLY COMPRESSING KOREAN CHARACTER THROUGH 3-5 BIT COMPRESSION FOR BYTE 1, METHOD FOR FOCUSLY COMPRESSING 2-4 BITS FOR BUTE 2, AND DEVICE THEREOF, IN UTF-8 CODE CHARACTER SYSTEM

METHOD FOR FOCUSLY COMPRESSING KOREAN CHARACTER THROUGH 3-5 BIT COMPRESSION FOR BYTE 1, METHOD FOR FOCUSLY COMPRESSING 2-4 BITS FOR BUTE 2, AND DEVICE THEREOF, IN UTF-8 CODE CHARACTER SYSTEM

机译：通过字节1到3-5位压缩来集中压缩韩国字符的方法，针对字节2来压缩2-4位二进制代码的方法及其设备，在UTF-8代码字符系统中

页面导航

摘要
著录项
相似文献

摘要

In case of social network service (SNS) in Korea, a range of Unicode containing Korean is from U+AC00 to U+D7AF, and a byte header of UTF-8 is ″1110″. Since Korean language frequently appears in social media based on Korean language, in the present invention a shorter compression header bit is mapped with ″10″. In this case, there is no benefit for other characters in which a header of BYTE 1 does not appear in a high frequency (of course, there is no loss), but for an area containing Korean characters, the present invention substitutes a header bit with ″10″ in a first top byte to acquire a gain of 2 bits, then compresses additional 1-3 bits in a process of combining remaining 4 bits of byte 1, thereby acquiring gains of overall 3-5 bits. Of course, for a byte 2, the present invention is designed to acquire a gain of 2-4 bits and for byte 3-6 to acquire by 2 bits without fail. In addition, in case of English characters starting with 0, the present invention additionally compresses 1 bit in a blank character to increase a compression efficiency in UTF-8 of Korean characters, which activates a spacing in documents based on Korean characters.;COPYRIGHT KIPO 2018

机译：对于韩国的社交网络服务（SNS），包含韩语的Unicode范围是从U + AC00到U + D7AF，而UTF-8的字节头是＆Prime; 1100-1Prime;。由于朝鲜语频繁出现在基于朝鲜语的社交媒体中，因此在本发明中，较短的压缩头比特被映射为＆Prime; 10＆Prime;。在这种情况下，对于没有高频率出现BYTE 1的标题的其他字符（当然，没有损失）没有好处，但是对于包含朝鲜语字符的区域，本发明代替了标题位与＆Prime; 10＆Prime;在第一个高位字节中获取1位以获取2位的增益，然后在合并字节1的其余4位的过程中压缩其他1-3位，从而获得整体3-5位的增益。当然，对于字节2，本发明被设计成获得2-4位的增益，并且对于字节3-6被设计成无故障地获得2位。另外，在英文字符从0开始的情况下，本发明另外压缩空白字符中的1位以增加韩文字符的UTF-8的压缩效率，这激活了基于韩文字符的文档中的间隔。 2018年

著录项

公开/公告号KR20180007397A

专利类型
公开/公告日2018-01-23

原文格式PDF
申请/专利权人 KIM JEONG HUN;
展开▼

申请/专利号KR20160088318
发明设计人 KIM JEONG HUNKR;
展开▼

申请日2016-07-13
分类号H03M7/30;H03M7/48;
国家 KR
入库时间 2022-08-21 12:41:15

相似文献

专利
外文文献
中文文献