【24h】

Tibetan Text Classification Algorithm Based on Syllables

机译:基于音节的藏文文本分类算法

获取原文

摘要

Tibetan text classification is one of the core technologies in the field of Tibetan information processing. With the rapid development of the Internet, a large amount of Tibetan Internet text data will be generated every day. Text classification technology can quickly and accurately obtain the required information to solve the problem of out-of-order in text. Tibetan syllables are the basic components of Tibetan text, and each syllable in Tibetan is divided by syllable nodes. This paper proposes a Tibetan syllable as a text representation feature, and uses deep neural network models such as CNN, BiL STM and RCNN to classify Tibetan text. Experiments show that this method has achieved prefect results in different depth neural network classification models.
机译:藏文文本分类是藏文信息处理领域的核心技术之一。随着互联网的迅猛发展,每天都会产生大量的藏文互联网文本数据。文本分类技术可以快速,准确地获得所需的信息,以解决文本混乱的问题。藏文音节是藏文文本的基本组成部分,藏文中每个音节都由音节节点划分。本文提出了一个藏语音节作为文本表示特征,并使用诸如CNN,BiL STM和RCNN之类的深度神经网络模型对藏语文本进行分类。实验表明,该方法在不同深度的神经网络分类模型中均取得了理想的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号