Conference: International Conference on Cloud Computing and Big Data Analytics
An Evaluation Set for Tibetan Sentences Similarity Computing

Abstract

The sentence is not only the basic unit of natural language but also a core object of study in natural language processing (NLP). Sentence similarity computing is the basis of text similarity computing, so a sentence similarity evaluation set is an essential data set for research on similarity techniques. Only with an appropriate evaluation set can the strengths and weaknesses of similarity computing methods be evaluated objectively. To objectively evaluate the performance of Tibetan sentence similarity computing, this paper designs a construction scheme for a Tibetan sentence similarity evaluation set and builds the evaluation set TSS_320. The scheme is based on an analysis of how English and Chinese sentence similarity evaluation sets are constructed, combined with the characteristics of Tibetan sentences. The validity of the evaluation set is verified by statistical methods.
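As an illustration of how an evaluation set of this kind is commonly used, the sketch below scores a similarity method by the Pearson correlation between its outputs and human similarity ratings. This is an assumption about typical practice, not the procedure described in the paper: the abstract does not say which statistical methods were used, and the function name `pearson` and all scores shown are hypothetical.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least two items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance and variances, left unnormalized because the factors cancel.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)

# Hypothetical data: human ratings and system scores for five sentence pairs.
# These numbers are made up for illustration and do not come from TSS_320.
human_scores = [0.0, 1.0, 2.0, 3.0, 4.0]
system_scores = [0.1, 0.3, 0.5, 0.7, 0.9]

r = pearson(human_scores, system_scores)
print(f"Pearson r = {r:.3f}")
```

Under this assumed setup, a correlation close to 1 would indicate that the method ranks and spaces sentence pairs much as the human annotators did. Rank-based measures such as Spearman correlation are a common alternative when only the ordering matters.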
