首页> 外文会议>Future Technologies Conference >Use of Interpretable Evolved Search Query Classifiers for Sinhala Documents
【24h】

Use of Interpretable Evolved Search Query Classifiers for Sinhala Documents

机译:对僧伽罗文献使用可解释的进化搜索查询分类器

获取原文
获取外文期刊封面目录资料

摘要

Document analysis is a well matured yet still active research field, partly as a result of the intricate nature of building computational tools but also due to the inherent problems arising from the variety and complexity of human languages. Breaking down language barriers is vital in enabling access to a number of recent technologies. This paper investigates the application of document classification methods to new Sinhalese datasets. This language is geographically isolated and rich with many of its own unique features. We will examine the interpretability of the classification models with a particular focus on the use of evolved Lucene search queries generated using a Genetic Algorithm (GA) as a method of document classification. We will compare the accuracy and interpretability of these search queries with other popular classifiers. The results are promising and are roughly in line with previous work on English language datasets.
机译:文档分析是一个成熟但仍然活跃的研究领域,部分原因是构建计算工具的复杂性,但也由于人类语言的多样性和复杂性带来的固有问题。打破语言障碍对于获得一些最新技术至关重要。本文研究了文档分类方法在新僧伽罗语数据集上的应用。这种语言在地理上是孤立的,有许多独特的特点。我们将研究分类模型的可解释性,特别关注使用遗传算法(GA)生成的进化Lucene搜索查询作为文档分类方法。我们将比较这些搜索查询与其他流行分类器的准确性和可解释性。研究结果令人鼓舞,与之前对英语语言数据集的研究大致一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号