首页> 外文会议>International conference on artificial intelligence >Segmentation of Natural Language Documents using Term Distance as Discourse Coherency Measure
【24h】

Segmentation of Natural Language Documents using Term Distance as Discourse Coherency Measure

机译:使用术语距离作为语篇连贯性度量的自然语言文档分割

获取原文

摘要

Machine learning (ML) methods are used to extract new knowledge from existing datasets. Ensemble methods (EM) were introduced to improve the ML performance. While EM's offer performance improvements, they limit the amount of control and general understanding of how the final result was achieved. We are researching the possibility to verify ML results with formalization of research results accessible in natural language. We propose to use the extracted information from scientific papers for the verification of ML results. In order to be able to extract the information, relevant to the ML results, an approach to extract specific information rich segments from scientific papers is needed. We present an approach to automated selection of relevant segments from large repositories.
机译:机器学习(ML)方法用于从现有数据集中提取新知识。引入了集成方法(EM)以改善ML性能。尽管EM提供了性能改进,但它们限制了控制量以及对最终结果实现方式的一般理解。我们正在研究以自然语言可访问的研究结果形式化来验证ML结果的可能性。我们建议使用从科学论文中提取的信息来验证机器学习结果。为了能够提取与ML结果相关的信息,需要一种从科学论文中提取特定信息丰富片段的方法。我们提出了一种从大型存储库中自动选择相关细分受众群的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号