首页> 外文会议>International conference on recent advances in natural language processing >Text Classification into Abstract Classes Based on Discourse Structure
【24h】

Text Classification into Abstract Classes Based on Discourse Structure

机译:基于话语结构的文本分类为抽象类

获取原文

摘要

The problem of classifying text with respect to belonging to a document or a meta-document is formulated and its application areas are proposed. An algorithm is proposed for document classification tasks where counts of words is insufficient do differentiate between such abstract classes of text as metalanguage and object-level. We extend the parse tree kernel method from the level of individual sentences towards the level of paragraphs, based on anaphora, rhetoric structure relations and communicative actions linking phrases in different sentences. Tree kernel learning technique is applied to these extended trees to leverage of additional discourse-related information. We evaluate our approach in the domain of action-plan documents.
机译:提出了关于属于文档或元文档的文本分类问题,并提出了其应用领域。提出了一种用于文档分类任务的算法,其中单词数不足以区分诸如元语言和对象级之类的抽象文本。我们基于回指,修辞结构关系和链接不同句子中的短语的交际动作,将分析树核方法从单个句子的层次扩展到段落的层次。树核学习技术已应用于这些扩展树,以利用其他与语篇相关的信息。我们在行动计划文件领域评估我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号