首页> 外文会议>Language and Technology Conference >Hybrid Unsupervised Learning to Uncover Discourse Structure
【24h】

Hybrid Unsupervised Learning to Uncover Discourse Structure

机译:杂交无监督学习揭示话语结构

获取原文

摘要

Data mining allows the exploration of sequences of phenomena, whereas one usually tends to focus on isolated phenomena or on the relation between two phenomena. It offers invaluable tools for theoretical analyses and exploration of the structure of sentences, texts, dialogues, and speech. We report here the results of an attempt at using it for inspecting sequences of verbs from French accounts of road accidents. This analysis comes from an original approach of unsupervised training allowing the discovery of the structure of sequential data. The entries of the analyzer were only made of the verbs appearing in the sentences. It provided a classification of the links between two successive verbs into four distinct clusters, allowing thus text segmentation. We give here an interpretation of these clusters by comparing the statistical distribution of independent semantic annotations.
机译:数据挖掘允许探索现象序列,而一种通常倾向于关注孤立的现象或两种现象之间的关系。它提供了无价的工具,用于理论分析和宣传句子,文本,对话和演讲的结构。我们在此报告了尝试使用它来检查来自法国道路事故的法国账户的动词序列。该分析来自无监督培训的原始方法,允许发现顺序数据的结构。分析仪的条目仅由出现在句子中的动词制成。它提供了两个连续动词与四个不同群集之间的链接的分类,从而允许由此进行文本分段。我们通过比较独立语义注释的统计分布来说,我们在这里解释这些群集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号