Measurements of Lexico-Syntactic Cohesion by Means of Internet

Abstract

Syntactic links between content words in meaningful texts are intuitively perceived as 'normal,' which ensures text cohesion. Nevertheless, we are not aware of a broadly accepted Internet-based measure of cohesion between words that are syntactically linked in terms of Dependency Grammars. We propose to measure lexico-syntactic cohesion between content words by means of the Internet, using a specially introduced Stable Connection Index (SCI). SCI is similar to Mutual Information, known in statistics, but it does not require iterative evaluation of the total number of Web pages under a search engine's control, and it is insensitive to both fluctuations and slow growth of raw Web statistics. On Russian, Spanish, and English materials, SCI showed concentrated distributions for various types of word combinations; hence lexico-syntactic cohesion acquires a simple numeric measure. It is shown that SCI evaluations can be successfully used for semantic error detection and correction, as well as for information retrieval.
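
The contrast the abstract draws with Mutual Information can be illustrated with a minimal sketch. The abstract does not state the SCI formula, so the second function below is a hypothetical stand-in, not the authors' definition: it scores a word pair from page counts alone by dividing the co-occurrence count by the geometric mean of the single-word counts. The function names and all page counts are made up for illustration.

```python
import math


def pmi(n_xy: int, n_x: int, n_y: int, n_total: int) -> float:
    """Pointwise mutual information estimated from page counts.

    Needs n_total, the total number of pages the search engine indexes.
    """
    if min(n_xy, n_x, n_y, n_total) <= 0:
        raise ValueError("all counts must be positive")
    return math.log2((n_xy * n_total) / (n_x * n_y))


def total_free_score(n_xy: int, n_x: int, n_y: int) -> float:
    """Hypothetical stand-in (NOT the paper's SCI): no n_total required.

    Uses the co-occurrence count over the geometric mean of the
    single-word counts, so a uniform scaling of all counts cancels out.
    """
    if min(n_xy, n_x, n_y) <= 0:
        raise ValueError("all counts must be positive")
    return math.log2(n_xy / math.sqrt(n_x * n_y))


# Made-up page counts for one word pair, before and after the index doubles.
before = total_free_score(200, 4000, 1000)
after = total_free_score(400, 8000, 2000)
print(math.isclose(before, after))
```

In this sketch the score is unchanged when every count grows by the same factor, which is one way a measure could be insensitive to slow growth of raw Web statistics. Whether SCI achieves this in the same way cannot be confirmed from the abstract alone.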
