首页> 美国政府科技报告 >Deducing Linguistic Structure from the Statistics of Large Corpora.

【24h】

Deducing Linguistic Structure from the Statistics of Large Corpora.

机译：从大型语料库统计推断语言结构。

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Within the last two years, approaches using both stochastic and symbolic techniques have proved adequate to deduce lexical ambiguity resolution rules with less than 3-4% error rate, when trained on moderate sized (500K word) corpora of English text (e.g. Church, 1988; Hindle, 1989). The success of these techniques suggests that much of the grammatical structure of language may be derived automatically through distributional analysis, an approach attempted and abandoned in the 1950s. We describe here two experiments to see how far purely distributional techniques can be pushed to automatically provide both a set of part of speech tags for English, and a grammatical analysis of free English text. We also discuss the state of a tagged NL corpus to aid such research (now amounting to 4 million words of hand-corrected part-of-speech tagging).

著录项

作者
Magerman, D.; Marcus, M.; Santorini, B.;
展开▼
作者单位

展开▼
年度 1990
页码 1-9
总页数 9
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Linguistics; Grammars; Stochastic processes; Resolution; Language; Speech; English language; Lexicography; Labels; Symbols; Ambiguity; Distribution functions;

机译：语言学;语法;随机过程;分辨率;语言;言语;英语;词典;标签;符号;歧义;分布函数;

相似文献

外文文献
中文文献
专利

1. Information theortic models in statistical linguistics--Part II: Word frequencies and hierarchical structure in language--statistical tests [J] . Balasubrahmanyan V. K., Naranan S. Current science . 1992,第06期

机译：统计语言学中的信息理论模型-第二部分：语言中的单词频率和层次结构-统计测试
2. Musical Expertise and Statistical Learning of Musical and Linguistic Structures [J] . Daniele Sch??n, Cl??ment Fran?§ois Frontiers in Psychology . 2011,第4期

机译：音乐专业知识和音乐和语言结构的统计学习
3. Computational linguistics: A new tool for exploring biopolymer structures and statistical mechanics [J] . Ken A. Dill, Adam Lucas, Julia Hockenmaier Polymer: The International Journal for the Science and Technology of Polymers . 2007,第15期

机译：计算语言学：探索生物聚合物结构和统计力学的新工具
4. Deducing linguistic structure from the statistics of large corpora [C] . Brill E., Magerman D., Marcus M., Information Technology, 1990. 'Next Decade in Information Technology', Proceedings of the 5th Jerusalem Conference on (Cat. No.90TH0326-9) . 1990

机译：从大语料统计中推断语言结构
5. Crosslingual implementation of linguistic taggers using parallel corpora. [D] . Safadi, Hani. 2008

机译：使用并行语料库的语言标记器的跨语言实现。
6. Musical Expertise and Statistical Learning of Musical and Linguistic Structures [O] . Daniele Schön, Clément François 2011

机译：音乐专业知识和音乐和语言结构的统计学习
7. Deducing linguistic structure from the statistics of large corpora [O] . Eric Brill, David Magerman, Mitchell Marcus, 1990

机译：从大型语料库统计中推导出语言结构

Deducing Linguistic Structure from the Statistics of Large Corpora.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅