Ranking concrete and abstract words using Google Books Ngram data

Ivanov Vladimir; Solovyev Valery

Abstract

Creating dictionaries of abstract and concrete words is a well-known task. Such dictionaries are important in several applications of text analysis and computational linguistics. Usually, assembling concreteness scores for words begins with a lot of manual work. However, the process can be automated to a significant degree using information from large corpora. In this paper we combine two datasets, a dictionary with concreteness scores for 40,000 English words and the Google Books Ngram dataset, in order to test the following hypothesis: in text, concrete words tend to co-occur with concrete words more than with abstract words (and conversely, abstract words tend to co-occur with abstract words more than with concrete words). Based on this hypothesis, we propose a method for automatically estimating the concreteness scores of words using a small amount of initial markup.
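
The hypothesis in the abstract can be illustrated with a minimal sketch. This is not the authors' method, which the abstract does not specify: the function name `estimate_concreteness`, the count-weighted averaging rule, the 1-to-5 score scale, and the toy bigram counts are all assumptions made for illustration. The idea is that a word with no score inherits the count-weighted mean score of the seed words it appears next to.

```python
from collections import defaultdict


def estimate_concreteness(bigram_counts, seed_scores):
    """Estimate scores for unlabeled words as the count-weighted mean
    of the seed scores of the words they co-occur with in bigrams.

    Hypothetical helper: the averaging rule is an assumption, not the
    procedure from the paper.
    """
    weighted_sum = defaultdict(float)
    total_weight = defaultdict(float)
    for (left, right), count in bigram_counts.items():
        # A bigram links two words; propagate scores in both directions.
        for word, neighbor in ((left, right), (right, left)):
            if word not in seed_scores and neighbor in seed_scores:
                weighted_sum[word] += count * seed_scores[neighbor]
                total_weight[word] += count
    return {w: weighted_sum[w] / total_weight[w] for w in weighted_sum}


# Toy counts invented for illustration; not taken from Google Books Ngram.
bigram_counts = {
    ("wooden", "table"): 8,
    ("wooden", "chair"): 2,
    ("mere", "idea"): 6,
    ("mere", "notion"): 4,
}
# Assumed scale: 1 = most abstract, 5 = most concrete.
seed_scores = {"table": 5.0, "chair": 4.5, "idea": 1.5, "notion": 1.5}

estimates = estimate_concreteness(bigram_counts, seed_scores)
# Rank the unlabeled words from most to least concrete.
ranking = sorted(estimates, key=estimates.get, reverse=True)
```

With these toy inputs, "wooden" is ranked above "mere" because its neighbors are concrete seed words. A real run would need far larger seed sets and bigram counts to say anything about the hypothesis.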

Bibliographic details

Source
Journal of intelligent & fuzzy systems: Applications in Engineering and Technology | 2020, Issue 2 | 9 pages
Authors
Ivanov Vladimir; Solovyev Valery
Author affiliations

Innopolis Univ Fac Comp Sci & Software Engn St Univ Skaya 1 Innopolis 420500 Republic Of Tat Russia;

Kazan Fed Univ Intellectual Technol Text Management Res Lab Linguist Res & Educ Ctr Kazan The Republic Ta Russia;

Indexing information
Original format: PDF
Text language: English
Chinese Library Classification: Automation systems
Keywords
Concreteness of words; bigrams; dictionary

Similar literature

1. Ranking concrete and abstract words using Google Books Ngram data [J]. Ivanov Vladimir, Solovyev Valery. Journal of intelligent & fuzzy systems: Applications in Engineering and Technology, 2020, Issue 2Pta2
2. Google Books Ngram Viewer: visualizing cultural trends [J]. Library hi tech news, 2011, Issue 1
3. On Arabic Abstract and Concrete Words Recall Using Free Recall Paradigms: Is It Abstractness, Concreteness, or Zero Effect [J]. Nasser Saleh Al-Mansour, Yasir Saad Almukhaizeem, Ahmed Mohammed Alduais. Psychology and Behavioral Sciences, 2015, Issue 4
4. Extracting Frame-Like Structures from Google Books NGram Dataset [C]. Vladimir Ivanov. Mexican international conference on artificial intelligence, 2014
5. The representation of multiple translations in bilingual memory: An examination of lexical organization for concrete, abstract, and emotion words in Spanish-English bilinguals [D]. Basnight-Brown, Dana M., 2009
6. Concreteness/abstractness ratings for two-character Chinese words in MELD-SCH [O]. Xu Xu, Jiayin Li, 2020
7. Images of the unseen: extrapolating visual representations for abstract and concrete words in a data-driven computational model [O]. Fritz Günther, Marco Alessandro Petilli, Alessandra Vergallito, 2020
