Identifying the Authors' National Variety of English in Social Media Texts

机译：在社交媒体文本中识别作者的民族英语多样性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a study for the identification of authors' national variety of English in texts from social media. In data from Facebook and Twitter, information about the author's social profile is annotated, and the national English variety (US, UK, AUS, CAN, NNS) that each author uses is attributed. We tested four feature types: formal linguistic features, POS features, lexicon-based features related to the different varieties, and data-based features from each English variety. We used various machine learning algorithms for the classification experiments, and we implemented a feature selection process. The classification accuracy achieved, when the 31 highest ranked features were used, was up to 77.32%. The experimental results are evaluated, and the efficacy of the ranked features discussed.

机译：在本文中，我们提出了一项研究，旨在从社交媒体中识别作者的国家英语多样性。在来自Facebook和Twitter的数据中，注释了有关作者的社交资料的信息，并注明了每个作者使用的国家英语版本（美国，英国，AUS，CAN，NNS）。我们测试了四种特征类型：正式的语言特征，POS特征，与不同品种相关的基于词典的特征以及每种英语品种中基于数据的特征。我们将各种机器学习算法用于分类实验，并实施了特征选择过程。当使用排名最高的31个特征时，实现的分类精度高达77.32％。评估实验结果，并讨论排名功能的功效。

著录项

来源
《International conference on recent advances in natural language processing》|2017年|671-678|共8页
会议地点
作者
Vasiliki Simaki; Panagiotis Simakis; Carita Paradis; Andreas Kerren;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Experimenting Language Identification for Sentiment Analysis of English Punjabi Code Mixed Social Media Text [J] . International Journal of E-Adoption . 2020,第1期

机译：英文旁遮普语代码混合社交媒体文本的情感分析实验语言识别
2. Text Normalization in Social Media: Progress, Problems and Applications for a Pre-Processing System of Casual English [J] . Eleanor Clark, Kenji Araki Procedia - Social and Behavioral Sciences . 2011,第2期

机译：社交媒体中的文本规范化：休闲英语预处理系统的进展，问题和应用
3. Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts [J] . Helena Gómez-Adorno, Ilia Markov, Grigori Sidorov, Computational intelligence and neuroscience . 2016,第Pta4期

机译：基于神经网络的改进特征表示，在社交媒体文本中作者分析
4. Identifying the Authors' National Variety of English in Social Media Texts [C] . Vasiliki Simaki, Panagiotis Simakis, Carita Paradis, International conference on recent advances in natural language processing . 2017

机译：在社交媒体文本中识别作者的国家各种英语
5. Varieties of Business English: A linguistic analysis of written texts. [D] . Morrow, Phillip Robert. 1989

机译：商务英语的种类：对书面文本的语言分析。
6. Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts [O] . Helena Gómez-Adorno, Ilia Markov, Grigori Sidorov, 2016

机译：基于神经网络的社交媒体文本作者特征分析的特征表示改进
7. Realization of Authorial Ideations amp; Purposes Through Transitivity Analysis of Media Texts Written by Native and Non-Native Writers of English: A SFG Perspective [O] . Looloo Mohammed Alraimi, Abdalla Yassin Abdalla 2019

机译：通过媒体文本的传递分析实现授权观点和目的：SFG视角

Identifying the Authors' National Variety of English in Social Media Texts

摘要

著录项

相似文献

相关主题

期刊订阅