Trouble information extraction based on a bootstrap approach from Twitter

机译：基于从Twitter的引导方法的故障信息提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a method for extracting trouble information from Twitter. One useful approach is based on machine learning techniques such as SVMs. However, trouble information is a fraction of a percent of all tweets on Twitter. In general, imbalanced distribution is not suitable for machine learning techniques to generate a classifier. Another approach is to extract trouble information by using handwritten rules. However, constructing high coverage rules by handwork is costly. First, we verify these problems in a preliminary experiment. Then, to solve these problems, we apply a bootstrapping method to our trouble information extraction task. We introduce three characteristics and a scoring method to the bootstrapping. As a result, the iteration process on the bootstrapping increased the number of tweets and patterns for trouble information dramatically.

机译：在本文中，我们提出了一种从Twitter中提取故障信息的方法。一种有用的方法是基于机器学习技术，如SVM。但是，麻烦信息是Twitter上所有推文百分比的一小部分。通常，不平衡的分布不适合生成分类器的机器学习技术。另一种方法是通过使用手写规则提取故障信息。但是，通过手动构建高覆盖规则是昂贵的。首先，我们在初步实验中验证了这些问题。然后，要解决这些问题，我们将引导方法应用于我们的故障信息提取任务。我们介绍了三个特征和对自动启动的评分方法。因此，对自动启动的迭代过程增加了急剧信息的发布次数和模式的数量。

著录项

来源
《Pacific Asia Conference on Language, Information and Computation》|2015年||共9页
会议地点
作者
Kohei Kurihara; Kazutaka Shimada;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机网络;
关键词
入库时间 2022-08-20 20:06:27

相似文献

外文文献
中文文献
专利

1. TwitterNEED: A hybrid approach for named entity extraction and disambiguation for tweet [J] . MENA B. HABIB, MAURICE VAN KEULEN Natural language engineering . 2016,第pta3期

机译：TwitterNEED：用于命名实体提取和消歧的混合方法
2. A Hybrid Approach for Drug Abuse Events Extraction from Twitter [J] . Ferdaous Jenhani, Mohamed Salah Gouider, Lamjed Ben Said Procedia Computer Science . 2016,第1期

机译：从Twitter提取毒品滥用事件的混合方法
3. Hybrid Approach for Sentiment Analysis of Twitter Posts Using a Dictionary-based Approach and Fuzzy Logic Methods: Study Case on Cloud Service Providers [J] . Alharbi Jamilah Rabeh, Alhalabi Wadee S. International journal on Semantic Web and information systems . 2020,第1期

机译：使用基于字典的方法和模糊逻辑方法的Twitter Posts情感分析的混合方法：云服务提供商研究案例
4. Trouble information extraction based on a bootstrap approach from Twitter [C] . Kohei Kurihara, Kazutaka Shimada Pacific Asia Conference on Language, Information and Computation . 2015

机译：基于Twitter的引导方法的故障信息提取
5. Bootstrapping vehicles: A formal approach to unsupervised sensorimotor learning based on invariance. [D] . Censi, Andrea. 2013

机译：自举车辆：一种基于不变性的无监督感觉运动学习的正式方法。
6. Social Media Mining for Birth Defects Research: A Rule-Based Bootstrapping Approach to Collecting Data for Rare Health-Related Events on Twitter [O] . Ari Z. Klein, Abeed Sarker, Haitao Cai, -1

机译：用于出生缺陷研究的社交媒体挖掘：基于规则的引导方法用于在Twitter上收集与健康相关的罕见事件的数据
7. Social media mining for birth defects research: A rule-based, bootstrapping approach to collecting data for rare health-related events on Twitter [O] . Ari Z. Klein, Abeed Sarker, Haitao Cai, 2018

机译：社交媒体矿业出生缺陷研究：基于规则的，引导自动启动方法，用于在Twitter上收集稀有健康相关事件的数据

Trouble information extraction based on a bootstrap approach from Twitter

摘要

著录项

相似文献

相关主题

期刊订阅