Trouble information extraction based on a bootstrap approach from Twitter

机译：基于Twitter的引导方法的故障信息提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a method for extracting trouble information from Twitter. One useful approach is based on machine learning techniques such as SVMs. However, trouble information is a fraction of a percent of all tweets on Twitter. In general, imbalanced distribution is not suitable for machine learning techniques to generate a classifier. Another approach is to extract trouble information by using handwritten rules. However, constructing high coverage rules by handwork is costly. First, we verify these problems in a preliminary experiment. Then, to solve these problems, we apply a bootstrapping method to our trouble information extraction task. We introduce three characteristics and a scoring method to the bootstrapping. As a result, the iteration process on the bootstrapping increased the number of tweets and patterns for trouble information dramatically.

机译：在本文中，我们提出了一种从Twitter提取故障信息的方法。一种有用的方法是基于诸如SVM的机器学习技术。但是，故障信息仅占Twitter所有推文的百分之一。通常，不平衡分布不适合于机器学习技术来生成分类器。另一种方法是通过使用手写规则提取故障信息。但是，通过手工构建高覆盖率规则的成本很高。首先，我们在初步实验中验证了这些问题。然后，为了解决这些问题，我们将自举方法应用于故障信息提取任务。我们向引导程序介绍了三个特征和一种计分方法。结果，引导程序上的迭代过程大大增加了有关故障信息的推文和模式的数量。

著录项

来源
《Pacific Asia Conference on Language, Information and Computation》|2015年|471-479|共9页
会议地点
作者
Kohei Kurihara; Kazutaka Shimada;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. TwitterNEED: A hybrid approach for named entity extraction and disambiguation for tweet [J] . MENA B. HABIB, MAURICE VAN KEULEN Natural language engineering . 2016,第pta3期

机译：TwitterNEED：用于命名实体提取和消歧的混合方法
2. A Hybrid Approach for Drug Abuse Events Extraction from Twitter [J] . Ferdaous Jenhani, Mohamed Salah Gouider, Lamjed Ben Said Procedia Computer Science . 2016,第1期

机译：从Twitter提取毒品滥用事件的混合方法
3. Hybrid Approach for Sentiment Analysis of Twitter Posts Using a Dictionary-based Approach and Fuzzy Logic Methods: Study Case on Cloud Service Providers [J] . Alharbi Jamilah Rabeh, Alhalabi Wadee S. International journal on Semantic Web and information systems . 2020,第1期

机译：使用基于字典的方法和模糊逻辑方法的Twitter Posts情感分析的混合方法：云服务提供商研究案例
4. Trouble information extraction based on a bootstrap approach from Twitter [C] . Kohei Kurihara, Kazutaka Shimada Pacific Asia Conference on Language, Information and Computation . 2015

机译：基于从Twitter的引导方法的故障信息提取
5. Bootstrapping vehicles: A formal approach to unsupervised sensorimotor learning based on invariance. [D] . Censi, Andrea. 2013

机译：自举车辆：一种基于不变性的无监督感觉运动学习的正式方法。
6. Social Media Mining for Birth Defects Research: A Rule-Based Bootstrapping Approach to Collecting Data for Rare Health-Related Events on Twitter [O] . Ari Z. Klein, Abeed Sarker, Haitao Cai, -1

机译：用于出生缺陷研究的社交媒体挖掘：基于规则的引导方法用于在Twitter上收集与健康相关的罕见事件的数据
7. Social media mining for birth defects research: A rule-based, bootstrapping approach to collecting data for rare health-related events on Twitter [O] . Ari Z. Klein, Abeed Sarker, Haitao Cai, 2018

机译：社交媒体矿业出生缺陷研究：基于规则的，引导自动启动方法，用于在Twitter上收集稀有健康相关事件的数据

Trouble information extraction based on a bootstrap approach from Twitter

摘要

著录项

相似文献

相关主题

期刊订阅