Avoiding the Drunkard's search: Investigating collection strategies for building a Twitter dataset

机译：避免醉汉的搜索：研究用于构建Twitter数据集的收集策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate methods for collecting data to form an archive on the debate within Twitter surrounding the UK's inclusion in the EU. We use three strategies, gathering data using hashtags, extracting data from the random stream and collecting from users known to be discussing the debate. We explore the various bias in the resulting datasets.

机译：我们研究了收集数据的方法，以形成有关Twitter围绕英国被纳入欧盟的辩论的存档。我们使用三种策略：使用主题标签收集数据，从随机流中提取数据以及从已知正在讨论辩论的用户收集数据。我们探索了所得数据集中的各种偏差。

著录项

来源
《ACM/IEEE-CS Joint Conference on Digital Libraries》|2016年|205-206|共2页
会议地点 Newark NJ(US)
作者
Clare Llewellyn; Laura Cram; Adrian Favero;
展开▼
作者单位

University of Edinburgh 2F2 Buccleuch Place Edinburgh UK;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Twitter; Tagging; Time-frequency analysis; Data mining; Media; Data collection; Buildings;

机译：推特;标记；时频分析；数据挖掘;媒体;数据采集;建筑物;

相似文献

外文文献
中文文献
专利

1. Parallel Hybrid BBO Search Method for Twitter Sentiment Analysis of Large Scale Datasets Using MapReduce [J] . Ashish Kumar Tripathi, Kapil Sharma, Manju Bala International journal of information security and privacy . 2019,第3期

机译：使用MapReduce的Twitter情感分析的并行混合BBO搜索方法
2. Building a National Neighborhood Dataset From Geotagged Twitter Data for Indicators of Happiness, Diet, and Physical Activity [J] . Quynh C Nguyen, Dapeng Li, Hsien-Wen Meng, JMIR public health and surveillance. . 2016,第2期

机译：从地理标记的Twitter数据构建全国邻里数据集，以获取幸福感，饮食和身体活动的指标
3. A software architecture for Twitter collection, search and geolocation services [J] . M. Oussalah, F. Bhat, K. Challis, Knowledge-Based Systems . 2013,第JANa期

机译：Twitter收集，搜索和地理位置服务的软件架构
4. Avoiding the Drunkard's search: Investigating collection strategies for building a Twitter dataset [C] . Clare Llewellyn, Laura Cram, Adrian Favero ACM/IEEE-CS Joint Conference on Digital Libraries . 2016

机译：避免Drunkard的搜索：调查建立Twitter数据集的收集策略
5. Building Resilience or Building Fragility? Understanding Disaster Resilience Patterns in Guatemala through the Analysis of Disaster Datasets in Connection with Population and Housing Data =BUILDING RESILIENCE OR BUILDING FRAGILITY? UNDERSTANDING DISASTER [D] . García Mejía, Sergio Arnoldo. 2021

机译：建立弹性或建筑脆弱性？通过分析与人口和住房数据相关的灾难数据集，了解危地马拉的灾难恢复模式=建立弹性或建筑脆性？了解灾难
6. A curated collection of transcriptome datasets to investigate the molecular mechanisms of immunoglobulin E-mediated atopic diseases [O] . Susie S Y Huang, Fatima Al Ali, Sabri Boughorbel, 2019

机译：精选的转录组数据集以研究免疫球蛋白E介导的特应性疾病的分子机制
7. Avoiding the Drunkard's Search: Investigating Collection Strategies for Building a Twitter Dataset [O] . Llewellyn, Clare, Cram, Laura, Favero, Adrian 2016

机译：避免Drunkard的搜索：调查构建Twitter数据集的收集策略

Avoiding the Drunkard's search: Investigating collection strategies for building a Twitter dataset

摘要

著录项

相似文献

相关主题

期刊订阅