【24h】

On the Difficulty of Clustering Company Tweets

机译:关于集群公司推文的难点

获取原文
获取原文并翻译 | 示例

摘要

Twitter is a new successful technology of the Web 2.0 genre which is used by millions of people and companies to publish brief messages ("tweets") with the purpose of sharing experiences and/or opinions about a product or service. Due to the huge amount of information available in this type of technology, there is a clear need for new systems that can mine these messages in order to derive information about the collective thinking of twitterers (e.g. for opinion or sentiment analysis). Tweet analysis is a very important task because comments, opinions, suggestions, complaints can be used as marketing strategies or for determining information on a company's reputation. For this purpose, it is necessary to establish whether a tweet refers to a company or not, which is not a straightforward keyword search process as there may be multiple contexts in which a name can be used. The aim of this work is to present and compare a number of different approaches based on clustering that determine whether a given tweet refers to a particular company or not. For this purpose, we have used an enriching methodology in order to improve the representation of tweets and as a consequence the performance of the clustering company tweets task. The obtained results are promising and highlight the difficulty of this task.
机译:Twitter是Web 2.0风格的一项成功的新技术,数以百万计的人和公司使用Twitter来发布简短的消息(“ tweets”),目的是共享有关产品或服务的经验和/或意见。由于此类技术中可用的大量信息,因此显然需要可以挖掘这些消息以获取有关Twitterers集体思维的信息(例如用于意见或情感分析)的新系统。推文分析是一项非常重要的任务,因为评论,意见,建议,投诉可以用作营销策略或用于确定有关公司声誉的信息。为此,有必要确定一条推文是否指向公司,这不是一个简单的关键字搜索过程,因为可能存在多个可以使用名称的上下文。这项工作的目的是提出和比较基于聚类的多种不同方法,这些方法可确定给定推文是否指向特定公司。为此,我们使用了一种丰富的方法来改进推文的表示形式,从而改善集群公司推文任务的性能。获得的结果是有希望的,并突出了此任务的难度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号