Blog Mining for the Fortune 500

Abstract

In recent years, the number of users maintaining blogs on the Internet has increased tremendously. Companies in particular have become aware of this medium of communication and take a keen interest in what is being said about them in such personal blogs. This has given rise to a new field of research directed at mining useful information from the large amount of unformatted data present in online blogs and forums. We discuss an implementation of such a blog mining application. The application is broadly divided into two parts: the indexing process and the search module. Blogs pertaining to different organizations are fetched from a particular blog domain on the Internet. After their textual content is analyzed, the blogs are assigned a sentiment rating. Specific data from these blogs, along with their sentiment ratings, are then indexed on the hard drive. At run time, the search module searches these indexes for the input organization name and produces a list of blogs conveying positive and negative sentiments about the organization.
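The two-part design described above (an indexing process that rates and stores posts, and a search module that looks them up by organization name) can be illustrated with a minimal sketch. The abstract does not say how sentiment is rated or how the index is laid out, so everything here is an assumption made for illustration: the tiny word lexicons, the `BlogPost` type, the function names, the single-word organization names, and the in-memory dictionary standing in for the on-disk index.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical toy lexicons; the abstract does not say how sentiment is rated.
POSITIVE = {"great", "love", "excellent", "reliable"}
NEGATIVE = {"terrible", "hate", "broken", "slow"}


@dataclass
class BlogPost:
    url: str
    text: str


def tokenize(text):
    """Lowercase the text and strip surrounding punctuation from each word."""
    return [w.strip(".,!?;:\"'()").lower() for w in text.split()]


def sentiment_rating(text):
    """Count positive words minus negative words (> 0 positive, < 0 negative)."""
    tokens = tokenize(text)
    return sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)


def build_index(posts, organizations):
    """Indexing step: map each organization name to (url, rating) pairs.

    Assumes single-word organization names and keeps the index in memory;
    the application in the abstract stores its indexes on disk.
    """
    index = defaultdict(list)
    for post in posts:
        tokens = set(tokenize(post.text))
        rating = sentiment_rating(post.text)
        for org in organizations:
            if org.lower() in tokens:
                index[org.lower()].append((post.url, rating))
    return index


def search(index, organization):
    """Search step: split the indexed posts for one organization by sentiment."""
    hits = index.get(organization.lower(), [])
    return {
        "positive": [url for url, rating in hits if rating > 0],
        "negative": [url for url, rating in hits if rating < 0],
    }


if __name__ == "__main__":
    posts = [
        BlogPost("http://example.com/a", "I love my Acme laptop, it is reliable."),
        BlogPost("http://example.com/b", "Acme support was terrible and slow!"),
    ]
    index = build_index(posts, ["Acme"])
    print(search(index, "Acme"))
```

Posts whose positive and negative word counts cancel out receive a rating of zero and appear in neither list. A real system would likely need phrase matching for multi-word company names and a persistent index, neither of which this sketch attempts.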