首页> 外文会议>International Conference on Informatics and Computing >Preprocessing For Crawler Of Short Message Social Media
【24h】

Preprocessing For Crawler Of Short Message Social Media

机译:短信社交媒体爬虫的预处理

获取原文

摘要

Social media can be utilized source information in the form of text that is widely exploit as an analytical tool to understand the attitudes, preferences and opinions of society. Companies can use for produce decisions about the needs, attitudes, opinions or trends about customers or potential customers. One of the popular social media now is Twitter. Research aims to design web applications crawl on twitter social media Indonesian language for Natural Language Processing needs. Methodology promote in this study is crawl twitter and preprocessing of data that includes parsing and tokenize, emoticon conversion, cleansing, case folding, normalization, negation handling, stop word and stemming. Research can be further exploited in text processing for classification of analytical sentiments.
机译:社交媒体可以以文本形式利用源信息,被广泛用作理解社会态度,偏好和观点的分析工具。公司可以使用有关客户或潜在客户的需求,态度,观点或趋势的决策。 Twitter现在是流行的社交媒体之一。研究旨在设计可在Twitter社交媒体印度尼西亚语上爬网的Web应用程序,以满足自然语言处理的需求。这项研究中推广的方法是抓取Twitter和对数据进行预处理,包括解析和标记化,图释转换,清理,大小写折叠,规范化,否定处理,停用词和词干。可以在文本处理中进一步利用研究来对分析情感进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号