【24h】

Got a Complaint?- Keep Calm and Tweet It!

机译:有投诉吗?-保持冷静并发布推文!

获取原文

摘要

Research shows that many public service agencies use Twitter to share information and reach out to the public. Recently, Twitter is also being used as a platform to collect complaints from citizens and resolve them in an efficient time and manner. However, due to the dynamic nature of the website and presence of free-form-text, manual identification of complaint posts is overwhelmingly impractical. We formulate the problem of complaint identification as an ensemble classification problem. We perform several text enrichment processes such as hashtag expansion, spell correction and slang conversion on raw tweets for identifying linguistic features. We implement a one-class SVM classification and evaluate the performance of various kernel functions for identifying complaint tweets. Our result shows that linear kernel SVM outperforms polynomial and RBF kernel functions and the proposed approach classifies the complaint tweets with an overall precision of 76 %. We boost the accuracy of our approach by performing an ensemble on all three kernels. Result shows that one-class parallel ensemble SVM classifier outperforms cascaded ensemble learning with a margin of approximately 20%. By comparing the performance of each kernel against ensemble classifier, we provide an efficient method to classify complaint reports.
机译:研究表明,许多公共服务机构使用Twitter共享信息并与公众接触。最近,Twitter还被用作收集市民投诉并以有效的时间和方式解决投诉的平台。但是,由于网站的动态性质和自由格式文本的存在,手动识别投诉帖子是绝对不切实际的。我们将投诉识别问题表达为整体分类问题。我们在原始推文上执行了一些文本充实过程,例如主题标签扩展,拼写校正和语转换,以识别语言特征。我们实施一类SVM分类,并评估各种内核功能的性能以识别投诉消息。我们的结果表明,线性核支持向量机优于多项式和RBF核函数,并且所提出的方法对投诉推文进行分类,总体精度为76%。我们通过对所有三个内核执行合奏来提高方法的准确性。结果表明,一类并行集成支持向量机分类器的性能优于级联集成学习,约有20%的余量。通过将每个内核的性能与集成分类器进行比较,我们提供了一种有效的方法来对投诉报告进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号