首页> 外文会议>Systems and Information Engineering Design Symposium >Online Review Content Moderation Using Natural Language Processing and Machine Learning Methods : 2021 Systems and Information Engineering Design Symposium (SIEDS)
【24h】

Online Review Content Moderation Using Natural Language Processing and Machine Learning Methods : 2021 Systems and Information Engineering Design Symposium (SIEDS)

机译:在线审查内容适度使用自然语言处理和机器学习方法:2021系统和信息工程设计研讨会(SIEDS)

获取原文

摘要

With the ubiquity of Internet-based words-of-mouth to inform decisions on various products and services, people have become reliant on the authenticity of website reviews. These reviews may be manually evaluated for publishability onto a website, however increasing volumes of user-submitted content may strain a website’s resources for accurate content moderation. Recognizing the important for patients to receive authentic reviews of cosmetic surgery procedures, we considered a corpus of 523,564 user-submitted reviews to the RealSelf.com website spanning the dates of 2018-01-01 through 2020-05-31. Prior binary classifications of "published" or "unpublished" were applied to these reviews by the RealSelf content moderation team. Textual and behavioral machine learning models were developed in this study to predict the classification of RealSelf’s reviews. An ensemble model, constructed from the top-performing textual and behavioral models in this study, was found to have a classification accuracy of 82.9 percent.
机译:随着基于互联网的夸张的口碑,可以为各种产品和服务的决策提供信息,人们已经依赖了网站评论的真实性。可以手动评估这些评论以在网站上发出发布性,但是增加了用户提交的内容的卷可能会强调网站的资源以获得准确的内容审核。认识到患者的重要患者接受美容手术程序的真实审查,我们认为为皇家网站的523,564个用户提交的审查的语料库,该网站跨越2018-01-01至2020-05-31的日期。 “已发布”或“未发表”的先前二进制分类被批量审核团队应用于这些审查。在本研究中开发了文本和行为机器学习模型,以预测素材的分类。从本研究中的顶级文本和行为模型构建的集合模型被发现,分类准确性为82.9%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号