首页> 外文会议>International Conference on Soft Computing and Intelligent Systems;International Symposium on Advanced Intelligent Systems >Harmful comments extraction from a Bulletin Board System using word meaning and impression on thread context
【24h】

Harmful comments extraction from a Bulletin Board System using word meaning and impression on thread context

机译:使用单词含义和对线程上下文的印象从公告板系统中提取有害评论

获取原文

摘要

Harmful documents make readers unpleasant on the Web. In order to hide the harmful documents from the public, machine learning methods have been proposed, which learn words used in harmful documents and hide them automatically. The learned words often have bad meanings. Though word meanings are not changed, word impression may be changed on context. Even if a word with bad impression is contained in a document, the previous learning methods can not learn the word, and fail to hide documents. We select the following approach: word impression may be changed on context. If a word has been used with other words of good meaning, it is considered that impression of the word is also good. In contrast, if a word has been used with others of bad meaning, impression of the word may be bad. This paper proposes a new extraction method of harmful comments in a thread of a Bulletin Board System. The proposed method extracts comments using word meanings and word impression on thread context. We evaluated the proposed method using comments collected from four threads in Japanese BBS "2-channel." The averaged precision of extraction was 0.47, and the averaged recall was 0.68. We verified that the proposed method was suitable for extraction of harmful comments from a thread of a BBS.
机译:有害的文档使读者对Web不满意。为了向公众隐藏有害文件,已经提出了机器学习方法,该方法学习有害文件中使用的单词并自动隐藏它们。学到的单词通常含义不好。虽然单词的含义没有改变,但单词的印象可能会根据上下文而改变。即使文档中包含印象较差的单词,以前的学习方法也无法学习该单词,并且无法隐藏文档。我们选择以下方法:可能会根据上下文更改单词印象。如果一个单词已与其他含义良好的单词一起使用,则认为该单词的印象也很好。相反,如果一个单词已与其他含义不好的单词一起使用,则该单词的印象可能不好。本文提出了一种新的公告板系统线程中有害评论的提取方法。所提出的方法在线程上下文中使用单词含义和单词印象提取注释。我们使用从日语BBS“ 2-channel”中四个线程收集的注释评估了该建议方法。提取的平均精度为0.47,平均召回率为0.68。我们验证了所提出的方法适用于从BBS线程中提取有害评论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号