Harmful comments extraction from a Bulletin Board System using word meaning and impression on thread context

机译：使用单词含义和对线程上下文的印象从公告板系统中提取有害评论

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Harmful documents make readers unpleasant on the Web. In order to hide the harmful documents from the public, machine learning methods have been proposed, which learn words used in harmful documents and hide them automatically. The learned words often have bad meanings. Though word meanings are not changed, word impression may be changed on context. Even if a word with bad impression is contained in a document, the previous learning methods can not learn the word, and fail to hide documents. We select the following approach: word impression may be changed on context. If a word has been used with other words of good meaning, it is considered that impression of the word is also good. In contrast, if a word has been used with others of bad meaning, impression of the word may be bad. This paper proposes a new extraction method of harmful comments in a thread of a Bulletin Board System. The proposed method extracts comments using word meanings and word impression on thread context. We evaluated the proposed method using comments collected from four threads in Japanese BBS "2-channel." The averaged precision of extraction was 0.47, and the averaged recall was 0.68. We verified that the proposed method was suitable for extraction of harmful comments from a thread of a BBS.

机译：有害的文档使读者对Web不满意。为了向公众隐藏有害文件，已经提出了机器学习方法，该方法学习有害文件中使用的单词并自动隐藏它们。学到的单词通常含义不好。虽然单词的含义没有改变，但单词的印象可能会根据上下文而改变。即使文档中包含印象较差的单词，以前的学习方法也无法学习该单词，并且无法隐藏文档。我们选择以下方法：可能会根据上下文更改单词印象。如果一个单词已与其他含义良好的单词一起使用，则认为该单词的印象也很好。相反，如果一个单词已与其他含义不好的单词一起使用，则该单词的印象可能不好。本文提出了一种新的公告板系统线程中有害评论的提取方法。所提出的方法在线程上下文中使用单词含义和单词印象提取注释。我们使用从日语BBS“ 2-channel”中四个线程收集的注释评估了该建议方法。提取的平均精度为0.47，平均召回率为0.68。我们验证了所提出的方法适用于从BBS线程中提取有害评论。

著录项

来源
《International Conference on Soft Computing and Intelligent Systems;International Symposium on Advanced Intelligent Systems》|2014年|1398-1402|共5页
会议地点
作者
Nishihara Yoko; Iwasa Kazuki; Fukumoto Junichi; Yamanishi Ryosuke;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Internet; document handling; feature extraction; learning (artificial intelligence); Japanese BBS; bad meaning; bulletin board system; harmful comments extraction method; harmful documents; harmful documents hiding; machine learning methods; thread context; word impression; word meaning; word meanings; Context; Data mining; Educational institutions; Postal services; TV; Web pages;

机译：互联网;文档处理;特征提取;学习（人工智能）;日语BBS;不良含义;公告板系统;有害注释提取方法;有害文档;有害文档隐藏;机器学习方法;线程上下文;单词印象;单词含义;单词含义;上下文;数据挖掘;教育机构;邮政;电视;网页;

相似文献

外文文献
中文文献
专利

1. A method of extracting malicious expressions in bulletin board systems by using context analysis [J] . Hiroshi Hanafusa, Kazuhiro Morita, Masao Fuketa, Information Processing & Management . 2011,第3期

机译：一种使用上下文分析在公告板系统中提取恶意表达的方法
2. The meaning of words (letter; comment) (see comments) [J] . Wilcox JR Jr Radiology . 1998,第3期

机译：单词的含义（字母;注释）（请参阅注释）
3. Blackboard and Black Board—Accentuation and Deaccentuation and Their Influences on Meanings of Words — [J] . Yuji HATAKEYAMA Interdisciplinary Information Sciences . 2000,第2期

机译：黑板和黑板—重音和重音及其对单词含义的影响—
4. Harmful comments extraction from a Bulletin Board System using word meaning and impression on thread context [C] . Nishihara Yoko, Iwasa Kazuki, Fukumoto Junichi, International Conference on Soft Computing and Intelligent Systems;International Symposium on Advanced Intelligent Systems . 2014

机译：使用Word含义和线程上下文的印象来提取公告板系统的有害评论
5. Learning words in context: An ERP investigation of word experience effects on familiarity and meaning acquisition. [D] . Balass, Michal. 2011

机译：在上下文中学习单词：ERP调查单词体验对熟悉度和含义习得的影响。
6. Contiguity-based sound iconicity: The meaning of words resonates with phonetic properties of their immediate verbal contexts [O] . Jan Auracher, Mathias Scharinger, Winfried Menninghaus 2015

机译：基于连续性的声音象似性：单词的含义会与其直接语言环境的语音特性产生共鸣
7. The word as a unit of meaning. The role of context in words meaning [O] . Проняєва Вікторія Едуардівна, Проняева Виктория Эдуардовна, Proniaieva Viktoriia Eduardivna 2015

机译：以词为义的单位。语境在词义中的作用
8. Context as the Building Blocks of Meaning: A Retrieval Model for the Semantic Representation of Words. [R] . Kwantes, P. J. 2003

机译：作为意义构建块的语境：词语语义表征的检索模型。

Harmful comments extraction from a Bulletin Board System using word meaning and impression on thread context

摘要

著录项

相似文献

相关主题

期刊订阅