Understanding and predicting Web content credibility using the Content Credibility Corpus

Michal Kakol; Radoslaw Nielek; Adam Wierzbicki

首页> 外文期刊>Information Processing & Management >Understanding and predicting Web content credibility using the Content Credibility Corpus

【24h】

Understanding and predicting Web content credibility using the Content Credibility Corpus

机译：使用内容可信语料库了解和预测Web内容可信度

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The goal of our research is to create a predictive model of Web content credibility evaluations, based on human evaluations. The model has to be based on a comprehensive set of independent factors that can be used to guide user's credibility evaluations in crowdsourced systems like WOT, but also to design machine classifiers of Web content credibility. The factors described in this article are based on empirical data. We have created a dataset obtained from an extensive crowdsourced Web credibility assessment study (over 15 thousand evaluations of over 5000 Web pages from over 2000 participants). First, online participants evaluated a multi-domain corpus of selected Web pages. Using the acquired data and text mining techniques we have prepared a code book and conducted another crowdsourcing round to label textual justifications of the former responses. We have extended the list of significant credibility assessment factors described in previous research and analyzed their relationships to credibility evaluation scores. Discovered factors that affect Web content credibility evaluations are also weakly correlated, which makes them more useful for modeling and predicting credibility evaluations. Based on the newly identified factors, we propose a predictive model for Web content credibility. The model can be used to determine the significance and impact of discovered factors on credibility evaluations. These findings can guide future research on the design of automatic or semiautomatic systems for Web content credibility evaluation support. This study also contributes the largest credibility dataset currently publicly available for research: the Content Credibility Corpus (C3).

机译：我们研究的目的是基于人工评估，创建Web内容可信度评估的预测模型。该模型必须基于一组全面的独立因素，这些因素可用于指导诸如WOT的众包系统中的用户可信度评估，还可以设计Web内容可信度的机器分类器。本文中描述的因素是基于经验数据。我们已经创建了一个数据集，该数据集是从广泛的众包Web信誉评估研究（来自2000多个参与者的5000多个Web页面进行的超过1.5万次评估）获得的。首先，在线参与者评估了所选网页的多域语料库。使用获得的数据和文本挖掘技术，我们准备了代码簿，并进行了另一轮众包，以标记先前响应的文本理由。我们扩展了先前研究中描述的重要信誉评估因素的清单，并分析了它们与信誉评估分数的关系。发现的影响Web内容可信度评估的因素之间的关联也很弱，这使它们对于建模和预测可信度评估更加有用。基于新发现的因素，我们提出了一种用于Web内容可信度的预测模型。该模型可用于确定发现因素对可信度评估的重要性和影响。这些发现可以指导将来对Web内容信誉评估支持的自动或半自动系统设计的研究。这项研究还贡献了目前可供研究的最大可信度数据集：内容可信度语料库（C3）。

著录项

来源
《Information Processing & Management》 |2017年第5期|1043-1061|共19页
作者
Michal Kakol; Radoslaw Nielek; Adam Wierzbicki;
展开▼
作者单位

Polish-Japanese Academy of Information Technology, Warsaw 02-008, Poland;

Polish-Japanese Academy of Information Technology, Warsaw 02-008, Poland;

Polish-Japanese Academy of Information Technology, Warsaw 02-008, Poland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Web credibility; Crowdsourcing; Evaluating web site content; Credibility evaluation; Credibility issues;

机译：网络信誉;众包;评估网站内容;信誉评估;信誉问题;
入库时间 2022-08-17 23:20:11

相似文献

外文文献
中文文献
专利

1. Improving Search and Information Credibility Analysis from Interaction between Web1.0 and Web2.0 Content [J] . Katsumi Tanaka, Satoshi Nakamura, Hiroaki Ohshima, Journal of software . 2010,第2期

机译：通过Web1.0和Web2.0内容之间的交互来改进搜索和信息可信度分析
2. Hoax news-inspector: a real-time prediction of fake news using content resemblance over web search results for authenticating the credibility of news articles [J] . Varshney Deepika, Vishwakarma Dinesh Kumar Journal of ambient intelligence and humanized computing . 2021,第9期

机译：骗局新闻检查员：使用内容相似在Web搜索结果上使用内容相似进行假新闻的实时预测，以验证新闻文章的可信度
3. Contextual Bias in Verbal Credibility Assessment: Criteria-Based Content Analysis, Reality Monitoring and Scientific Content Analysis [J] . GLYNIS BOGAARD, EWOUT H. MEIJER, ALDERT VRIJ, Applied cognitive psychology . 2014,第1期

机译：言语可信度评估中的语境偏差：基于标准的内容分析，现实监控和科学内容分析
4. Credibility Microscope: Relating Web Page Credibility Evaluations to Their Textual Content [C] . Jaworski W., Rejmund E., Wierzbicki A. IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies . 2014

机译：可信度显微镜：将网页可信度评估与其文本内容相关联
5. The Perceived Credibility of Professional Photojournalism Compared to User-Generated Content Among American News Media Audiences [D] . Gayle, Gina. 2020

机译：与美国新闻媒体观众中的用户生成的内容相比，专业摄影性的信誉
6. Breast cancer on social media: a quali-quantitative study on the credibility and content type of the most shared news stories [O] . Priscila Biancovilli, Lilla Makszin, Alexandra Csongor 2021

机译：社交媒体乳腺癌：对最具共享新闻报道的可信度和内容类型的质量研究
7. nyk of the Lviv University. Series Law KEYWORDS abuse of authority, abuse of power, abuse of official status, abuse of office acts of the European Union, international legal regulation, employment, right to free movement advocacy, advocate activity, advocacy science, advocatologie, theory of advocacy appeal proceeding, grounds to judgement revision, inconsistency of the court’s findings at first instance with the actual circumstances of the criminal proceedings, cancellation or alteration of the judgment charity organization, founder, assets of charity organization, constituent documents criminal proceedings, subjects of criminal proceedings, the suspect, the suspect law, criminal procedure, international standards employer’s duty, right to the moral injury compensation, social insurance from an industrial accident, social need, labour dispute forms of the legal actions of the collective of employees historical and legal science department, scientific activity law enforcement equipment, individual legal act, the means of forming the content of the enforcement act, requirements for registration of individual legal act attributes (properties) of acts of law legal formula (construction), qualified corpus delicti of a crime, degree of social danger, crime-forming feature legal social community legal technique, technology, legal act, legal system, lawmaking legitimacy, investigation of crime, concept of criminalistics, criminalistics recommendations, tactical methods measures aimed at providing criminal proceedings, procedural sanction, monetary penalty, pre-trial investigation national implementation, forms of implementation, implementation practice, European states, international treaties participant, shareholder, partnership, acquisition, changing, suspension proof, probability, likelihood, credibility in evidence, reliability scientific school, research, development land, agricultural and environmental law, Lviv Scientific School land, agricultural and environmental law the High Council of Justice of Ukraine, the National Council of Justice of the Republic of Poland, judges’ independence, international standards of the judiciary the presumption of the labor legal personality OPEN JOURNAL SYSTEMS Journal Help USER Username Password Remember me Login NOTIFICATIONS View Subscribe LANGUAGE Select LanguageSubmit JOURNAL CONTENT Search Search Scope Search Browse By Issue By Author By Title Other Journals FONT SIZE Make font size smallerMake font size defaultMake font size larger INFORMATION For Readers For Authors For Librarians HOME ABOUT LOGIN REGISTER SEARCH CURRENT ARCHIVES ANNOUNCEMENTS Home > No 67 (2018) > Марін ONCE AGAIN ON THE RETROACTIVE EFFECT OF THE CRIMINAL LAW IN TIME IN THE ASPECT OF INDIRECT CRIMINALIZATION [O] . Oleksandr Marin 2018

机译：LVIV大学的尼克。系列律师关键词滥用权威，滥用权力，滥用官方地位，滥用欧盟办公室行为，国际法律监管，就业，自由运动倡导，倡导活动，倡导科学，倡导性宣传理论，倡导呼吁的理论，理由修改，法院调查结果的不一致事实，判决慈善组织的实际情况，取消或改变判决慈善机构，慈善组织的创始人，慈善机构资产，组成文件刑事诉讼，刑事诉讼主题，嫌疑人，嫌疑法，刑事诉讼，国际标准雇主的责任，道德伤害赔偿权，社会保险从工业事故，社会需求，劳动争端形式的员工历史和法律科学部的法律行为，科学活动执法设备，个人法律法案，制定执法法案的内容，法律法律公式（建设）行为的个人法律法案（属性），犯罪的合格犯罪，社会危险程度，犯罪形成特征法律社会社会法律技术，技术，法律法，法律制度，立法合法性，犯罪调查，犯罪概念，犯罪建议，战术方法旨在提供刑事诉讼，程序制裁，货币罚款，预审调查预审国家实施，执行形式，实施实践，欧洲国家，国际条约参与者，股东，伙伴关系，收购，不断变化，暂停证明，概率，可能性，可信度在证据，可靠性科学学校，研究，开发土地，农业和环境法，利沃科学校园，农业和环境法律议理乌克兰的冰，波兰共和国国家司法委员会，法官的独立，国际标准的司法部门劳动法人的推定开放期刊系统期刊帮助用户用户名密码记住我登录通知查看订阅语言选择lobsumberubmit期刊内容搜索搜索范围搜索按问题浏览作者标题在间接刑事定罪方面，再次对刑法的追溯效应及时

Understanding and predicting Web content credibility using the Content Credibility Corpus

摘要

著录项

相似文献

相关主题

期刊订阅