首页> 外文会议>International Conference on Artificial Intelligence >Machine Learning with Selective Word Statistics for Automated Classification of Citation Subjectivity in Online Biomedical Articles

【24h】

Machine Learning with Selective Word Statistics for Automated Classification of Citation Subjectivity in Online Biomedical Articles

机译：机器学习与在线生物医学文章中的引文主体自动分类的选择性词汇统计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

There is growing interest in automatically classifying author's sentiment expressed within citation sentences in scientific literature to provide effective tools for researchers who are seeking relevant previous work or approaches for a certain research purpose. We propose an automated method of determining whether a given citation sentence contains an author's subjective opinion (positive or negative) or objective factual information, as the first step to analyze and identify the citing author's sentiments toward the cited external sources. Our method uses a support vector machine (SVM)-based text categorization technique to identify the subjective citations specifically toward Comment-on (CON) articles. CON, a MEDLINE citation field, indicates previously published articles commented on by authors of a given article expressing possibly complimentary or contradictory opinions. We introduce a bag of unigrams based on selective word statistics, which is derived from a text region of interest within a sentence containing a description of author's reason of citation and lexical linguistic cues to build an input feature vector for the SVM classifier. Experiments conducted on a set of CON sentences collected from 414 different online biomedical journal titles show that the SVM classifier yields a comparable result for the proposed a bag of unigrams input feature selectively extracted from a text of interest, compared to another bag of unigrams from the entire sentence. Moreover, we achieve a significant performance boost of the SVM with an input feature vector combining two types of statistical bag of unigrams and sentiment word lexicon.

机译：在科学文献中的引文句子中自动追查作者的情感日益增长的感兴趣，为正在寻求某种研究目的的相关工作或方法的研究人员提供有效的工具。我们提出了一种自动化方法，即确定给定的引文判决是否包含作者的主观意见（积极或负面）或客观事实信息，作为分析和识别引用作者对引用的外部来源的情绪的第一步。我们的方法使用支持向量机（SVM）的文本分类技术，专门针对评论（CON）文章来识别主观引用。 CON，一个MEDLINE引文，表明了以前发表的文章评论由特定文章的作者表示可能是互补或矛盾的意见。我们基于选择性单词统计来介绍一袋Unigrams，它来自包含作者引文和词汇语言线索的描述的句子中的句子中的文本区域，以构建SVM分类器的输入特征向量。从414个不同在线生物医学期刊标题收集的一组CON句子的实验表明，与来自感兴趣的文本相比，SVM分级器产生了所提出的一袋Unigram的输入特征的比较结果。整句。此外，我们通过输入特征向量实现SVM的显着性能提升，其中组合了两种类型的Unigrams和情感词词典。

著录项

来源
《International Conference on Artificial Intelligence》|2017年|351p|共7页
会议地点
作者
Incheol Kim; George R. Thoma;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Subjectivity classification; Selective word statistics; Comment-on; Support vector machine; MEDLINE;

机译：主观性分类;选择性单词统计;注释;支持向量机;MEDLINE;

相似文献

外文文献
中文文献
专利

1. Automated Arabic Text Classification With P-Stemmer, Machine Learning, and a Tailored News Article Taxonomy [J] . Tarek Kanan, Edward A. Fox Journal of the American Society for Information Science and Technology . 2016,第11期

机译：具有P-Stemmer，机器学习和量身定制的新闻文章分类法的自动化阿拉伯文本分类
2. Improving MeSH classification of biomedical articles using citation contexts. [J] . Aljaber B, Martinez D, Stokes N, Journal of biomedical informatics. . 2011,第5期

机译：使用引用语境改善生物医学文章的MeSH分类。
3. Exploiting Contextual Word Embedding of Authorship and Title of Articles for Discovering Citation Intent Classification [J] . Muhammad Roman, Abdul Shahid, Muhammad Irfan Uddin, Complexity . 2021,第a期

机译：利用上下文词嵌入作者身份和文章标题，用于发现引用意图分类
4. Machine Learning with Selective Word Statistics for Automated Classification of Citation Subjectivity in Online Biomedical Articles [C] . Incheol Kim, George R. Thoma International Conference on Artificial Intelligence . 2017

机译：机器学习与在线生物医学文章中的引文主体自动分类的选择性词汇统计
5. Improving biomedical information retrieval citation metrics using machine learning [D] . Fu, Lawrence D. 2008

机译：使用机器学习改进生物医学信息检索引文指标
6. Automated Classification of Radiology Reports for Acute Lung Injury: Comparison of Keyword and Machine Learning Based Natural Language Processing Approaches [O] . Imre Solti, Colin R. Cooke, Fei Xia, -1

机译：放射学报告的急性肺损伤的自动分类：基于和机械关键字的比较学习自然语言处理途径
7. Acreditation Certificate Acreditation No. 21/E/KPT/2018 Article Tools Print this article Indexing metadata How to cite item Email this article Email the author About The Authors Ainun Ramadhani Tri Wahyuni ORCID iD https://orcid.org/0000-0002-4071-3406 Fisheries and Marine Science Faculty, Brawijaya University Indonesia Endang Yuli Herawati Fisheries and Marine Science Faculty, Brawijaya University Indonesia Andi Kurniawan ORCID iD Fisheries and Marine Science Faculty, Brawijaya University Indonesia Abd. Aziz Amin ORCID iD Coastal and Marine Research Center, University of Brawijaya, Indonesia Indonesia About RJLS Aim and Scope Editorial Board Reviewer Acknowledgement Publication Ethics Visitor Statistic Information for Author Author Guidelines (online version) Online Submission Guideline Online Registration Author Fees Download Template User You are logged in as... riris_rjlsub My Profile Log Out Tools Mendeley User Guide Insert Citation using Mendeley Journal Index Visitor Statistic Notifications View (141 new) Manage Journal Content Search Search Scope Browse By Issue By Author By Title Information For Readers For Authors For Librarians Keywords Antioxidant Bali Strait Biogeography CODIS 13 Calamaria DPPH Dyslipidemia Eucheuma cottonii ICP11 Litopenaeus vannamei Macrobrachium rosenbergii Morphology Pandanus Physalis minima RFLP Sardinella lemuru Sperm WSSV birth weight fermentation rats Isolation, and Identification of Diesel Oil Degrading Bacteria in Water Contamination Site and Preliminary analysis with Potential Bacterial Gordonia terrae [O] . Ainun Ramadhani Tri Wahyuni, Endang Yuli Herawati, Andi Kurniawan, 2019

机译：Acreditation证书Acreditation号21 / E / KPT / 2018条工具打印这篇文章索引元数据如何引用文章项目将该文章发送给作者发邮件作者简介艾南·拉马扎尼三Wahyuni ORCID的iD https://orcid.org/0000-0002- 4071-3406渔业和海洋科学学院，Brawijaya大学印尼Endang玉立Herawati渔业和海洋科学学院，Brawijaya大学印度尼西亚安迪Kurniawan ORCID的iD渔业和海洋科学学院，Brawijaya大学印尼阿卜杜勒。阿齐兹阿明ORCID的iD沿海和海洋研究中心，Brawijaya大学，印度尼西亚印度尼西亚关于RJLS目标实现作者作者准则的范围编委会审阅确认出版道德访客统计信息（网络版）在线投稿指南在线注册作者费下载模板用户你是登录为... riris_rjlsub使用Mendeley杂志指数访客统计通知视图（141新）管理期刊内容搜索范围浏览按问题按作者按标题信息供读者对于作者为馆员关键词我的个人资料注销工具Mendeley用户指南插入引文抗氧化剂巴厘海峡生物地理学CODIS 13铁线蛇属DPPH血脂异常麒麟菜cottonii ICP11凡纳滨对虾罗氏沼虾形态露兜小酸浆RFLP黄泽小沙丁鱼精子WSSV出生体重发酵鼠隔离，并在水污染网站和预柴油降解菌的鉴定与潜在的细菌大头terrae liminary分析

Machine Learning with Selective Word Statistics for Automated Classification of Citation Subjectivity in Online Biomedical Articles

摘要

著录项

相似文献

相关主题

期刊订阅