Automated Identification of Biomedical Article Type Using Support Vector Machines

Abstract

Authors of short papers such as letters or editorials often express complementary, and sometimes contradictory, opinions on related work in previously published articles. The MEDLINE® citations for such short papers are required to list bibliographic data on these "commented on" articles in a "CON" field. The challenge is to automatically identify the CON articles referred to by the author of the short paper (called a "Comment-in" or CIN paper). Our approach is to use support vector machines (SVMs) first to classify a paper as either a CIN paper or a regular full-length article (which is exempt from this requirement), and then to extract the bibliographic data of the CON articles from the CIN paper. This paper addresses the first part of the problem, identifying CIN articles. We implement and compare the performance of two types of SVM, one with a linear kernel function and the other with a radial basis function (RBF) kernel. Input feature vectors for the SVMs are created by combining four types of features: statistics of words in the article title, words that suggest the article type (letter, correspondence, editorial), size of the body text, and cue phrases. Experiments conducted on a set of online biomedical articles show that the SVM with a linear kernel function yields a significantly lower false negative error rate than the one with an RBF kernel. Our experiments also show that the SVM with a linear kernel function achieves significantly higher accuracy, and lower false positive and false negative error rates, when its input feature vectors combine all four types of features rather than using any single type.
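The feature combination and the two kernels named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the word lists (TITLE_WORDS, CUE_PHRASES), the log scaling of body size, the gamma value, and the function names are all assumptions made for illustration, since the abstract does not give these details. Only the three article-type words come from the abstract itself.

```python
import math
import re

# Article-type words listed in the abstract.
TYPE_WORDS = ("letter", "correspondence", "editorial")
# Hypothetical vocabularies: the abstract does not list the actual title
# words or cue phrases used by the authors.
TITLE_WORDS = ("reply", "comment", "response")
CUE_PHRASES = ("in reply", "we read with interest", "the article by")


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_features(title, article_type_text, body):
    """Concatenate four illustrative feature groups into one vector."""
    title_tokens = tokenize(title)
    type_tokens = tokenize(article_type_text)
    body_lower = body.lower()
    # 1. Title-word statistic: share of title tokens found in the assumed list.
    title_score = (
        sum(t in TITLE_WORDS for t in title_tokens) / len(title_tokens)
        if title_tokens else 0.0
    )
    # 2. One flag per article-type word.
    type_flags = [1.0 if w in type_tokens else 0.0 for w in TYPE_WORDS]
    # 3. Body size as a log-scaled word count (the scaling is an assumption).
    size = math.log1p(len(tokenize(body)))
    # 4. Number of cue-phrase occurrences in the body.
    cue_count = float(sum(body_lower.count(p) for p in CUE_PHRASES))
    return [title_score, *type_flags, size, cue_count]


def linear_kernel(x, y):
    """Linear kernel: the dot product of two feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def rbf_kernel(x, y, gamma=0.5):
    """RBF kernel: exp(-gamma * squared Euclidean distance)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


letter = build_features(
    "Reply to Smith", "Letter", "We read with interest the article by Smith."
)
full = build_features(
    "A randomized trial of drug X", "Research article", "Methods. " * 200
)
print(letter)
print(linear_kernel(letter, full), rbf_kernel(letter, full))
```

In a real system these vectors would be passed to an SVM trainer with either kernel. The sketch stops at the kernel values because the abstract does not describe the training setup or parameters.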

Bibliographic details

Source: Document Recognition and Retrieval XVIII, 2011, pp. 787403.1-787403.9 (9 pages)
Conference location: San Francisco, CA (US)
Authors: In Cheol Kim; Daniel X. Le; George R. Thoma
Author affiliation (listed identically for each of the three authors): Lister Hill National Center for Biomedical Communications, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894
Original format: PDF
Language: English
Chinese Library Classification: Information processing
Keywords: "Comment-on"; "Comment-in"; Online biomedical documents; Support vector machine

Similar literature

1. Automated and precise event detection method for big data in biomedical imaging with support vector machine [J]. Yuan Lufeng, Yao Erlin, Tan Guangming. International Journal of Computer Systems Science & Engineering, 2018, No. 2.
2. Automated Identification of Hookahs (Waterpipes) on Instagram: An Application in Feature Extraction Using Convolutional Neural Network and Support Vector Machine Classification [J]. Youshan Zhang, Jon-Patrick Allem, Jennifer Beth Unger. Journal of Medical Internet Research, 2018, No. 11.
3. Automated plant identification using artificial neural network and support vector machine [J]. Soon Jye Kho, Sugumaran Manickam, Sorayya Malek. Frontiers in Life Science, 2017, No. 1.
4. Examining Effects of the Support Vector Machines Kernel Types on Biomedical Data Classification [C]. İbrahim Berkan Aydilek. International Conference on Artificial Intelligence and Data Processing, 2018.
5. Nonlinear systems identification using support vector machines [D]. Al-Dhaifallah, Mujahed. 2010.
6. Regional Context-Sensitive Support Vector Machine Classifier to Improve Automated Identification of Regional Patterns of Diffuse Interstitial Lung Disease [O]. Jonghyuck Lim, Namkug Kim, Joon Beom Seo. 2011.
7. Identification of "comment-on sentences" in online biomedical documents using support vector machines [O]. In Cheol Kim, Daniel X. Le, George R. Thoma. 2008.
8. Developing Human-Machine Interfaces to Support Appropriate Trust and Reliance on Automated Combat Identification Systems (Developpement d'Interfaces Homme-Machine Pour Appuyer la Confiance dans les Systemes Automatises d'Identification au Combat); Contra [R]. Jamieson, G. A., Wang, L., Neyedli, H. F. 2008.
