A Method for Eliminating Articles by Homonymous Authors From the Large Number of Articles Retrieved by Author Search

Natsuo Onodera; Mariko Iwasawa; Nobuyuki Midorikawa; Fuyuki Yoshikane; Kou Amano; Yutaka Ootani; Tadashi Kodama; Yasuhiko Kiyama; Hiroyuki Tsunoda; Shizuka Yamazaki


Abstract

This paper proposes a method that discriminates articles by the target authors ("true" articles) from those by other homonymous authors ("false" articles). Author name searches for 2,595 "source" authors in six subject fields retrieved about 629,000 articles. To extract the true articles from this large set of retrieved articles, which included many false ones, two filtering stages were applied. In the first stage, a retrieved article was eliminated as false if either its affiliation addresses had little similarity to those of its source article or there was no citation relationship between the journal of the retrieved article and that of its source article. In the second stage, a sample of retrieved articles was judged manually, and the judgment results were used to define discrimination functions based on logistic regression. These discrimination functions achieved recall and precision of about 95% each and accuracy (correct answer ratio) of 90-95%. The existence of common coauthor(s), address similarity, title-word similarity, and interjournal citation relationships between the retrieved and source articles were found to be effective discrimination predictors. Whether or not the source author was from a specific country was also an important predictor. Furthermore, a retrieved article was shown to be almost certainly true if it was cited by, or cocited with, its source article. The proposed method would be effective when dealing with a large number of articles whose subject fields and affiliation addresses vary widely.
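The two-stage procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `ArticlePair` fields, the address threshold, the intercept, and the weights are all hypothetical placeholders, since the abstract reports neither thresholds nor fitted coefficients, and only a subset of the predictors it names is included.

```python
import math
from dataclasses import dataclass


@dataclass
class ArticlePair:
    """Hypothetical features linking a retrieved article to its source article."""
    address_similarity: float    # 0.0-1.0, affiliation address similarity
    journal_citation_link: bool  # any citation relationship between the two journals
    common_coauthor: bool        # at least one shared coauthor
    title_similarity: float      # 0.0-1.0, title-word similarity


# Illustrative values only (assumptions, not taken from the paper).
ADDRESS_THRESHOLD = 0.2
INTERCEPT = -3.0
W_COAUTHOR, W_ADDRESS, W_TITLE = 4.0, 3.0, 2.0


def passes_first_stage(pair: ArticlePair) -> bool:
    """Stage 1: eliminate if addresses are dissimilar OR the journals are unlinked."""
    return pair.address_similarity >= ADDRESS_THRESHOLD and pair.journal_citation_link


def true_probability(pair: ArticlePair) -> float:
    """Stage 2: logistic score that the retrieved article is a "true" article."""
    z = (INTERCEPT
         + W_COAUTHOR * pair.common_coauthor
         + W_ADDRESS * pair.address_similarity
         + W_TITLE * pair.title_similarity)
    return 1.0 / (1.0 + math.exp(-z))


def is_true_article(pair: ArticlePair, cutoff: float = 0.5) -> bool:
    """Keep an article only if it survives stage 1 and scores above the cutoff."""
    return passes_first_stage(pair) and true_probability(pair) >= cutoff


def evaluate(predicted: list[bool], actual: list[bool]) -> dict[str, float]:
    """Recall, precision, and accuracy against manual judgments."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    fp = sum(p and not a for p, a in zip(predicted, actual))
    fn = sum(not p and a for p, a in zip(predicted, actual))
    tn = sum(not p and not a for p, a in zip(predicted, actual))
    return {
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "accuracy": (tp + tn) / len(actual) if actual else 0.0,
    }


pairs = [
    ArticlePair(0.9, True, True, 0.5),   # strong evidence on every feature
    ArticlePair(0.05, True, True, 0.9),  # removed in stage 1: dissimilar address
    ArticlePair(0.3, True, False, 0.1),  # passes stage 1, low logistic score
]
predicted = [is_true_article(p) for p in pairs]
print(predicted, evaluate(predicted, [True, False, False]))
```

In practice the coefficients would be fitted to the manually judged sample rather than set by hand, and the cutoff would be chosen to balance recall against precision.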


Bibliographic details

Source
Journal of the American Society for Information Science and Technology, 2011, Issue 4, pp. 677-690 (14 pages)
Authors
Natsuo Onodera; Mariko Iwasawa; Nobuyuki Midorikawa; Fuyuki Yoshikane; Kou Amano; Yutaka Ootani; Tadashi Kodama; Yasuhiko Kiyama; Hiroyuki Tsunoda; Shizuka Yamazaki
Author affiliations

Graduate School of Library, Information and Media Studies, University of Tsukuba, 1-2, Kasuga, Tsukuba, Ibaraki 305-8550, Japan;

Graduate School of Library, Information and Media Studies, University of Tsukuba, 1-2, Kasuga, Tsukuba, Ibaraki 305-8550, Japan;

Graduate School of Library, Information and Media Studies, University of Tsukuba, 1-2, Kasuga, Tsukuba, Ibaraki 305-8550, Japan;

Graduate School of Library, Information and Media Studies, University of Tsukuba, 1-2, Kasuga, Tsukuba, Ibaraki 305-8550, Japan;

Bioresource Information Division, RIKEN BioResource Center, 3-1-1, Koyadai, Tsukuba, Ibaraki 305-0074, Japan;

Toho University Medical Media Center, 5-21-16, Omori-Nishi, Ota-ku, Tokyo 143-8540, Japan;

Toho University Medical Media Center, 5-21-16, Omori-Nishi, Ota-ku, Tokyo 143-8540, Japan;

Juntendo University Library, 2-2-26, Hongo, Bunkyo-ku, Tokyo 113-0033, Japan;

Department of Culture and Language, Shokei University, 6-5-1, Nirenoki, Kumamoto 861-8538, Japan;

International Medical Information Center, 35, Shinanomachi, Shinjuku-ku, Tokyo 160-0016, Japan;

Original format: PDF
Language: English

