A method to track dataset reuse in biomedicine: filtered GEO accession numbers in PubMed Central

Abstract

Reusing research data has important potential benefits: generative science and efficient resource use. Tracking the reuse of research datasets would allow us to understand whether the potential benefits are indeed realized, enable recognition of investigators who produce, annotate, and share useful data, and inform data sharing and reuse initiatives, tools, and policies. Unfortunately, the lack of clear attribution practices for data makes automated tracking of data reuse difficult. I present a method for tracking research data reuse that takes advantage of the community norms around gene expression microarray data sharing and the rich NCBI Entrez resources. Specifically, the full text of papers stored in PubMed Central is queried for accession numbers of datasets archived in NCBI’s Gene Expression Omnibus (GEO) repository. Studies known to have created microarray data are excluded through automated filters and guided manual curation. MeSH terms attached to the data creation and data reuse studies provide additional information for analysis. Finally, I extrapolate the findings to all of PubMed. Automated portions of this method have been implemented in Python and are openly available. Although imperfect, this dataset is a valuable initial resource for research into patterns of data reuse.
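
The accession-matching and filtering steps described in the abstract can be illustrated with a minimal Python sketch. This is not the author's released code: the function names, the sample texts, and the placeholder paper identifiers are hypothetical, and it assumes the full texts are already available as strings rather than fetched from PubMed Central. It only recognizes GEO series (GSE) and dataset (GDS) accessions and drops papers already known to have created the dataset; the guided manual curation step is not modeled.

```python
import re

# GEO series (GSE) and curated dataset (GDS) accessions: a prefix plus digits.
GEO_ACCESSION = re.compile(r"\b(?:GSE|GDS)\d+\b")


def find_geo_accessions(full_text):
    """Return the sorted, de-duplicated GEO accessions mentioned in a text."""
    return sorted(set(GEO_ACCESSION.findall(full_text)))


def candidate_reuse(papers, creators):
    """Map each paper id to the accessions it mentions but did not create.

    papers:   {paper_id: full_text}  (hypothetical input shape)
    creators: {accession: set of paper_ids known to have created the dataset}
    """
    reuse = {}
    for paper_id, text in papers.items():
        reused = [
            acc
            for acc in find_geo_accessions(text)
            if paper_id not in creators.get(acc, set())
        ]
        if reused:
            reuse[paper_id] = reused
    return reuse


# Made-up example texts and identifiers, for illustration only.
papers = {
    "PMC0000001": "Arrays were deposited in GEO under GSE1111.",
    "PMC0000002": "We reanalyzed GSE1111 together with GDS222.",
}
creators = {"GSE1111": {"PMC0000001"}}

result = candidate_reuse(papers, creators)
print(result)
```

A paper that mentions an accession is only a candidate for reuse. As the abstract notes, automated filters are combined with manual curation, so the output of a sketch like this would still need review.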

Bibliographic details

Source
Proceedings of the 73rd ASIS&T Annual Meeting: Navigating Streams in an Information Ecosystem | 2010 | pp. 1-2 | 2 pages
Conference location: Pittsburgh, PA (US)
Author
Heather A Piwowar
Author affiliation

National Evolutionary Synthesis Center, 2024 W. Main Street, Suite A200, Durham, NC 27705; hpiwowar@nescent.org

Original format: PDF
Language: English
Chinese Library Classification: Information and knowledge dissemination
Keywords
data sharing; data reuse; method; bioinformatics; bibliometrics; human information behavior


