A framework for semi-automatic identification, disambiguation and storage of protein-related abbreviations in scientific literature

机译：科学文献中与蛋白质相关的缩写词的半自动识别，歧义消除和存储的框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a framework for identifying, disambiguating and storing protein-related abbreviations as found in the full texts of scientific papers, in order to build and maintain a publicly available abbreviation repository via a semi-automatic process. This process involves information extraction methods and techniques for acronym identification and resolution, based on lexical clues and syntactical, largely domain-independent criteria. A dictionary and an ontology for proteins provide the means for matching and disambiguating the biological entities. User feedback is gathered at the end of the process and the confirmed entries are then stored and made available to the scientific community for further reviewing.

机译：我们提出了一种用于识别，消除歧义和存储科学论文全文中发现的蛋白质相关缩写的框架，以便通过半自动过程来建立和维护公开可用的缩写存储库。此过程涉及基于词汇线索和句法，很大程度上与领域无关的标准的信息提取方法和用于缩写词识别和解析的技术。蛋白质的字典和本体提供了匹配和消除生物学实体歧义的方法。在过程结束时收集用户反馈，然后将确认的条目存储起来，并提供给科学界以供进一步检查。

著录项

来源
《2011 IEEE 27th International Conference on data Engineering Workshops》|2011年|p.59-61|共3页
会议地点
作者
Atzeni Paolo; Polticelli Fabio; Toti Daniele;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据处理、数据处理系统;
关键词

相似文献

外文文献
中文文献
专利

1. Subcellular localization charts: A new visual methodology for the semi-automatic localization of protein-related data sets [J] . Sommer B., Kormeier B., Demenkov P.S., Journal of Bioinformatics and Computational Biology . 2013,第1期

机译：亚细胞定位图：一种新的可视化方法，用于蛋白质相关数据集的半自动定位
2. Training without training data Improving the generalizability of automated medical abbreviation disambiguation [J] . Marta Skreta, Aryan Arbabi, Jixuan Wang, JMLR: Workshop and Conference Proceedings . 2020,第2010期

机译：没有培训数据的培训，提高了自动医学缩小歧义的易用性
3. Link-topic model for biomedical abbreviation disambiguation [J] . Kim Seonho, Yoon Juntae Journal of biomedical informatics. . 2015,第1期

机译：生物医学缩写歧义的链接主题模型
4. A framework for semi-automatic identification, disambiguation and storage of protein-related abbreviations in scientific literature [C] . Atzeni Paolo, Polticelli Fabio, Toti Daniele International Conference on Data Engineering Workshops . 2011

机译：科学文献中蛋白质相关缩写的半自动鉴定，消歧和储存框架
5. Automatic word sense disambiguation of acronyms and abbreviations in clinical texts [D] . Moon, Sungrim 2012

机译：临床文本中首字母缩写词和缩写词的自动词义消除
6. A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD) [O] . Yonghui Wu, Joshua C Denny, S Trent Rosenbloom, 2017

机译：短期缩写的长途旅行：开发临床缩写识别和消歧（卡）的开源框架
7. Abbreviation Explorer - an interactive system for pre-evaluation of Unsupervised Abbreviation Disambiguation [O] . Manuel Ciosici, Ira Assent 2019

机译：缩写资源管理器 - 用于预测缩写歧义的预测预测预测的交互式系统
8. Formal Framework for Semi-Automatic Parallel Program Generation [R] . van Gemund, A. J. C., Paalvast, E. M. R. M., Sips, H. J. 1988

机译：半自动并行程序生成的形式框架

A framework for semi-automatic identification, disambiguation and storage of protein-related abbreviations in scientific literature

摘要

著录项

相似文献

相关主题

期刊订阅