Detecting Invalid Dictionary Entries for Biomedical Text Mining


Abstract

In text mining, to calculate precise keyword frequency distributions in a particular document collection, we need to map different keywords that denote the same entity to a canonical form. In the life science domain, we can construct a large dictionary that contains the canonical forms and their variants based on the information from external resources and use this dictionary for the term aggregation. However, in this automatically generated dictionary, there are many invalid entries that have negative effects on the calculations of keyword frequencies. In this paper, we propose and test methods to detect invalid entries in the dictionary.
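The term aggregation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' method: the dictionary entries, the sample text, and the function name `aggregate_terms` are all hypothetical, and the longest-match lookup is simply one common way to apply a variant dictionary. The sketch only shows how a single invalid entry (here the common word "was" listed as a variant of a gene symbol) distorts the keyword frequency distribution.

```python
import re
from collections import Counter

# Hypothetical variant -> canonical-form dictionary (illustrative entries only).
VARIANT_TO_CANONICAL = {
    "tumor necrosis factor": "TNF",
    "tnf": "TNF",
    "tnf-alpha": "TNF",
    "interleukin 6": "IL6",
    "il-6": "IL6",
    # Assumed example of an invalid entry: a common English word
    # wrongly registered as a variant of a gene symbol.
    "was": "WAS",
}


def aggregate_terms(text, dictionary):
    """Count canonical forms using longest-match lookup of dictionary variants."""
    tokens = re.findall(r"[a-z0-9-]+", text.lower())
    max_len = max(len(variant.split()) for variant in dictionary)
    counts = Counter()
    i = 0
    while i < len(tokens):
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in dictionary:
                counts[dictionary[candidate]] += 1
                i += n
                break
        else:
            i += 1  # no variant starts at this token
    return counts


text = ("TNF-alpha was elevated. Tumor necrosis factor and IL-6 levels rose; "
        "interleukin 6 was measured.")

# Counts with the invalid entry present, and with it removed.
counts = aggregate_terms(text, VARIANT_TO_CANONICAL)
clean_dictionary = {k: v for k, v in VARIANT_TO_CANONICAL.items() if k != "was"}
clean_counts = aggregate_terms(text, clean_dictionary)

print(counts)
print(clean_counts)
```

With the invalid entry present, the verb "was" is counted as mentions of the canonical form `WAS`; removing that entry leaves the `TNF` and `IL6` counts unchanged and eliminates the spurious keyword.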


Bibliographic details

Source
PAKDD 2006 International Workshop on Knowledge Discovery in Life Science Literature (KDLL 2006); 2006-04-09; Singapore (SG) | 2006 | pp. 112-122 | 11 pages
Conference location: Singapore (SG)
Authors
Hironori Takeuchi; Issei Yoshida; Yohei Ikawa; Kazuo Iida; Yoko Fukui
Author affiliation

IBM Research, Tokyo Research Laboratory, IBM Japan, Ltd., Shimotsuruma 1623-14 Yamato-shi Kanagawa, Japan;

Original format: PDF
Text language: English
Chinese Library Classification: Quantitative biology

Similar literature


1. An Updated Protocol to Detect Invalid Entries in an Online Survey of Men Who Have Sex with Men (MSM): How Do Valid and Invalid Submissions Compare? [J] . Grey Jeremy A., Konstan Joseph, Iantaffi Alex, AIDS and behavior . 2015, Issue 10

2. Creating an invalid defect classification model using text mining on server development [J] . Yihsiung Su, Pin Luarn, Yue-Shi Lee, The Journal of Systems and Software . 2017, Mar. issue

3. Status of text-mining techniques applied to biomedical text. [J] . Erhardt RA, Schneider R, Blaschke C, Drug discovery today . 2006, Issue 7-8

4. Detecting Invalid Dictionary Entries for Biomedical Text Mining [C] . Hironori Takeuchi, Issei Yoshida, Yohei Ikawa, PAKDD International Workshop on Knowledge Discovery in Life Science Literature . 2006

5. Using text mining to extract gene and protein synonyms from biomedical texts [D] . Duong, Duc C. 2007

6. A text mining approach to detect mentions of protein glycosylation in biomedical text [O] . Daksha Shukla, Valadi K Jayaraman 2012

7. Topic Modeling Technique for Text Mining Over Biomedical Text Corpora Through Hybrid Inverse Documents Frequency and Fuzzy K-Means Clustering [O] . Junaid Rashid, Syed Muhammad Adnan Shah, Aun Irtaza, 2019

