Supervised learning for robust term extraction


Abstract

We propose a machine learning method that automatically classifies the n-grams extracted from a corpus into terms and non-terms. As training features, we use 10 statistics that are common in the term extraction literature. The method applies to term recognition in multiple domains and languages, and it helps to 1) avoid laborious post-processing (e.g. subjective threshold setting); 2) handle skewness in the training data and remain noticeably resilient to domain shift. Experiments are carried out on 6 corpora covering multiple domains and languages: the GENIA and ACL RD-TEC (1.0) corpora serve as the training set, and four TTC subcorpora on wind energy and mobile technology, in both Chinese and English, serve as the test set. The results are promising and indicate that the approach can identify both single-word and multiword terms with reasonably good precision and recall.
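The pipeline described in the abstract (extract n-grams, compute statistics per candidate, train a binary term/non-term classifier) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the 10 statistics or the learning algorithm, so the three features (log frequency, document-frequency ratio, n-gram length) and the plain logistic regression used here are hypothetical stand-ins, as are all function names.

```python
import math
from collections import Counter


def ngrams(tokens, max_n=2):
    """Yield every contiguous n-gram (as a tuple) with 1 <= n <= max_n."""
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            yield tuple(tokens[i:i + n])


def featurize(docs, max_n=2):
    """Map each candidate n-gram to a small vector of corpus statistics.

    ASSUMPTION: these three features are illustrative only; the paper
    uses 10 statistics that the abstract does not list.
    """
    tf = Counter()  # total occurrences of each n-gram
    df = Counter()  # number of documents containing each n-gram
    for doc in docs:
        grams = list(ngrams(doc, max_n))
        tf.update(grams)
        df.update(set(grams))
    n_docs = len(docs)
    return {
        g: [math.log(1 + tf[g]), df[g] / n_docs, float(len(g))]
        for g in tf
    }


def train_logistic(X, y, epochs=200, lr=0.5):
    """Fit logistic regression with plain stochastic gradient descent.

    ASSUMPTION: the classifier choice is hypothetical; any supervised
    binary classifier could take its place.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for x, t in zip(X, y):
            z = b + sum(wi * xi for wi, xi in zip(w, x))
            p = 1.0 / (1.0 + math.exp(-z))
            err = p - t  # gradient of the log loss with respect to z
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    return w, b


def predict(model, x):
    """Return 1 (term) if the linear score is positive, else 0 (non-term)."""
    w, b = model
    z = b + sum(wi * xi for wi, xi in zip(w, x))
    return 1 if z > 0 else 0
```

In use, one would call `featurize` on a tokenized training corpus, pair each candidate's feature vector with a gold label (1 for term, 0 for non-term), pass the pairs to `train_logistic`, and then apply `predict` to candidates from a different corpus. Because the classifier outputs a decision directly, no ranking threshold has to be chosen by hand, which is the post-processing step the abstract says the method avoids.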


Bibliographic record

Source
International Conference on Asian Language Processing, 2017, pp. 302-305 (4 pages)
Conference location: Singapore (SG)
Authors
Yu Yuan; Jie Gao; Yue Zhang
Affiliations

School of Languages and Cultures, Nanjing University of Information Science and Technology;

Department of Computer Science, University of Sheffield;

Information Systems Technology and Design, Singapore University of Technology and Design

Original format: PDF
Keywords
Feature extraction; Training; Supervised learning; Task analysis; Testing; Static VAr compensators; Correlation;


Similar literature

1. Aspect Term Extraction using Graph-based Semi-Supervised Learning [J]. Gunjan Ansari, Chandni Saxena, Tanvir Ahmad. Procedia Computer Science, 2020, No. 5
2. Does Distributionally Robust Supervised Learning Give Robust Classifiers? [J]. Weihua Hu, Gang Niu, Issei Sato. JMLR: Workshop and Conference Proceedings, 2018, No. 2010
3. Automatic lung segmentation in CT images using mask R-CNN for mapping the feature extraction in supervised methods of machine learning using transfer learning [J]. Luis Fabricio Souza, Gabriel Holanda, Francisco Hercules Silva. International Journal of Hybrid Intelligent Systems, 2020, No. 4
4. Supervised learning for robust term extraction [C]. Yu Yuan, Jie Gao, Yue Zhang. International Conference on Asian Language Processing, 2017
5. Plant Segmentation by Supervised Machine Learning Methods and Phenotypic Trait Extraction of Soybean Plants Using Deep Convolutional Neural Networks with Transfer Learning [D]. Adams, Jason R. 2020
6. Robust Semi-Supervised Traffic Sign Recognition via Self-Training and Weakly-Supervised Learning [O]. Obed Tettey Nartey, Guowu Yang, Sarpong Kwadwo Asare. 2020
7. Supervised Learning for Robust Term Extraction [O]. Yuan, Y., Gao, J., Zhang, Y. 2017
8. Simple, Robust, Scalable Semi-supervised Learning via Expectation Regularization [R]. Mann, G. S., McCallum, A. 2007
