An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation

Abstract

We explore how well active learning with Support Vector Machines works for a non-trivial task in natural language processing, using Japanese word segmentation as a test case. In particular, we discuss how the size of the pool affects the learning curve. We find that, in the early stage of training, a larger pool requires more labeled examples to reach a given level of accuracy than a smaller pool does. In addition, we propose a novel technique that uses a large number of unlabeled examples effectively by adding them gradually to the pool. The experimental results show that our technique requires fewer labeled examples than the technique from previous research. To achieve 97.0% accuracy, the proposed technique needs 59.3% of the labeled examples required by the previous technique and only 17.4% of those required by random sampling.
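The abstract does not say how queries are chosen or on what schedule the pool grows. The following is a minimal sketch of pool-based active learning with a gradually growing pool, assuming that the examples closest to the decision boundary are queried and that a fixed number of unlabeled examples joins the pool each round. The names `train_linear_svm`, `active_learn`, and `pool_step`, the Pegasos-style trainer, and the toy 2-D data are illustrative assumptions, not the paper's implementation.

```python
import random


def train_linear_svm(examples, epochs=50, lam=0.01, seed=0):
    """Pegasos-style stochastic subgradient training of a linear SVM.

    examples: list of (features, label) pairs with label in {-1, +1}.
    This trainer is a stand-in chosen to keep the sketch dependency-free.
    """
    rng = random.Random(seed)
    w = [0.0] * len(examples[0][0])
    b = 0.0
    t = 0
    for _ in range(epochs):
        order = examples[:]
        rng.shuffle(order)
        for x, y in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            w = [wi * (1.0 - eta * lam) for wi in w]  # regularization shrink
            if margin < 1.0:  # hinge loss is active for this example
                w = [wi + eta * y * xi for wi, xi in zip(w, x)]
                b += eta * y
    return w, b


def decision(model, x):
    """Signed score of x under the linear model (w, b)."""
    w, b = model
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def active_learn(unlabeled, oracle, seed_size=10, batch=5, rounds=10,
                 pool_step=50, seed=0):
    """Query labels for the pool examples nearest the decision boundary.

    Assumption: the pool grows by `pool_step` unlabeled examples per round.
    """
    rng = random.Random(seed)
    reserve = list(unlabeled)  # unlabeled examples not yet in the pool
    rng.shuffle(reserve)
    labeled = [(x, oracle(x)) for x in reserve[:seed_size]]
    reserve = reserve[seed_size:]
    pool = []
    for _ in range(rounds):
        pool.extend(reserve[:pool_step])  # grow the pool gradually
        reserve = reserve[pool_step:]
        model = train_linear_svm(labeled)
        pool.sort(key=lambda x: abs(decision(model, x)))  # most uncertain first
        queries, pool = pool[:batch], pool[batch:]
        labeled.extend((x, oracle(x)) for x in queries)
    return train_linear_svm(labeled), labeled


# Toy usage: 2-D points labeled by which side of the line x0 = x1 they fall on.
data_rng = random.Random(1)
points = [(data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)) for _ in range(600)]
label_of = lambda x: 1 if x[0] > x[1] else -1
model, labeled = active_learn(points, label_of)
print(len(labeled), "labels requested")
```

Setting `pool_step` to the full size of the unlabeled set reduces this to a fixed large pool, which gives a simple way to compare the two settings the abstract contrasts.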

Bibliographic details

Source
40th Annual Meeting of the Association for Computational Linguistics, Jul 7-12, 2002, Philadelphia, Pennsylvania, USA | 2002 | pp. 505-512 | 8 pages
Conference location: Philadelphia, Pennsylvania, USA
Author
Manabu Sassano
Author affiliation
Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki 211-8588, Japan
Original format: PDF
Language of text: English
CLC classification: Automation technology, computer technology

Similar literature

1. Segmentation of handwritten words using structured support vector machine [J]. Sharma Manoj Kumar, Dhaka Vijaypal Singh. Pattern Analysis and Applications. 2020, No. 3
2. Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan [J]. Dou Jie, Yunus Ali P., Dieu Tien Bui. Landslides. 2020, No. 3
3. Semi-supervised learning combining transductive support vector machine with active learning [J]. Wang Xibin, Wen Junhao, Alam Shafiq. Neurocomputing. 2016, No. JANa15PTa3
4. An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation [C]. Manabu Sassano. Annual Meeting of the Association for Computational Linguistics. 2002
5. Active learning with support vector machines for imbalanced datasets and a method for stopping active learning based on stabilizing predictions [D]. Bloodgood, Michael. 2009
6. Active Contour Based Segmentation and Classification for Pleura Diseases Based on Otsu’s Thresholding and Support Vector Machine (SVM) [O]. M Malathi, P Sinthia, K Jalaldeen. 2019
7. An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation [O]. Manabu Sassano. 2002
