首页> 外文会议>International Conference on Problems of Infocommunications. Science and Technology >Sentence Segmentation from Unformatted Text using Language Modeling and Sequence Labeling Approaches
【24h】

Sentence Segmentation from Unformatted Text using Language Modeling and Sequence Labeling Approaches

机译:使用语言建模和序列标记方法从未格式文本的句子分割

获取原文

摘要

Current research devoted to the Natural Language Processing problem of sentence segmentation from raw text. The focus was directed to the task of segmentation of auto-generated transcripts for videos that do not have any punctuation and segmentation. Two general approaches to solve the problem of sentence segmentation were proposed and experiments concluded on a comparison of results of pre-trained transformer-based models. Research on how different approach of solving problem affects results were carried out. As a result, the sequence labeling approach turned out to be the most suitable.
机译:目前研究致力于原始文本句子分割的自然语言处理问题。 重点是针对没有任何标点符号和分割的视频的自动生成转录物分割的任务。 提出了解决句子分割问题的两种普遍方法,并在比较训练的基于变压器的模型的结果比较方面的实验结束。 进行了解决问题的不同方法如何影响结果的研究。 结果,序列标记方法结果是最合适的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号