International Conference on Information and Computer Technologies

Word Segmentation for Arabic Abstractive Headline Generation

Abstract

Neural headline generation (NHG) has achieved impressive performance on the abstractive headline generation task in recent years. However, few studies have sought to improve NHG for agglutinative, morphologically rich languages, in part because resources are scarce. This work presents a dataset for the Arabic abstractive headline generation task. The dataset was used in experiments comparing word segmentation methods, including morphological word segmentation and Subword Regularization (SR). Experimental results show that morphological word segmentation and SR outperform or are on par with the state-of-the-art Byte Pair Encoding (BPE), both quantitatively and qualitatively.