首页> 外文会议>Workshop on Financial Technology and Natural Language Processing >aiai at the FinSBD-2 task: Sentence, List, and Item Boundary Detection and Items Classification of Noisy Financial Texts Using Data Augmentation and Attention Model
【24h】

aiai at the FinSBD-2 task: Sentence, List, and Item Boundary Detection and Items Classification of Noisy Financial Texts Using Data Augmentation and Attention Model

机译:AIAI在FINSBD-2任务:句子,清单和项目边界检测和项目使用数据增强和注意模型进行嘈杂的财务文本分类

获取原文

摘要

This paper describes the method that we submitted to the FinSBD2-shared task in IJCAI-2020 to detect the sentence, list, and item boundaries and classify the items from noisy unstructured English and French financial texts. We used the spatial and semantic information of text to augment each tokenized word of text as a fixed-length sentence, and we labeled each word sentence as different boundary types. Then, we proposed the deep attention model based on word embedding to detect the sentence, list, and items boundaries in noisy English and French texts extracted from the financial documents and classified the item sentences into different item types. The experiment shows that the proposed method could be an effective solution to deal with the FinSBD2-shared task.
机译:本文介绍了我们在IJCAI-2020中提交给FinsBD2共享任务的方法,以检测句子,列表和项目边界,并将项目与嘈杂的非结构化英语和法国财务文本分类。 我们使用文本的空间和语义信息将每个令牌的文本中的文本添加为固定长度句子,我们将每个单词句子标记为不同的边界类型。 然后,我们提出了基于Word嵌入的深入关注模型来检测嘈杂的英语和法语文本中的句子,列表和项目边界,并将物品句子分为不同的项目类型。 该实验表明,该方法可以是处理芬兰人共享任务的有效解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号