首页> 外文会议>International Conference on Applications of Natural Language to Information Systems >Human Language Comprehension in Aspect Phrase Extraction with Importance Weighting
【24h】

Human Language Comprehension in Aspect Phrase Extraction with Importance Weighting

机译:以重要的重量加权提取宽伤的人类语言理解

获取原文

摘要

In this study, we describe a text processing pipeline that transforms user-generated text into structured data. To do this, we train neural and transformer-based models for aspect-based sentiment analysis. As most research deals with explicit aspects from product or service data, we extract and classify implicit and explicit aspect phrases from German-language physician review texts. Patients often rate on the basis of perceived friendliness or competence. The vocabulary is difficult, the topic sensitive, and the data user-generated. The aspect phrases come with various wordings using insertions and are not noun-based, which makes the presented case equally relevant and reality-based. To find complex, indirect aspect phrases, up-to-date deep learning approaches must be combined with supervised training data. We describe three aspect phrase datasets, one of them new, as well as a newly annotated aspect polarity dataset. Alongside this, we build an algorithm to rate the aspect phrase importance. All in all, we train eight transformers on the new raw data domain, compare 54 neural aspect extraction models and, based on this, create eight aspect polarity models for our pipeline. These models are evaluated by using Precision, Recall, and F-Score measures. Finally, we evaluate our aspect phrase importance measure algorithm.
机译:在本研究中,我们描述了一个文本处理流水线,它将用户生成的文本转换为结构化数据。为此,我们培养基于神经和变压器的基于宽高的情感分析模型。由于大多数研究涉及产品或服务数据的明确方面,我们从德语医师审查文本中提取和分类隐式和显式的方向短语。患者经常根据感知友好或能力的基础来汇率。词汇量困难,主题敏感和用户生成的数据。方面短语具有各种措辞,使用插入而不是基于名词的,这使得呈现的案例同样相关和现实。要查找复杂的,间接方面短语,最新的深度学习方法必须与监督培训数据相结合。我们描述了三个方面短语数据集,其中一个是新的,以及一个新的被注释的宽高学极性数据集。除此之外,我们构建了一种算法来评估宽度短语的重要性。总而言之,我们在新的原始数据域中训练八个变形金刚,比较54神经字体提取模型,并根据此基于此,为我们的管道创建八个宽高的极性模型。这些型号通过使用精度,召回和F分度测量来评估。最后,我们评估了我们的宽度短语重要测量算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号