首页> 外文期刊>ACM Transactions on Management Information Systems >A Weakly Supervised WordNet-Guided Deep Learning Approach to Extracting Aspect Terms from Online Reviews
【24h】

A Weakly Supervised WordNet-Guided Deep Learning Approach to Extracting Aspect Terms from Online Reviews

机译:从在线评论中提取方面的深度学习方法是弱监督的Wordnet-指导的深度学习方法

获取原文
获取原文并翻译 | 示例
       

摘要

The unstructured nature of online reviews makes it inefficient and inconvenient for prospective consumers to research and use in support of purchase decision making. The aspects of products provide a fine-grained meaningful perspective for understanding and organizing review texts. Traditional aspect term extraction approaches rely on discrete language models that treat words in isolation. Despite that continuous-space language models have demonstrated promise in addressing a wide range of problems, their application in aspect term extraction faces significant challenges. For instance, existing continuous-space language models typically require large collections of labeled data, which remain difficult to obtain in many domains. More importantly, previous methods are largely data driven but overlook the role of human knowledge in guiding model development. To address these limitations, this study designs and develops weakly supervised WordNet-guided deep learning to aspect term extraction. The approach draws on deep-level semantic information from WordNet to guide not only the selection representative seed terms but also the pruning of aspect candidate terms. The weak supervision is provided by a very small set of labeled data. We conduct a comprehensive evaluation of the proposed method using both direct and indirect methods. The evaluation results with Yelp restaurant reviews demonstrate that our proposed method consistently outperforms all baseline methods including discrete models and the state-of-the-art continuous-space language models for aspect term extraction across both direct and indirect evaluations. The research findings have broad research, technical, and practical implications for various stakeholders of online reviews.
机译:在线评论的非结构化性质使得前瞻性消费者研究和使用支持购买决策的效率和不方便。产品的各个方面为理解和组织审查文本提供了一个细粒度的有意义的视角。传统的术语提取方法依赖于分离处理单词的离散语言模型。尽管连续空间语言模型已经证明了解广泛的问题,但它们在阶段术语提取中的应用面临重大挑战。例如,现有的连续空间语言模型通常需要大集合标记数据,这仍然难以在许多域中获得。更重要的是,以前的方法在很大程度上是数据驱动,但忽略了人类知识在指导模型发展中的作用。为了解决这些限制,这项研究设计并开发了弱监督的Wordnet引导的深度学习,以方面术语提取。该方法在Wordnet中借鉴了WordNet的深度语义信息,不仅指导选择代表种子术语,而且引导了方面候选术语的修剪。弱监督由一组非常小的标记数据提供。我们对使用直接和间接方法进行了综合评估方法。与Yelp Restaurant评论的评估结果表明,我们的提出方法始终如一地优于所有基线方法,包括离散模型和最先进的连续空间语言模型,用于跨直接和间接评估。研究结果对各种利益相关者进行了广泛的研究,技术和实际意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号