首页> 外文会议>International conference on computational linguistics >Separating Brands from Types: an Investigation of Different Features for the Food Domain
【24h】

Separating Brands from Types: an Investigation of Different Features for the Food Domain

机译:从类型中分离品牌:对食品领域不同功能的调查

获取原文

摘要

We examine the task of separating types from brands in the food domain. Framing the problem as a ranking task, we convert simple textual features extracted from a domain-specific corpus into a ranker without the need of labeled training data. Such method should rank brands (e.g. sprite) higher than types (e.g. lemonade). Apart from that, we also exploit knowledge induced by semi-supervised graph-based clustering for two different purposes. On the one hand, we produce an auxiliary categorization of food items according to the Food Guide Pyramid, and assume that a food item is a type when it belongs to a category unlikely to contain brands. On the other hand, we directly model the task of brand detection using seeds provided by the output of the textual ranking features. We also harness Wikipedia articles as an additional knowledge source.
机译:我们研究了在食品领域从品牌中分离类型的任务。将问题视为排名任务,我们将从特定领域语料库中提取的简单文本特征转换为排名,而无需标记训练数据。这样的方法应该使品牌(例如雪碧)的排名高于类型(例如柠檬水)的排名。除此之外,我们还利用基于半监督图的聚类所产生的知识来实现​​两个不同的目的。一方面,我们根据《食品指南金字塔》对食品进行了辅助分类,并假定食品属于不太可能包含品牌的类别时,该食品为一种类型。另一方面,我们使用文本排名特征输出提供的种子直接对品牌检测任务建模。我们还利用Wikipedia文章作为其他知识来源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号