首页> 外文会议>International conference on computational linguistics >Separating Brands from Types: an Investigation of Different Features for the Food Domain
【24h】

Separating Brands from Types: an Investigation of Different Features for the Food Domain

机译:从类型中分离品牌:对食品领域不同特征的调查

获取原文

摘要

We examine the task of separating types from brands in the food domain. Framing the problem as a ranking task, we convert simple textual features extracted from a domain-specific corpus into a ranker without the need of labeled training data. Such method should rank brands (e.g. sprite) higher than types (e.g. lemonade). Apart from that, we also exploit knowledge induced by semi-supervised graph-based clustering for two different purposes. On the one hand, we produce an auxiliary categorization of food items according to the Food Guide Pyramid, and assume that a food item is a type when it belongs to a category unlikely to contain brands. On the other hand, we directly model the task of brand detection using seeds provided by the output of the textual ranking features. We also harness Wikipedia articles as an additional knowledge source.
机译:我们检查食品领域中的品牌分离类型的任务。 将问题框架作为排名任务,我们将从域特定语料库中提取的简单文本功能转换为Ranker,而无需标记的培训数据。 这种方法应该将品牌(例如精灵)高于类型(例如柠檬水)。 除此之外,我们还利用了基于基于图形的基于图形的聚类诱导的知识,以实现两种不同的目的。 一方面,我们根据食物指南金字塔产生食物项目的辅助分类,并假设食品项目是一种不太可能包含品牌的类别。 另一方面,我们使用文本排序特征的输出提供的种子直接模拟品牌检测的任务。 我们还将维基百科文章作为额外的知识来源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号