...
首页> 外文期刊>電子情報通信学会技術研究報告. 思考と言語. Thought and Language >Keywords and weighting for product specifications extraction
【24h】

Keywords and weighting for product specifications extraction

机译:产品规格提取的关键字和权重

获取原文
获取原文并翻译 | 示例
           

摘要

Product specifications contain many data. It is not, however, clear which is the characteristic data in them. We are developing a multi-specifications summarization system using extracted characteristic data from the product specifications. The specifications are written in a tag. The presence of the
tag in an HTML document does not necessarily indicate the presence of specifications. Less than 30% of HTML
tags are real tables in one particular domain. In this paper, we propose a method for keyword extraction for product specifications extraction. We evaluate the performance for two keyword sets, which are constructed by entropy and a Bayes theorem based method.
机译:产品规格包含许多数据。但是,不清楚其中是哪些特征数据。我们正在使用从产品规格中提取的特征数据来开发多规格摘要系统。规范写在
标记中。 HTML文档中
标记的存在并不一定表示规范的存在。在一个特定域中,不到30%的HTML
标记是真实表。在本文中,我们提出了一种用于产品规格提取的关键字提取方法。我们评估了两个关键字集的性能,这两个关键字集是通过熵和基于贝叶斯定理的方法构造的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号