Transfer learning of syntactic structures for building taxonomies for search engines

Boris A. Galitsky

Abstract

We apply a paradigm of transfer learning to build a taxonomy of entities intended to improve search engine relevance in a vertical domain. The taxonomy construction process starts from seed entities and mines available source domains for new entities associated with these seed entities. New entities are formed by applying machine learning of syntactic parse trees (their generalizations) to the search results for existing entities to find commonalities between them. These commonality expressions then form parameters of existing entities and are turned into new entities at the next learning iteration. To match natural language expressions between source and target domains, we use syntactic generalization, an operation that finds a set of maximal common sub-trees of the constituency parse trees of these expressions. Taxonomy and syntactic generalization are applied to relevance improvement in search and to text similarity assessment. We evaluate the search relevance improvement in vertical and horizontal domains and observe a significant contribution of the learned taxonomy in the former and a noticeable contribution of a hybrid system in the latter. We also perform an industrial evaluation of taxonomy-based and syntactic generalization-based text relevance assessment and conclude that the proposed algorithm for automated taxonomy learning is suitable for integration into industrial systems. The proposed algorithm is implemented as a component of the Apache OpenNLP project.
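
As an illustration of the syntactic generalization operation described in the abstract, the following is a minimal Python sketch, not the author's OpenNLP implementation. It assumes parse trees are already available as nested tuples, with `(label, child, ...)` for phrases and `(pos_tag, word)` for leaves, and it returns a single common sub-tree rooted at the two roots rather than the full set of maximal common sub-trees. The function names `generalize`, `align`, and `size`, the `"*"` placeholder, and the example phrases are all hypothetical choices made for this sketch.

```python
def size(tree):
    """Count the nodes of a tree given as nested tuples."""
    if isinstance(tree[1], str):          # leaf: (pos_tag, word)
        return 1
    return 1 + sum(size(child) for child in tree[1:])


def generalize(a, b):
    """Return a common generalization of two parse trees, or None.

    Nodes match only when their labels are equal. Leaves with the same
    part-of-speech tag but different words generalize to the word "*".
    """
    if a[0] != b[0]:
        return None
    a_leaf = isinstance(a[1], str)
    b_leaf = isinstance(b[1], str)
    if a_leaf != b_leaf:
        return None
    if a_leaf:
        word = a[1] if a[1].lower() == b[1].lower() else "*"
        return (a[0], word)
    children = align(a[1:], b[1:])
    return (a[0], *children) if children else None


def align(xs, ys):
    """Order-preserving alignment of two child lists.

    A longest-common-subsequence style dynamic program that keeps the
    pairing whose generalized children contain the most nodes.
    """
    n, m = len(xs), len(ys)
    # best[i][j] = (score, generalized children) for xs[i:] and ys[j:]
    best = [[(0, [])] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            candidates = [best[i + 1][j], best[i][j + 1]]
            g = generalize(xs[i], ys[j])
            if g is not None:
                score, rest = best[i + 1][j + 1]
                candidates.append((score + size(g), [g] + rest))
            best[i][j] = max(candidates, key=lambda c: c[0])
    return best[0][0][1]


# Hypothetical example: two noun phrases from search results.
phrase_a = ("NP", ("JJ", "digital"), ("NN", "camera"),
            ("PP", ("IN", "with"), ("NP", ("NN", "zoom"))))
phrase_b = ("NP", ("JJ", "compact"), ("NN", "camera"),
            ("PP", ("IN", "with"), ("NP", ("NN", "flash"))))

common = generalize(phrase_a, phrase_b)
print(common)
```

For these two phrases the sketch keeps the shared words "camera" and "with" and replaces the differing adjective and noun with `"*"`, giving a pattern along the lines of "[adjective] camera with [noun]". In the process the abstract describes, such commonality expressions are what become parameters of existing entities and candidates for new taxonomy entities at the next iteration.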


Bibliographic details

Source
Engineering Applications of Artificial Intelligence, 2013, No. 10, pp. 2504-2515 (12 pages)
Author
Boris A. Galitsky
Affiliation
Ebay Inc., San Jose, CA 95125, USA
Original format: PDF
Language: English
Keywords
Learning taxonomy; Learning syntactic parse tree; Transfer learning; Syntactic generalization; Search relevance;


