首页> 外国专利> DOMAIN-SPECIFIC METHOD FOR DISTINGUISHING TYPE-DENOTING DOMAIN TERMS FROM ENTITY-DENOTING DOMAIN TERMS

DOMAIN-SPECIFIC METHOD FOR DISTINGUISHING TYPE-DENOTING DOMAIN TERMS FROM ENTITY-DENOTING DOMAIN TERMS

机译:从实体否定域术语中区分类型否定域术语的特定域方法

摘要

Large lists of domain-specific terms are classified as a particular kind of linguistic object, e.g., lexical answer type T versus canonical answer E, based on features from a domain-specific corpus which have been found to distinguish between the linguistic objects. The distinguishing features can be identified in the corpus based on sets of the linguistic objects derived from question-and-answer pairs. A classifier can be trained using the distinguishing features, and the classification carried out using that classifier. The distinguishing features can include one or more syntactic features or one or more lexical features. The linguistic objects (the T and E training sets) can be extracted from the question-and-answer pairs automatically via text analysis if manually curated lists are not available. The classified terms can be included in a domain-specific lexicon which facilitates a deep question answering system to yield an answer to a question.
机译:根据发现特定领域语料库中的语言特征,将大量特定领域术语分类为特定类型的语言对象,例如词汇答案类型T与规范答案E。可以基于从问题和答案对中获得的语​​言对象集,在语料库中识别出区别特征。可以使用区别特征来训练分类器,并使用该分类器进行分类。区别特征可以包括一个或多个句法特征或一个或多个词法特征。如果没有手动策划的列表,则可以通过文本分析从问答对中自动提取语言对象(T和E训练集)。可以将分类的术语包括在特定领域的词典中,该词典有助于深度问题回答系统产生问题的答案。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号