Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering
Abstract
Embodiments of the present invention provide systems, methods, and computer storage media for identifying textual similarity and performing answer selection. A textual-similarity computing model can use a pre-trained language model to generate vector representations of a question and a candidate answer from a target corpus. The target corpus can be clustered into latent topics (or other latent groupings), and the probabilities of a question or candidate answer belonging to each latent topic can be calculated and condensed (e.g., downsampled) to improve performance and focus on the most relevant topics. The condensed probabilities can be aggregated and combined with a downstream vector representation of the question (or answer), so that the model can use focused topical and other categorical information as auxiliary information in a similarity computation. In training, transfer learning may be applied from a large-scale corpus, and the conventional list-wise approach can be replaced with point-wise learning.
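The condensing and aggregation step described in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the patent: the function names (`latent_cluster_info`, `augment`), the use of a plain dot product as the topic similarity, top-k selection as the form of "condensing", and a softmax over the kept scores to obtain probabilities. In a trained model the topic vectors would be learned and the sentence vector would come from the pre-trained language model.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def latent_cluster_info(sentence_vec, topic_vecs, k):
    """Return (kept topic indices, condensed probabilities, aggregated topic vector).

    Assumption: similarity to each latent topic is a plain dot product;
    a learned bilinear form would be a common alternative.
    """
    scores = [dot(sentence_vec, t) for t in topic_vecs]
    # Condense: keep only the k highest-scoring topics (hypothetical choice
    # of downsampling; the abstract only says "e.g., downsampled").
    kept = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    probs = softmax([scores[i] for i in kept])
    # Aggregate: probability-weighted sum of the kept topic vectors.
    dim = len(topic_vecs[0])
    aggregated = [
        sum(p * topic_vecs[i][d] for p, i in zip(probs, kept)) for d in range(dim)
    ]
    return kept, probs, aggregated


def augment(sentence_vec, topic_vecs, k):
    """Concatenate the sentence vector with its aggregated topic information."""
    _, _, aggregated = latent_cluster_info(sentence_vec, topic_vecs, k)
    return list(sentence_vec) + aggregated


# Toy example: three 2-dimensional latent topics and one sentence vector.
topics = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
question = [2.0, 1.0]
kept, probs, aggregated = latent_cluster_info(question, topics, k=2)
augmented = augment(question, topics, k=2)
```

In this toy setup the third topic is dropped before normalization, so the probabilities are spread only over the two most relevant topics, and the augmented vector is what a downstream similarity computation would consume alongside the candidate answer's equivalent.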