首页> 外文期刊>Engineering Applications of Artificial Intelligence >SpaceLDA: Topic distributions aggregation from a heterogeneous corpus for space systems
【24h】

SpaceLDA: Topic distributions aggregation from a heterogeneous corpus for space systems

机译:spacelda:主题分布来自空间系统的异构语料库的聚合

获取原文
获取原文并翻译 | 示例
           

摘要

The design of highly complex systems such as spacecraft entails large amounts of documentation. Tracking relevant information, including hundreds of requirements, throughout several design stages is a challenge. In this study, we propose a novel strategy based on Topic Modelling to facilitate the management of spacecraft design requirements. We introduce spaceLDA, a novel domain-specific semi-supervised Latent Dirichlet Allocation (LDA) model enriched with lexical priors and an optimised Weighted Sum (WS). We collect and curate the first large collection of unstructured data related to space systems, combining several sources: Wikipedia pages, books, and feasibility reports provided by the European Space Agency (ESA). We train the spaceLDA model on three subsets of our heterogeneous training corpus. To combine the resulting per-document topic distributions, we enrich our model with an aggregation method based on an optimised WS. We evaluate our model through a case study, a categorisation of spacecraft design requirements. We finally compare our model's performance with an unsupervised LDA model and with a literature aggregation method. The results demonstrate that the spaceLDA model successfully identifies the topics of requirements and that our proposed approach surpasses the use of a classic LDA model and the state of the art aggregation method.
机译:高度复杂的系统如航天器的设计需要大量的文档。追踪相关信息,包括数百个要求,整个设计阶段都是一个挑战。在本研究中,我们提出了一种基于主题建模的新型战略,以促进航天器设计要求的管理。我们介绍了Spacelda,一种新型的域特定的半监督潜在Dirichlet分配(LDA)模型,富有词汇前沿和优化的加权和(WS)。我们收集并策划与空间系统相关的第一个大型非结构化数据集合,结合了欧洲航天局(ESA)提供的若干来源:维基百科页面,书籍,书籍和可行性报告。我们在我们异质训练语料库的三个子集中训练Spacelda模型。要组合生成的每个文档主题分布,我们将通过基于优化的WS的聚合方法来丰富我们的模型。我们通过案例研究评估我们的模型,对航天器设计要求进行分类。我们终于将模型的性能与无人监督的LDA模型进行了比较,并具有文献聚合方法。结果表明,Spacelda模型成功地识别了要求的主题,并且我们所提出的方法超越了经典LDA模型的使用和最先进的聚合方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号