首页> 外国专利> TEXT MINING SYSTEM AND TOOL

TEXT MINING SYSTEM AND TOOL

机译:文本挖掘系统和工具

摘要

A text mining system for extracting relevant text from a plurality of input data sets is provided. The text mining system includes an input interface module configured to enable one or more users to select a plurality of sources for a plurality of input data sets. The text mining system also includes a text analysis module configured to receive the plurality of input data sets and to generate an output data set by analyzing the plurality of input data sets. The text analysis module includes a data handling module configured to convert the plurality of input data sets to an analytics text set. The text analysis module also includes an exploratory analysis module configured to determine a plurality of correlations within the analytics text set. The text analysis module further includes a topic modeling module configured to identify a plurality of topics repeatedly occurring in the analytics text set and a reporting module configured to generate a plurality of reports for the text analysis module. The text mining system further includes memory circuitry configured to store the plurality of input data sets, the analytics text set and the output data set.
机译:提供了一种文本挖掘系统,用于从多个输入数据集中提取相关文本。文本挖掘系统包括输入接口模块,该输入接口模块被配置为使一个或多个用户能够为多个输入数据集选择多个源。文本挖掘系统还包括文本分析模块,该文本分析模块被配置为接收多个输入数据集并通过分析多个输入数据集来生成输出数据集。文本分析模块包括数据处理模块,该数据处理模块被配置为将多个输入数据集转换为分析文本集。文本分析模块还包括配置为确定分析文本集中的多个相关性的探索性分析模块。文本分析模块还包括:主题建模模块,被配置为识别在分析文本集中重复出现的多个主题;以及报告模块,被配置为生成用于文本分析模块的多个报告。文本挖掘系统还包括被配置为存储多个输入数据集,分析文本集和输出数据集的存储器电路。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号