首页> 外文会议>Recent advances in neural networks, fuzzy systems amp; evolutionary computing >Text Mining Documents in Electronic Data Interchange Environment
【24h】

Text Mining Documents in Electronic Data Interchange Environment

机译:电子数据交换环境中的文本挖掘文档

获取原文
获取原文并翻译 | 示例

摘要

The internet is a huge source of documents, containing a massive number of texts presented in multilingual languages on a wide range of topics. These texts are demonstrating in an electronic documents format hosted on the web. The documents exchanged using special forms in an Electronic Data Interchange (EDI) environment. Using web text mining approaches to mine documents in EDI environment could be new challenging guidelines in web text mining. Applying text-mining approaches to discover knowledge previously unknown patters retrieved from the web documents by using partitioned cluster analysis methods such as k- means methods using Euclidean distance measure algorithm for EDI text document datasets is unique area of research these days. Our experiments employ the standard K-means algorithm on EDI text documents dataset that most commonly used in electronic interchange. We also report some results using text mining clustering application solution called WEKA. This study will provide high quality services to any organization that is willing to use the system.
机译:互联网是文档的巨大来源,其中包含大量以多种语言显示的,涉及广泛主题的文本。这些文本以网络上托管的电子文档格式进行演示。在电子数据交换(EDI)环境中使用特殊形式交换的文档。使用Web文本挖掘方法在EDI环境中挖掘文档可能是Web文本挖掘中新的具有挑战性的指南。通过使用分区聚类分析方法(例如使用Euclidean距离测量算法的k-means方法)对EDI文本文档数据集应用文本挖掘方法来发现从Web文档中检索到的先前未知模式的知识,这是当今研究的独特领域。我们的实验在电子交换中最常用的EDI文本文档数据集中采用了标准的K-means算法。我们还使用称为WEKA的文本挖掘群集应用程序解决方案报告了一些结果。这项研究将为愿意使用该系统的任何组织提供高质量的服务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号