Conference: Indian International Conference on Artificial Intelligence

Standardization of Unstructured Textual Data into Semantic Web Format

Abstract

Analyses of the data posted on the World Wide Web (WWW) indicate that more than 80% of it is unstructured text. Extracting information from text is therefore of paramount importance for both academic and business purposes. At the same time, the evolution of web technology has led to the concept of the Semantic Web, an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation. Integrating voluminous legacy text data, both unstructured and semi-structured, into Semantic Web format is a challenging task for the research community. This paper attempts to marry the concept of the Semantic Web format with unstructured text, so that computers can discover previously unknown information by automatically extracting it from different written resources.
