首页> 外文会议>International Conference on Internet of Things: Smart Innovation and Usages >Extraction and Analysis of Information in News Domain Using Semantic Web
【24h】

Extraction and Analysis of Information in News Domain Using Semantic Web

机译:使用语义网络提取和分析新闻域中的信息

获取原文

摘要

News-papers, blogs, and web-pages are a rich and diverse source of textual information. However, the information contained in these sources cannot be manually extracted, recorded, and indexed, mainly because it comes in a massive size. Moreover, the extraction of some information sometimes requires specific knowledge or technical background. This is the case in the news domain where we need to extract the relevant news from a lot of available information. In order to scale knowledge extraction to the large size of available textual information, and build extractors specific to a certain field various techniques are applied over the unstructured data so that it can be made available to the users. This could help the researchers and the news readers or users to find relevant information in less time and with great ease. This study aims to review all the approaches and techniques done so far, for the information retrieval, search ability and its analysis and it also proposed an idea for better searching that reduces the time complexity to extract the data and also reduces human intervention. This is a better idea to put forward which also helps in filtering of irrelevant data and thus integrates only the relevant data to create a better space for the news data.
机译:新闻报道,博客和网页是一种丰富而多样化的文本信息。但是,这些来源中包含的信息不能手动提取,记录和索引,主要是因为它具有大小。此外,一些信息的提取有时需要具体的知识或技术背景。这是新闻领域的情况,我们需要从大量可用信息中提取相关新闻。为了使知识提取缩放到大尺寸的可用文本信息,并且在非结构化数据上应用特定于某个字段的构建提取器,以便可以向用户提供。这可以帮助研究人员和新闻读者或用户在更短的时间内找到相关信息,并且很放松。本研究旨在审查到目前为止所做的所有方法和技术,用于了解信息检索,搜索能力及其分析,并提出了一个更好地搜索的想法,这减少了提取数据的时间复杂性并降低了人为干预。这是一个更好的想法,提出它还有助于过滤无关数据,因此仅集成相关数据以为新闻数据创建更好的空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号