首页> 外文会议>IEEE Congress on Services >Automated Data Augmentation Services Using Text Mining, Data Cleansing and Web Crawling Techniques
【24h】

Automated Data Augmentation Services Using Text Mining, Data Cleansing and Web Crawling Techniques

机译:使用文本挖掘,数据清洁和网络爬网技术自动化数据增强服务

获取原文

摘要

There is a large amount of information about celebrities spread all over the web hidden inside innumerable news and blogs, pictures on Flickr or videos on YouTube. Having this information combined and aggregated would be of great benefit to many customers. In this document we will describe the architecture and the (business) value of a system that not only collates information pre-formatted by other web services but also provides a self-developed named entity recognition algorithm for extracting the names of celebrities from different data sources and then processes and enriches them by our mash-up application.
机译:有些关于名人在无数新闻和博客中的网络传播的大量信息,在Flickr或YouTube上的视频上的图片。将这些信息合并和汇总对许多客户有很大的好处。在本文档中,我们将描述系统的架构和(业务)值,该系统不仅会通过其他Web服务预先格式化的信息,还提供了一种自开发的命名实体识别算法,用于从不同的数据源中提取名人的名称然后通过我们的混搭应用来处理并丰富它们。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号