首页> 外文会议>International Symposium on Symbolic and Numeric Algorithms for Scientific Computing >What's Been Happening in the Romanian News Landscape? A Detailed Analysis Grounded in Natural Language Processing Techniques
【24h】

What's Been Happening in the Romanian News Landscape? A Detailed Analysis Grounded in Natural Language Processing Techniques

机译:罗马尼亚新闻景观发生了什么?在自然语言处理技术中接地的详细分析

获取原文

摘要

People strive to be connected to events happening worldwide in terms of politics, technology, sports, business, and many other domains. The main source of news today resides in online publications which can strongly influence the public opinion. Our purpose is to build a comprehensive automated pipeline, integrating various Natural Language Processing techniques, to process online news written in the Romanian language. Our dataset consists of 631,565 news articles from various Romanian publications between May 2004 and December 2019 which are used to detect semantic similarities between articles and rank various publications in terms of their influence. Furthermore, we created visualizations to ease the understanding of results and ensure efficient text retrieval over the gathered articles. In the future, we plan to apply opinion mining, geographical names extraction and content quality assessments relating, for example, to the likelihood of being a fake news.
机译:人们在政治,技术,体育,商业和许多其他领域方面努力与全球发生的事件。今天的主要新闻来源纳入在线出版物,这会强烈影响舆论。我们的目的是建立一个全面的自动化管道,整合各种自然语言处理技术,处理用罗马尼亚语编写的在线新闻。我们的数据集由2004年5月至2019年5月之间的各种罗马尼亚出版物的631,565名新闻文章组成,用于检测文章之间的语义相似之处,并在其影响方面排名各种出版物。此外,我们创建了可视化以简化对结果的理解,并确保通过收集的文章进行有效的文本检索。未来,我们计划申请意见采矿,地理名称提取和内容质量评估,例如,有可能是假新闻的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号