首页> 外文会议>Pacific Asia Conference on Language, Information and Computation >Comparative Analyses of Textual Contents and Styles of Five Major Japanese Newspapers
【24h】

Comparative Analyses of Textual Contents and Styles of Five Major Japanese Newspapers

机译:五大日本报刊文本内容与风格的比较分析

获取原文

摘要

Newspapers remain an important media from which people obtain a wide variety of information. In Japan, there are five major newspapers, having their own opinions and ideologies. Although these are readily recognized, they are infrequently investigated from the viewpoint of their textual characteristics. This study analyzes these differences among the five newspaper editorials. We apply morphological analysis and count the frequency of morphemes within the text data. We then apply principal component analysis and random forests classification experiments to examine their similarities and differences. Throughout these statistical analyses, we use function words and content words as features, which enables us to determine which of the two characteristics -styles or content- more powerfully affects the classification types. This study contributes to text classification studies by deliberately comparing the classification performances provided by different feature sets, function words and content words. In addition, this study will provide an empirical basis for understanding the similarities and differences among the five newspapers.
机译:报纸仍然是人们获得各种信息的重要媒体。在日本,有五大报纸,拥有自己的意见和意识形态。虽然这些是容易认识到的,但从文本特征的观点出发时,它们很少调查。本研究分析了五家报纸编辑之间的这些差异。我们应用形态分析并计算文本数据中的语素频率。然后,我们将主成分分析和随机森林分类实验应用以检查其相似性和差异。在整个统计分析中,我们使用功能单词和内容单词作为特征,使我们能够确定两个特征 - 或内容中的哪一个 - 更有力地影响分类类型。本研究通过故意比较不同特征集,功能单词和内容词提供的分类性能来有助于文本分类研究。此外,本研究将为理解五个报纸的异同和差异提供实证基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号