首页> 外文会议>International Conference on Information Science and Technology >An Empirical Study of Linear Dimensionality Reduction for Judicial Predictive Models
【24h】

An Empirical Study of Linear Dimensionality Reduction for Judicial Predictive Models

机译:司法预测模型线性降维的实证研究

获取原文

摘要

Judicial cases can be modeled with the textual frequency vectors under the Bag-of-Words assumption to predict the decision outcome. However, such models are often with much more numbers of features than training samples, which usually leads to the over fitting problem. In this paper, we conduct an empirical investigation on linear dimensionality reduction of the high-dimensional judicial predictive models via the wide spread principal component analysis approach. The experimental results show that these high-dimensional models do not suffer from the overfitting problem, but the under fitting problem. Moreover, the higher-order dependency in the textual frequency data cannot be decorrelated by the linear dimensionality reduction approach, which restrains the performance of judicial classification models subject to the unchanged level of signal-noise ratio in the derived low-dimensional features.
机译:可以在“词袋”假设下使用文本频率向量对司法案件进行建模,以预测决策结果。但是,此类模型通常比训练样本具有更多的特征,这通常会导致过度拟合的问题。在本文中,我们通过广泛使用的主成分分析方法对高维司法预测模型的线性降维进行了实证研究。实验结果表明,这些高维模型不存在过度拟合问题,而是存在拟合不足的问题。此外,文本频率数据中的高阶依存关系不能通过线性降维方法去相关,这限制了司法分类模型的性能,因为在导出的低维特征中信噪比保持不变。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号