首页> 外文会议>Pacific Rim international conference on artificial intelligence >Automatically Classifying Chinese Judgment Documents Using Character-Level Convolutional Neural Networks
【24h】

Automatically Classifying Chinese Judgment Documents Using Character-Level Convolutional Neural Networks

机译:使用字符级卷积神经网络对中国裁判文书进行自动分类

获取原文

摘要

Judgment is a decision by a court or other tribunal that resolves a controversy and determines the rights and obligations of the parties. Since the establishment of the China Judgments Online System, more and more judgment documents have been stored online. With the explosive growth of the number of Chinese judgment documents, the need for automated classification methods is getting increasingly urgent. For Chinese data sets, traditional word-level methods often bring extra errors in word segmentation. In this paper, we proposed an approach based on character-level convolutional neural networks to automatically classify Chinese judgment documents. Different from traditional machine learning methods, we hand over the work of feature detection to the model. Throughout the process, the only part that requires human labor is labeling the category of each original documents. In order to prevent overfitting when the amount of training data is not very large, we use a shallow model which has only one convolution layer. The proposed approach does well in achieving high classification accuracy based on 7923 pieces of Chinese judgment documents. In the meanwhile, the effectiveness of our model is satisfactory.
机译:判决是法院或其他法庭解决争议并确定当事方权利和义务的决定。自中国裁判在线系统建立以来,越来越多的裁判文件被在线存储。随着中国判决书数量的爆炸性增长,对自动分类方法的需求日益迫切。对于中文数据集,传统的单词级方法经常在分词中带来额外的错误。在本文中,我们提出了一种基于字符级卷积神经网络的自动对中文判断文档进行分类的方法。与传统的机器学习方法不同,我们将特征检测的工作移交给了模型。在整个过程中,唯一需要人工的部分是标记每个原始文档的类别。为了防止训练数据量不是很大时的过度拟合,我们使用了只有一个卷积层的浅层模型。所提出的方法在基于7923份中文判断文件的基础上,可以很好地实现较高的分类精度。同时,我们模型的有效性令人满意。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号