...
首页> 外文期刊>Complex & Intelligent Systems >An innovative multi-segment strategy for the classification of legal judgments using the k-nearest neighbour classifier
【24h】

An innovative multi-segment strategy for the classification of legal judgments using the k-nearest neighbour classifier

机译:使用k最近邻分类器对法律判决进行分类的创新多段策略

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Abstract The classification of legal documents has been receiving considerate attention over the last few years. This is mainly because of the over-increasing amount of legal information that is being produced on a daily basis in the courts of law. In the Republic of Mauritius alone, a total of 141,164 cases were lodged in the different courts in the year 2015. The Judiciary of Mauritius is becoming more efficient due to a number of measures which were implemented and the number of cases disposed of in each year has also risen significantly; however, this is still not enough to catch up with the increase in the number of new cases that are lodged. In this paper, we used the k-nearest neighbour machine learning classifier in a novel way. Unlike news article, judgments are complex documents which usually span several pages and contains a variety of information about a case. Our approach consists of splitting the documents into equal-sized segments. Each segment is then classified independently of the others. The selection of the predicted category is then done through a plurality voting procedure. Using this novel approach, we have been able to classify law cases with an accuracy of over 83.5%, which is 10.5% higher than when using the whole documents dataset. To the best of our knowledge, this type of process has never been used earlier to categorise legal judgments or other types of documents. In this work, we also propose a new measure called confusability to measure the degree of scatteredness in a confusion matrix.
机译:摘要近年来,法律文件的分类一直受到重视。这主要是由于每天在法院产生的法律信息过多。仅在毛里求斯共和国,2015年就在不同法院受理了141,164起案件。由于采取了许多措施,每年司法机关的效率也有所提高。也显着上升;然而,这仍然不足以跟上新案件数量的增加。在本文中,我们以新颖的方式使用了k最近邻机器学习分类器。与新闻不同,判决是复杂的文档,通常跨越几页,并且包含有关案件的各种信息。我们的方法包括将文档分成相等大小的段。然后,将每个段独立于其他段进行分类。然后,通过多个投票程序来完成对预测类别的选择。使用这种新颖的方法,我们已经能够以超过83.5%的准确性对法律案件进行分类,这比使用整个文档数据集的准确性高10.5%。据我们所知,这种类型的过程从未被用来对法律判决或其他类型的文件进行分类。在这项工作中,我们还提出了一种称为可混淆性的新度量,用于度量混淆矩阵中的分散程度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号