【24h】

Legal Area Classification: A Comparative Study of Text Classifiers on Singapore Supreme Court Judgments

机译:法律领域分类:新加坡最高法院判决文本分类器的比较研究

获取原文

摘要

This paper conducts a comparative study on the performance of various machine learning ('ML') approaches for classifying judgments into legal areas. Using a novel dataset of 6,227 Singapore Supreme Court judgments, we investigate how state-of-the-art NLP methods compare against traditional statistical models when applied to a legal corpus that comprised few but lengthy documents. All approaches tested, including topic model, word embedding, and language model-based classifiers, performed well with as little as a few hundred judgments. However, more work needs to be done to optimize state-of-the-art methods for the legal domain.
机译:本文对各种将判决分类为法律领域的机器学习('ML')方法的性能进行了比较研究。我们使用一个包含6,227个新加坡最高法院判决的新颖数据集,研究了将最先进的NLP方法与传统统计模型相比较时,该方法适用于包含很少但冗长文档的法律语料库。经过测试的所有方法,包括主题模型,单词嵌入和基于语言模型的分类器,都可以在仅进行几百次判断的情况下就表现良好。但是,需要做更多的工作来优化法律领域的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号