首页> 外文会议>IEEE International Conference on Data Mining Workshops >Using Unlabeled Data for US Supreme Court Case Classification
【24h】

Using Unlabeled Data for US Supreme Court Case Classification

机译:使用未标记数据的美国最高法院案例分类

获取原文

摘要

The Supreme Court Database provided by Washington University (in St. Louis) School of Law is an essential legal research tool. The Supreme Court Database is organized and categorized to Issue Areas to make it easy for legal researchers to find on-point cases for an area of law. This paper used a semi-supervised learning approach to automatically categorize the Supreme Court's opinions to Issue Areas. An inductive method of clustering then labeling approach was used by employing a nonmetric space of a fast Hierarchical Navigable Small World graph index containing USE (Universal Sentence Encoder) embeddings. After obtaining the labels from the semi-supervised approach, we evaluate several classification approaches to use with the data achieving the weighted average F1-Scores: SVM with Max Norm Features 0.75, RNN 0.78, and BERT 0.68
机译:华盛顿大学(在圣路易斯)提供的最高法院数据库是一个必不可少的法律研究工具。最高法院数据库组织并分类为发出领域,使法律研究人员容易找到一个法律领域的点案例。本文采用半监督学习方法自动将最高法院对发布领域的意见进行分类。通过使用包含使用(通用句子编码器)嵌入的快速分级导航的小型世界图索引的非格式空间来使用群集的归纳方法。从半监督方法获取标签后,我们评估了多种分类方法,以便与实现加权平均f1分数的数据一起使用:SVM具有MAX规范的特点0.75,RNN 0.78和BERT 0.68

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号