Using Unlabeled Data for US Supreme Court Case Classification

Abstract

The Supreme Court Database, provided by the Washington University (in St. Louis) School of Law, is an essential legal research tool. It is organized and categorized by Issue Area so that legal researchers can easily find on-point cases for an area of law. This paper uses a semi-supervised learning approach to categorize the Supreme Court's opinions into Issue Areas automatically. An inductive cluster-then-label method is used, employing the nonmetric space of a fast Hierarchical Navigable Small World (HNSW) graph index containing Universal Sentence Encoder (USE) embeddings. After obtaining the labels from the semi-supervised approach, we evaluate several classification approaches on the data, achieving the following weighted-average F1 scores: SVM with Max Norm Features 0.75, RNN 0.78, and BERT 0.68.
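
The labeling and scoring ideas named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: brute-force cosine search stands in for the HNSW index, toy 2-D vectors stand in for USE embeddings, and the function names, the value of k, and the two Issue Area labels are hypothetical choices made for illustration only.

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def label_by_neighbors(labeled, unlabeled, k=3):
    """Give each unlabeled vector the majority label of its k nearest labeled vectors.

    Assumption: exhaustive search replaces the approximate HNSW lookup.
    """
    predictions = []
    for vec in unlabeled:
        ranked = sorted(labeled, key=lambda item: cosine(vec, item[0]), reverse=True)
        votes = Counter(label for _, label in ranked[:k])
        predictions.append(votes.most_common(1)[0][0])
    return predictions

def weighted_f1(y_true, y_pred):
    """Per-class F1, averaged with weights equal to each class's count in y_true."""
    total = 0.0
    for cls, support in Counter(y_true).items():
        tp = sum(t == cls and p == cls for t, p in zip(y_true, y_pred))
        fp = sum(t != cls and p == cls for t, p in zip(y_true, y_pred))
        fn = support - tp
        f1 = 2 * tp / (2 * tp + fp + fn) if tp else 0.0
        total += f1 * support
    return total / len(y_true)

# Toy 2-D vectors standing in for sentence embeddings (hypothetical data).
labeled = [
    ([1.0, 0.1], "Criminal Procedure"),
    ([0.9, 0.2], "Criminal Procedure"),
    ([0.1, 1.0], "Civil Rights"),
    ([0.2, 0.9], "Civil Rights"),
]
unlabeled = [[0.95, 0.05], [0.05, 0.95], [0.8, 0.3]]

predicted = label_by_neighbors(labeled, unlabeled, k=3)
print(predicted)
```

At realistic scale an approximate nearest-neighbor index would replace the exhaustive search, and the propagated labels would then serve as training data for a downstream classifier, which is where the weighted-average F1 score applies.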

Bibliographic details

Source: IEEE International Conference on Data Mining Workshops, 2020, pp. 799-804 (6 pages)
Author: George Sanchez
Original format: PDF
Keywords: Training; Law; Databases; Semisupervised learning; Data models; Task analysis; Testing
