首页> 外文会议>IEEE International Conference on Big Data >Explainable Text Classification in Legal Document Review A Case Study of Explainable Predictive Coding

【24h】

Explainable Text Classification in Legal Document Review A Case Study of Explainable Predictive Coding

机译：法律文件审查中的可解释文本分类-以可解释预测编码为例

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In today's legal environment, lawsuits and regulatory investigations require companies to embark upon increasingly intensive data-focused engagements to identify, collect and analyze large quantities of data. When documents are staged for review - where they are typically assessed for relevancy or privilege - the process can require companies to dedicate an extraordinary level of resources, both with respect to human resources, but also with respect to the use of technology-based techniques to intelligently sift through data. Companies regularly spend millions of dollars producing `responsive' electronically-stored documents for these types of matters. For several years, attorneys have been using a variety of tools to conduct this exercise, and most recently, they are accepting the use of machine learning techniques like text classification (referred to as predictive coding in the legal industry) to efficiently cull massive volumes of data to identify responsive documents for use in these matters. In recent years, a group of AI and Machine Learning researchers have been actively researching Explainable AI. In an explainable AI system, actions or decisions are human understandable. In typical legal `document review' scenarios, a document can be identified as responsive, as long as one or more of the text snippets (small passages of text) in a document are deemed responsive. In these scenarios, if predictive coding can be used to locate these responsive snippets, then attorneys could easily evaluate the model's document classification decision. When deployed with defined and explainable results, predictive coding can drastically enhance the overall quality and speed of the document review process by reducing the time it takes to review documents. Moreover, explainable predictive coding provides lawyers with greater confidence in the results of that supervised learning task. The authors of this paper propose the concept of explainable predictive coding and simple explainable predictive coding methods to locate responsive snippets within responsive documents. We also report our preliminary experimental results using the data from an actual legal matter that entailed this type of document review. The purpose of this paper is to demonstrate the feasibility of explainable predictive coding in the context of professional services in the legal space.

机译：在当今的法律环境中，诉讼和监管调查要求公司着手进行越来越密集的以数据为中心的活动，以识别，收集和分析大量数据。当文件准备进行审核时（通常会评估它们的相关性或特权），该流程可能会要求公司在人力资源以及使用基于技术的技术方面投入非凡的资源水平。智能筛选数据。公司通常会花费数百万美元来针对此类事件制作“响应式”电子存储文档。几年来，律师一直在使用各种工具来进行此练习，最近，他们接受了机器学习技术的使用，例如文本分类（在法律行业中称为预测编码），以有效地剔除大量数据以识别用于这些问题的响应文件。近年来，一群AI和机器学习研究人员一直在积极研究Explainable AI。在可解释的AI系统中，动作或决定是人类可以理解的。在典型的法律“文档审阅”方案中，只要文档中的一个或多个文本片段（少量文本段落）被认为是响应性的，就可以将文档标识为响应性的。在这些情况下，如果可以使用预测编码来定位这些响应片段，那么律师可以轻松评估模型的文档分类决策。当部署具有定义和可解释的结果时，预测编码可以通过减少审阅文档所花费的时间来极大地提高文档审阅过程的整体质量和速度。此外，可解释的预测编码使律师对监督学习任务的结果更有信心。本文的作者提出了可解释的预测编码和简单的可解释的预测编码方法的概念，以在响应文档中定位响应片段。我们还使用涉及此类文件审查的实际法律事务中的数据报告了初步的实验结果。本文的目的是证明在法律领域的专业服务范围内，可解释性预测编码的可行性。

著录项

来源
《IEEE International Conference on Big Data》|2018年|1905-1911|共7页
会议地点
作者
Rishi Chhatwal; Peter Gronvall; Nathaniel Huber-Fliflet; Robert Keeling; Jianping Zhang; Haozhen Zhao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Predictive coding; Law; Predictive models; Text categorization; Machine learning;

机译：预测编码;法律;预测模型;文本分类;机器学习;
入库时间 2022-08-26 14:45:25

相似文献

外文文献
中文文献
专利

1. Building legal indexes to explain recovery rates: An analysis of the French and English bankruptcy codes☆ [J] . Regis Blazy, Bertrand Chopard, Nirjhar Nigam Journal of banking & finance . 2013,第6期

机译：建立解释恢复率的法律指标：对法国和英国破产法的分析☆
2. Explaining Data-Driven Document Classifications [J] . David Martens, Foster Provost MIS quarterly . 2014,第1期

机译：解释数据驱动的文档分类
3. autoBOT: evolving neuro-symbolic representations for explainable low resource text classification [J] . Skrlj Blaz, Martinc Matej, Lavrac Nada, Machine Learning . 2021,第5期

机译：autobot：演化的神经符号表示可解释的低资源文本分类
4. Explainable Text Classification in Legal Document Review A Case Study of Explainable Predictive Coding [C] . Rishi Chhatwal, Peter Gronvall, Nathaniel Huber-Fliflet, IEEE International Conference on Big Data . 2018

机译：法律文献审查中的可解释文本分类审查的案例研究可解释的预测编码
5. Predictive Coding Techniques With Manual Review to Identify Privileged Documents in E-Discovery [D] . Vinjumur, Jyothi K. 2018

机译：具有人工审查功能的预测编码技术，以识别电子发现中的特权文档
6. Predictive Coding Explains Auditory and Tactile Influences on Vision during Binocular Rivalry [O] . Bradley N. Jack, Graeme Hacker 2014

机译：预测性编码解释了双眼竞争中听觉和触觉对视觉的影响
7. Empirical Study of Deep Learning for Text Classification in Legal Document Review [O] . Fusheng Wei, Han Qin, Shi Ye, 2018

机译：法律文献综述中文本分类深度学习的实证研究

Explainable Text Classification in Legal Document Review A Case Study of Explainable Predictive Coding

摘要

著录项

相似文献

相关主题

期刊订阅