首页> 外文会议>IEEE International Conference on Big Data >Explainable Text Classification in Legal Document Review A Case Study of Explainable Predictive Coding
【24h】

Explainable Text Classification in Legal Document Review A Case Study of Explainable Predictive Coding

机译:法律文件审查中的可解释文本分类-以可解释预测编码为例

获取原文

摘要

In today's legal environment, lawsuits and regulatory investigations require companies to embark upon increasingly intensive data-focused engagements to identify, collect and analyze large quantities of data. When documents are staged for review - where they are typically assessed for relevancy or privilege - the process can require companies to dedicate an extraordinary level of resources, both with respect to human resources, but also with respect to the use of technology-based techniques to intelligently sift through data. Companies regularly spend millions of dollars producing `responsive' electronically-stored documents for these types of matters. For several years, attorneys have been using a variety of tools to conduct this exercise, and most recently, they are accepting the use of machine learning techniques like text classification (referred to as predictive coding in the legal industry) to efficiently cull massive volumes of data to identify responsive documents for use in these matters. In recent years, a group of AI and Machine Learning researchers have been actively researching Explainable AI. In an explainable AI system, actions or decisions are human understandable. In typical legal `document review' scenarios, a document can be identified as responsive, as long as one or more of the text snippets (small passages of text) in a document are deemed responsive. In these scenarios, if predictive coding can be used to locate these responsive snippets, then attorneys could easily evaluate the model's document classification decision. When deployed with defined and explainable results, predictive coding can drastically enhance the overall quality and speed of the document review process by reducing the time it takes to review documents. Moreover, explainable predictive coding provides lawyers with greater confidence in the results of that supervised learning task. The authors of this paper propose the concept of explainable predictive coding and simple explainable predictive coding methods to locate responsive snippets within responsive documents. We also report our preliminary experimental results using the data from an actual legal matter that entailed this type of document review. The purpose of this paper is to demonstrate the feasibility of explainable predictive coding in the context of professional services in the legal space.
机译:在当今的法律环境中,诉讼和监管调查要求公司着手进行越来越密集的以数据为中心的活动,以识别,收集和分析大量数据。当文件准备进行审核时(通常会评估它们的相关性或特权),该流程可能会要求公司在人力资源以及使用基于技术的技术方面投入非凡的资源水平。智能筛选数据。公司通常会花费数百万美元来针对此类事件制作“响应式”电子存储文档。几年来,律师一直在使用各种工具来进行此练习,最近,他们接受了机器学习技术的使用,例如文本分类(在法律行业中称为预测编码),以有效地剔除大量数据以识别用于这些问题的响应文件。近年来,一群AI和机器学习研究人员一直在积极研究Explainable AI。在可解释的AI系统中,动作或决定是人类可以理解的。在典型的法律“文档审阅”方案中,只要文档中的一个或多个文本片段(少量文本段落)被认为是响应性的,就可以将文档标识为响应性的。在这些情况下,如果可以使用预测编码来定位这些响应片段,那么律师可以轻松评估模型的文档分类决策。当部署具有定义和可解释的结果时,预测编码可以通过减少审阅文档所花费的时间来极大地提高文档审阅过程的整体质量和速度。此外,可解释的预测编码使律师对监督学习任务的结果更有信心。本文的作者提出了可解释的预测编码和简单的可解释的预测编码方法的概念,以在响应文档中定位响应片段。我们还使用涉及此类文件审查的实际法律事务中的数据报告了初步的实验结果。本文的目的是证明在法律领域的专业服务范围内,可解释性预测编码的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号