Journal: ACM Transactions on Information Systems

Jointly Minimizing the Expected Costs of Review for Responsiveness and Privilege in E-Discovery



Abstract

Discovery is an important aspect of the civil litigation process in the United States of America, in which all parties to a lawsuit are permitted to request relevant evidence from other parties. With the rapid growth of digital content, the emerging need for "e-discovery" has created a strong demand for techniques that can be used to review massive collections both for "responsiveness" (i.e., relevance) to the request and for "privilege" (i.e., presence of legally protected content that the party performing the review may have a right to withhold). In this process, the party performing the review may incur costs of two types, namely, annotation costs (deriving from the fact that human reviewers need to be paid for their work) and misclassification costs (deriving from the fact that failing to correctly determine the responsiveness or privilege of a document may adversely affect the interests of the parties in various ways). Relying exclusively on automatic classification would minimize annotation costs but could result in substantial misclassification costs, while relying exclusively on manual classification could generate the opposite consequences. This article proposes a risk minimization framework (called MINECORE, for "minimizing the expected costs of review") that seeks to strike an optimal balance between these two extreme stands. In MINECORE (a) the documents are first automatically classified for both responsiveness and privilege, and then (b) some of the automatically classified documents are annotated by human reviewers for responsiveness (typically by junior reviewers) and/or, in cascade, for privilege (typically by senior reviewers), with the overall goal of minimizing the expected cost (i.e., the risk) of the entire process. Risk minimization is achieved by optimizing, for both responsiveness and privilege, the choice of which documents to manually review. 
We present a simulation study in which classes from a standard text classification test collection (RCV1-v2) are used as surrogates for responsiveness and privilege. The results indicate that MINECORE can yield substantially lower total cost than any of a set of strong baselines.
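The review-or-not trade-off described in the abstract can be illustrated with a small sketch. This is not the MINECORE algorithm itself, which handles responsiveness and privilege jointly and in cascade. It is a simplified single-label version under stated assumptions: the classifier outputs a calibrated probability, a human review always yields the correct label, and the cost values and names (`Costs`, `expected_auto_cost`, `should_review`, `total_expected_cost`) are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Costs:
    """Hypothetical cost structure (illustrative values, not from the article)."""
    false_positive: float  # cost of wrongly labeling a document positive
    false_negative: float  # cost of wrongly labeling a document negative
    annotation: float      # cost of one human review of one document


def expected_auto_cost(p_positive: float, costs: Costs) -> float:
    """Expected misclassification cost of the cheaper automatic label."""
    cost_if_labeled_negative = p_positive * costs.false_negative
    cost_if_labeled_positive = (1.0 - p_positive) * costs.false_positive
    return min(cost_if_labeled_negative, cost_if_labeled_positive)


def should_review(p_positive: float, costs: Costs) -> bool:
    """Review manually only when the expected misclassification cost of the
    automatic label exceeds the annotation cost (assumes a perfect reviewer)."""
    return expected_auto_cost(p_positive, costs) > costs.annotation


def total_expected_cost(probabilities: Iterable[float], costs: Costs) -> float:
    """Expected cost of a collection when each document takes the cheaper
    of automatic labeling and manual review."""
    return sum(
        min(expected_auto_cost(p, costs), costs.annotation)
        for p in probabilities
    )


if __name__ == "__main__":
    costs = Costs(false_positive=5.0, false_negative=10.0, annotation=1.0)
    for p in (0.01, 0.5, 0.99):
        print(p, should_review(p, costs))
    print(total_expected_cost([0.01, 0.5, 0.99], costs))
```

With these illustrative costs, only documents whose classification is uncertain are sent to a reviewer; confidently classified documents keep their automatic label. This mirrors, in miniature, the balance between annotation cost and misclassification cost that the abstract describes.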
