...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Likelihood ratio estimation in forensic identification using similarity and rarity
【24h】

Likelihood ratio estimation in forensic identification using similarity and rarity

机译:基于相似度和稀有度的法医鉴定中的似然比估计

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Forensic identification is the task of determining whether or not observed evidence arose from a known source. It is useful to associate probabilities with identification/exclusion opinions, either for presentation in court or to evaluate the discriminative power of a given set of attributes. At present, in most forensic domains outside of DNA evidence, it is not possible to make such a statement since the necessary probability distributions cannot be computed with reasonable accuracy, although the probabilistic approach itself is well-understood. In principle, it involves determining a likelihood ratio (LR) - the ratio of the joint probability of the evidence and source under the identification hypothesis (that the evidence came from the source) and under the exclusion hypothesis (that the evidence did not arise from the source). Evaluating the joint probability is computationally intractable when the number of variables is even moderately large. It is also statistically infeasible since the number of parameters to be determined from the data is exponential with the number of variables. An approximate method is to replace the joint probability by another probability: that of distance (or similarity) between evidence and object under the two hypotheses. While this reduces to linear complexity with the number of variables, it is an oversimplification leading to errors. We consider a third method which decomposes the LR into a product of two factors, one based on distance and the other on rarity. This result, which is exact for the univariate Gaussian case, has an intuitive appeal - forensic examiners assign higher importance to rare feature values in the evidence and low importance to common feature values. We generalize this approach to more complex data such as vectors and graphs, which makes LR estimation computationally tractable. Empirical evaluations of the three methods, done with several data types (continuous features, binary features, multinomial and graph) and several modalities (handwriting with binary features, handwriting with multinomial features and footwear impressions with continuous features), show that the distance and rarity method is significantly better than the distance only method.
机译:法医鉴定是确定观察到的证据是否来自已知来源的任务。将概率与识别/排除意见相关联,这对于在法庭上陈述或评估给定属性集的判别力很有用。目前,在DNA证据之外的大多数法证领域,由于概率方法本身已被很好地理解,因此无法以合理的准确性计算必要的概率分布,因此无法做出这样的陈述。原则上,它涉及确定似然比(LR)-识别假设(证据来自来源)和排除假设(证据不来自证据)下证据与来源的联合概率之比。来源)。当变量的数量适中时,评估联合概率在计算上是棘手的。这也是统计上不可行的,因为根据数据确定的参数数量与变量数量成指数关系。一种近似方法是用另一种概率代替联合概率:在两个假设下,证据与物体之间的距离(或相似性)。尽管这减少了变量数量的线性复杂度,但过分简化会导致错误。我们考虑第三种方法,它将LR分解为两个因素的乘积,一个基于距离,另一个基于稀有度。这个结果对于单变量高斯案例是准确的,具有直观的吸引力-法医检查员将证据中的稀有特征值分配为更高的重要性,而将共同特征值分配为较低的重要性。我们将这种方法推广到更复杂的数据,例如矢量和图形,这使得LR估计在计算上易于处理。对这三种方法的实证评估是对几种数据类型(连续特征,二进制特征,多项式和图形)和几种模态(具有二进制特征的手写,具有多项特征的手写以及具有连续特征的鞋模印象)进行的,表明距离和稀有性该方法明显优于仅距离方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号