首页> 外文会议>Australasian Language Technology Association workshop >An Effect of Background Population Sample Size on the Performance of a Likelihood Ratio-based Forensic Text Comparison System:A Monte Carlo Simulation with Gaussian Mixture Model
【24h】

An Effect of Background Population Sample Size on the Performance of a Likelihood Ratio-based Forensic Text Comparison System:A Monte Carlo Simulation with Gaussian Mixture Model

机译:背景人群样本量对基于似然比的法医文本比较系统性能的影响:基于高斯混合模型的蒙特卡罗模拟

获取原文

摘要

This is a Monte Carlo simulation-based study that explores the effect of the sample size of the background database on a likelihood ratio (LR)-based forensic text comparison (FTC) system built on multivariate authorship attribution features. The text messages written by 240 authors who were randomly selected from an archive of chatlog messages were used in this study. The strength of evidence (= LR) was estimated using the multivariate kernel density likelihood ratio (MVKD) formula with a logistic-regression calibration. The results are reported along two points: the system performance (= accuracy) and the stability of performance based on the standard metric for LR-based systems; namely the log-likelihood-ratio cost (C_(ur)). It was found in this study that the system performance and its stability improve as a function of the sample size (= author count) in the background database in a non-linear manner, and that the more features used for modelling, the more background data the system generally requires for optimal results. The implications of the findings to the real casework are also discussed.
机译:这是一项基于蒙特卡洛模拟的研究,探讨了背景数据库的样本大小对基于多元作者权归属特征构建的基于似然比(LR)的法医文本比较(FTC)系统的影响。这项研究使用了240位作者的文本消息,这些消息是从聊天日志消息的存档中随机选择的。使用多元核密度似然比(MVKD)公式和对数回归校准来估算证据强度(= LR)。报告的结果有两点:系统性能(=精度)和基于基于LR的系统的标准度量的性能稳定性;即对数似然比成本(C_(ur))。在这项研究中发现,系统性能及其稳定性以非线性方式随背景数据库中样本数量(=作者数)的变化而提高,并且用于建模的功能越多,背景数据就越多该系统通常需要最佳结果。还讨论了调查结果对实际案例的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号