SpeechToText: An open-source software for automatic detection and transcription of voice recordings in digital forensics

Negra Miguel; Domingues Patricio

Abstract

Voice is the most natural way for humans to communicate with each other, and more recently, to interact with voice-controlled digital machines. Although text is predominant in digital platforms, voice and video are becoming increasingly important, with communication applications supporting voice messages and videos. This is relevant for digital forensic examinations, as content held in voice format can hold relevant evidence for the investigation. In this paper, we present the open-source SpeechToText software, which relies on state-of-the-art Voice Activity Detection (VAD) and Automatic Speech Recognition (ASR) modules to detect voice content and then transcribe it to text. This allows integrating voice content into the regular flow of a digital forensic investigation, with transcribed audio indexed by text search engines. Although SpeechToText can be run independently, it also provides a Jython-based software module for the well-known Autopsy software. The paper also analyzes the availability, storage location and audio format of voice-recorded content in 14 popular Android applications featuring voice recordings. SpeechToText achieves 100% accuracy for detecting voice in unencrypted audio/video files, a word error rate (WER) of 27.2% when transcribing English voice messages by non-native speakers and a WER of 7.80% for the test-clean set of LibriSpeech. It achieves a real-time factor of 0.15 for the detection and transcription process on a medium-range laptop, meaning that 1 min of speech is processed in roughly 9 s. (c) 2021 Elsevier Ltd. All rights reserved.
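
The two metrics quoted in the abstract can be illustrated with a minimal sketch. It assumes the usual definitions: WER as the word-level edit distance divided by the number of reference words, and real-time factor as processing time divided by audio duration. The abstract does not say how the authors computed either figure, so the function names and the whitespace tokenization below are hypothetical and not taken from the SpeechToText code base.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation stripping).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Processing time divided by audio duration; below 1 is faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def processing_seconds_for(audio_seconds: float, rtf: float) -> float:
    """Expected processing time for a recording at a given real-time factor."""
    return audio_seconds * rtf


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("the quick brown fox", "the quick brown box"))
    # The abstract's example: 60 s of speech at a real-time factor of 0.15.
    print(processing_seconds_for(60.0, 0.15))
```

With these definitions, a real-time factor of 0.15 applied to 60 s of speech gives about 9 s of processing, which matches the figure stated in the abstract.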

Bibliographic details

Source
Digital investigation | 2021, Issue 1 | 301223.1-301223.10 | 10 pages
Authors
Negra Miguel; Domingues Patricio;
Author affiliations

Polytech Inst Leiria Sch Technol & Management Leiria Portugal|Comp Sci & Commun Res Ctr Leiria Portugal;

Polytech Inst Leiria Sch Technol & Management Leiria Portugal|Comp Sci & Commun Res Ctr Leiria Portugal|Inst Telecomunicacoes Aveiro Portugal;

Original format: PDF
Text language: English
Keywords
Voice recordings; Automatic speech recognition; Automatic speech transcription; Digital forensics; Android applications;
