Domain-Independent Automated Processing of Free-Form Text Data in Telecom

机译：在电信中独立于自动自动处理自由形式文本数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Free-form, unstructured and semi-structured textual data has become increasingly more prevalent in the telecommunications industry, with service and equipment providers alike. Some typical examples include textual data from customer care tickets, machine logs, alarm and alerting systems, and diagnostics. There is a growing business need to rapidly and automatically understand the underlying key topics and categories of this bulk collection of text. With the present mode of operation of relying on domain experts to analyze textual data, there is a clear need to apply text analytics to automate the process. Difficulties arise due to the jargon-filled and fragmented, incomplete nature of textual data in this field. In this paper, we propose a domain-agnostic, unsupervised approach that deploys a multi-stage text processing pipeline for automatically discovering the key topics and categories from free-form text documents. Using anonymized datasets retrieved from actual customer care tickets and system logs, we show that our approach outperforms traditional text mining approaches, and performs comparably to manual categorization tasks that were undertaken by domain experts with full system knowledge.

机译：自由形式，非结构化和半结构化文本数据在电信行业中越来越普遍，服务和设备提供商相似。一些典型的示例包括来自客户服务票证，机器日志，报警和警报系统以及诊断的文本数据。越来越多的业务需要迅速，自动理解该批量收集文本的基础关键主题和类别。通过依赖域专家对域专家进行分析文本数据的目前的操作模式，有明确需要将文本分析应用于自动化过程。由于行话填充和碎片，文本数据中的文本数据的不完全性质，出现困难。在本文中，我们提出了一个域名无神不可化的方法，部署了一个多级文本处理管道，用于自动发现自由窗体文本文档的关键主题和类别。使用从实际的客户服务票证和系统日志检索的匿名数据集，我们表明我们的方法优于传统的文本挖掘方法，并与手动分类任务相对，这些任务是由域专家进行全面的系统知识。

著录项

来源
《IEEE International Conference on Data Engineering》|2019年|721p|共9页
会议地点
作者
Rajarshi Bhowmik; Ahmet Akyamac;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据处理、数据处理系统;
关键词
Text processing; Pipelines; Telecommunications; Data mining; Feature extraction; Frequency measurement; Business;

机译：文本处理;管道;电信;数据挖掘;特征提取;频率测量;业务;

相似文献

外文文献
中文文献
专利

1. A domain-independent process for automatic ontology population from text [J] . Carla Faria, Ivo Serra, Rosario Girardi Science of Computer Programming . 2014,第pta1期

机译：来自文本的自动本体填充的域独立过程
2. Automating Stroke Data Extraction From Free-Text Radiology Reports Using Natural Language Processing: Instrument Validation Study [J] . Amy Y X Yu, Zhongyu A Liu, Chloe Pou-Prom, JMIR Medical Informatics . 2021,第5期

机译：自动语言处理自由文本放射学报告自动化冲程数据提取：仪器验证研究
3. Extracting statistical data from free-form text [J] . Hill L. Owen, Zein David A. Circuits and Devices Magazine, IEEE . 1986,第3期

机译：从自由格式文本中提取统计数据
4. Domain-Independent Automated Processing of Free-Form Text Data in Telecom [C] . Rajarshi Bhowmik, Ahmet Akyamac IEEE International Conference on Data Engineering . 2019

机译：电信中格式自由的文本数据的域独立自动处理
5. Automated generation of metadata for mining image and text data. [D] . Al-Shameri, Faleh Jassem. 2006

机译：自动生成用于挖掘图像和文本数据的元数据。
6. Natural Language Processing and Automatic SNOMED-Encoding of Free Text: An Analysis of Free Text Data from a Routine Electronic Patient Record Application with a Parsing Tool Using the German SNOMED II [O] . Joerg H. Hohnloser, Matthias Holzer, Martin R.G. Fischer, 1996

机译：自然语言处理和自由文本的自动SNOMED编码：使用德语SNOMED II的解析工具对例行电子病历应用中的自由文本数据进行分析
7. Methodology of data collection and processing for the creation of associative and metaphoric dictionary of the Russian language designed for automated text processing systems (AMD-ATPS) [O] . Nikolay V. Golovko 2018

机译：用于创建用于自动化文本处理系统的俄语和隐喻词典的数据收集和处理的方法（AMD-ATP）
8. Security Classification Using Automated Learning (SCALE): Optimizing Statistical Natural Language Processing Techniques to Assign Security Labels to Unstructured Text [R] . Brown, J. D., Charlebois, D. 2010

机译：使用自动学习的安全性分类（sCaLE）：优化统计自然语言处理技术，将安全标签分配给非结构化文本

Domain-Independent Automated Processing of Free-Form Text Data in Telecom

摘要

著录项

相似文献

相关主题

期刊订阅