Natural Language Processing Techniques for Document Classification in IT Benchmarking: Automated Identification of Domain Specific Terms

Abstract

In the domain of IT benchmarking, collected data are often stored as natural language text and are therefore intrinsically unstructured. To ease data analysis and evaluation across different types of IT benchmarking approaches, a semantic representation of this information is crucial. Thus, identifying conceptual (semantic) similarities is the first step in developing integrative data management in this domain. Because an ontology is a specification of such a conceptualization, an association of terms, relations between terms, and related instances must be developed. Building on previous research, we present an approach for automated term extraction using natural language processing (NLP) techniques. Terms are automatically extracted from existing IT benchmarking documents, leading to a domain-specific dictionary. The extracted terms are representative of each document, describe the purpose and content of each file, and serve as a basis for the ontology development process in the domain of IT benchmarking.
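
The abstract does not say which NLP techniques the authors use, so the following is only a minimal sketch of one common way to extract representative terms per document and merge them into a dictionary. It assumes TF-IDF scoring, which is a stand-in chosen for illustration and not necessarily the paper's method. The function names, the stop-word list, and the three sample documents are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "for", "to",
              "is", "are", "per", "by", "on", "with"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def extract_terms(documents, top_n=3):
    """Return the top_n highest-scoring TF-IDF terms for each document."""
    tokenized = {name: tokenize(text) for name, text in documents.items()}
    n_docs = len(tokenized)

    # Document frequency: number of documents in which each term occurs.
    doc_freq = Counter()
    for tokens in tokenized.values():
        doc_freq.update(set(tokens))

    result = {}
    for name, tokens in tokenized.items():
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        scores = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
        # Sort by descending score, then alphabetically for a stable order.
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        result[name] = [term for term, _ in ranked[:top_n]]
    return result


def build_dictionary(terms_per_document):
    """Merge the per-document terms into one sorted domain dictionary."""
    return sorted({t for terms in terms_per_document.values() for t in terms})


# Made-up sample texts standing in for IT benchmarking documents.
documents = {
    "server_costs.txt": "Server costs per year include server hardware and server maintenance costs.",
    "helpdesk.txt": "Helpdesk tickets per employee and helpdesk response time for tickets.",
    "storage.txt": "Storage capacity in terabytes and storage costs per terabyte.",
}

terms = extract_terms(documents)
dictionary = build_dictionary(terms)
```

Here `terms` maps each file name to its highest-scoring terms and `dictionary` is their sorted union. Words that occur in several documents, such as "costs", receive a lower score than words concentrated in a single document, which is why the per-document terms tend to describe what is specific to each file.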

Bibliographic details

Source
International Conference on Enterprise Information Systems | 2015 | 7 pages
Authors
Matthias Pfaff; Helmut Krcmar
Original format: PDF
Chinese Library Classification: G20-53
Keywords
IT Benchmarking; Natural Language Processing; Heterogeneous Data; Semantic Data Integration; Ontologies

Similar literature

1. Automated Requirements Identification from Construction Contract Documents Using Natural Language Processing [J]. Journal of Legal Affairs and Dispute Resolution in Engineering and Construction. 2020, No. 2
2. Automatic Document Summarization System Based on Natural Language Processing and Artificial Intelligent Techniques [J]. M. I. Elalami, A. E. Amin, M. G. Doweidar. International Journal of Computer Trends and Technology. 2018, No. 1
3. Excavating grey literature: A case study on the rich indexing of archaeological documents via natural language-processing techniques and knowledge-based resources [J]. Andreas Vlachidis, Ceri Binding, Douglas Tudhope. Aslib Proceedings. 2010, No. 4/5
4. Natural Language Processing Techniques for Document Classification in IT Benchmarking: Automated Identification of Domain Specific Terms [C]. Matthias Pfaff, Helmut Krcmar. International Conference on Enterprise Information Systems. 2015
5. Automated mapping of domain specific languages to application specific multiprocessors [D]. Plishker, William Lester. 2006
6. Automated Encoding of Clinical Documents Based on Natural Language Processing [O]. Carol Friedman, Lyudmila Shagina, Yves Lussier. 2004
7. An Automated Multiple-Choice Question Generation using Natural Language Processing Techniques [O]. Chidinma A. Nwafor, Ikechukwu E. Onyenwe. 2021
8. Security Classification Using Automated Learning (SCALE): Optimizing Statistical Natural Language Processing Techniques to Assign Security Labels to Unstructured Text [R]. Brown, J. D., Charlebois, D. 2010
