Text-Mining: Application Development Challenges

机译：文本挖掘：应用程序发展挑战

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper reviews the best practices and challenges for project managers and developers involved in implementing text-mining applications. With focus on rule-based information extraction, and references to actual cases, the authors share their experiences from having developed several text-mining applications in diverse industries. First, project management issues are discussed, including a process for capturing business requirements and mapping them into features and linguistic patterns, development of linguistic rules, rule development standards, performance metrics, and an evaluation methodology. Linguistic representations such as sub-syntactic, syntactic, semantic, and application-specific rules are identified. Special emphasis is placed on post-information extraction processing, such as improving the relevance of the extracted information, summarization models, techniques for handling typographical errors, resolution of temporal information, anaphora resolution, and a discussion on shallow vs. full parsing. Lastly, the paper discusses various utilities to help with the development of a text-mining application, such as feature analysis, visualization, source document pre-processing, and rule authoring tools.

机译：本文介绍了参与实施文本挖掘应用程序的项目经理和开发人员的最佳实践和挑战。专注于基于规则的信息提取和对实际情况的引用，作者分享了他们在各种行业开发了几种文本挖掘应用的经验。首先，讨论了项目管理问题，包括捕获业务需求并将其映射到特征和语言模式，语言规则，规则开发标准，绩效指标以及评估方法的过程中映射它们的过程。识别出语言表征，例如子句法，语法，语义和应用程序特定规则。特别强调在信息后提取处理中，例如提高提取的信息，摘要模型，处理印刷错误的技术，解决时间信息，Anaphora解决方案的技术以及浅与完全解析的讨论。最后，本文讨论了各种实用程序，帮助开发文本挖掘应用程序，例如特征分析，可视化，源文档预处理和规则创作工具。

著录项

来源
《SGAI international conference on knowledge based systems and applied artificial intelligence》|2003年||共14页
会议地点
作者
Sundar Varadarajan; Kas Kasravi; Ronen Feldman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. TEXT-MINING AND PATTERN-MATCHING BASED PREDICTION MODELS FOR DETECTING VULNERABLE FILES IN WEB APPLICATIONS [J] . Gupta Mukesh Kumar, Govil Mahesh Chandra, Singh Girdhari Journal of web engineering . 2018,第1a2期

机译：基于文本挖掘和模式匹配的Web应用程序中易受攻击文件的预测模型
2. Text-Mining Applications for Creation of Biofilm Literature Database [J] . Kanika Canadian Journal of Biotechnology . 2017,第SpecialaIssue期

机译：文本挖掘在创建生物膜文献数据库中的应用
3. Text-Mining Applications for Creation of Biofilm Literature Database [J] . Kanika Canadian Journal of Biotechnology . 2017,第SpecialaIssue期

机译：文本挖掘在创建生物膜文献数据库中的应用
4. Text-Mining: Application Development Challenges [C] . Sundar Varadarajan, Kas Kasravi, Ronen Feldman SGAI international conference on knowledge based systems and applied artificial intelligence . 2003

机译：文本挖掘：应用程序发展挑战
5. Development and application of state-of-the-art nuclear magnetic resonance (NMR) spectroscopic methodologies to analytical challenges relevant to the drug development process. [D] . Lucas, Laura Hamilton. 2004

机译：开发和应用最先进的核磁共振（NMR）光谱方法来解决与药物开发过程有关的分析难题。
6. Benchmarking of the 2010 BioCreative Challenge III text-mining competition by the BioGRID and MINT interaction databases [O] . Andrew Chatr-aryamontri, Andrew Winter, Livia Perfetto, 2011

机译：通过BioGRID和MINT交互数据库对2010年BioCreative Challenge III文本挖掘竞赛进行基准测试
7. Text-Mining: Application Development Challenges [O] . Sundar Varadarajan, Kas Kasravi, Ronen Feldman 2002

机译：文本挖掘：应用程序开发挑战

Text-Mining: Application Development Challenges

摘要

著录项

相似文献

相关主题

期刊订阅