Learning-based Transformation for Text Documents

Abstract

This paper presents a method to automatically transform semistructured (not necessarily tagged) text documents into content-tagged documents based on techniques from machine learning and computational linguistics. The method consists of two phases. First, a learning-based segmentation module is used to extract regions and sequences from the documents. Second, translation from region-marked documents to XML is done by a transformation-based learning (TBL) translator that is very effective even with a small set of training examples.
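
The second phase can be illustrated with a minimal sketch of transformation-based learning, assuming the segmentation phase has already produced one region per text line. This is not the authors' implementation: the surface features (`has_at`, `has_digit`, `title_case`), the tag names, the single rule template, and the function names `learn`, `tag`, and `to_xml` are all hypothetical choices made for illustration. The sketch starts from a default tag, greedily picks the rule that removes the most remaining errors on the training examples, and then emits XML from the tagged regions.

```python
from typing import List, Optional, Tuple
from xml.sax.saxutils import escape

# Hypothetical rule template: if a region has feature `feat` and currently
# carries tag `old`, retag it as `new`.
Rule = Tuple[str, str, str]  # (feat, old, new)


def features(line: str) -> List[str]:
    """Hypothetical surface features of one segmented region (a line here)."""
    feats = []
    if "@" in line:
        feats.append("has_at")
    if any(ch.isdigit() for ch in line):
        feats.append("has_digit")
    if line.istitle():
        feats.append("title_case")
    return feats


def apply_rule(rule: Rule, lines: List[str], tags: List[str]) -> List[str]:
    feat, old, new = rule
    return [new if t == old and feat in features(l) else t
            for l, t in zip(lines, tags)]


def errors(tags: List[str], gold: List[str]) -> int:
    return sum(t != g for t, g in zip(tags, gold))


def learn(lines: List[str], gold: List[str], default_tag: str,
          max_rules: int = 10) -> List[Rule]:
    """Greedy TBL loop: keep adding the rule with the largest error reduction."""
    tags = [default_tag] * len(lines)
    learned: List[Rule] = []
    for _ in range(max_rules):
        # Propose candidate rules only at positions that are currently wrong.
        candidates = {(f, t, g)
                      for l, t, g in zip(lines, tags, gold) if t != g
                      for f in features(l)}
        best: Optional[Rule] = None
        best_tags, best_err = tags, errors(tags, gold)
        for rule in sorted(candidates):
            new_tags = apply_rule(rule, lines, tags)
            err = errors(new_tags, gold)
            if err < best_err:
                best, best_tags, best_err = rule, new_tags, err
        if best is None:  # no rule improves the training set any further
            break
        learned.append(best)
        tags = best_tags
    return learned


def tag(lines: List[str], rules: List[Rule], default_tag: str) -> List[str]:
    """Apply the learned rules, in the order learned, to unseen regions."""
    tags = [default_tag] * len(lines)
    for rule in rules:
        tags = apply_rule(rule, lines, tags)
    return tags


def to_xml(lines: List[str], tags: List[str], root: str = "record") -> str:
    """Wrap each tagged region in an element named after its tag."""
    body = "".join(f"<{t}>{escape(l)}</{t}>" for l, t in zip(lines, tags))
    return f"<{root}>{body}</{root}>"


if __name__ == "__main__":
    train = ["Liping Ma", "lma@example.org", "Sydney NSW 2052", "John Shepherd"]
    gold = ["name", "email", "address", "name"]
    rules = learn(train, gold, default_tag="text")
    unseen = ["Raymond Wong", "wong@example.org"]
    print(to_xml(unseen, tag(unseen, rules, default_tag="text")))
```

Because rules are applied in the order they were learned, a later rule only fires on regions that earlier rules left untouched or retagged into its `old` tag. That ordering is what lets a handful of training examples yield a usable rule list in this toy setting.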

Bibliographic details

Source
World Multiconference on Systemics, Cybernetics and Informatics (SCI 2002), v.18: Information Systems Development III; July 14-18, 2002; Orlando, FL, US | 2002 | pp. 180-185 | 6 pages
Conference location: Orlando, FL (US)
Authors
Liping Ma; John Shepherd; Raymond K. Wong

Author affiliation
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia

Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Date indexed: 2022-08-26 14:21:05

