首页> 外文会议>Databases in networked information systems >Summary Extraction from Chinese Text for Data Archives of Online News
【24h】

Summary Extraction from Chinese Text for Data Archives of Online News

机译:在线新闻数据档案中文摘要摘录

获取原文
获取原文并翻译 | 示例

摘要

Electronic news media consistently use a specific language frame for efficient knowledge delivery and opinion formation. Since machine representation of logographs, and their derived forms, such as ideograms, and the Chinese characters in general, enumerates to a large set of symbols, the information content of particular text sequence interconnects context patterns across various scope ranges. Here we concern with the enumerated form of sinogram reflecting on the characters not only historically and culturally, but also educationally. Logographs visually invoke mutual functional relations by design and through their usage in overlaping scopes. Here we study the procedural summarization of text originally intended for online news distribution and the preferable evaluation method of its usability. Sinogrammatic electronic news sentences are analyzed for mutual similarity patterns both inward and outward, in order to facilitate sentence extraction for summary inclusion while reflecting on the principle of characters. Traditional partition of linguistic knowledge representation is aided by invocation of bypass routes in logographic text similar to software pictograms, for which design and usage frames are coeducational. Machine extracted summaries are compared with human chosen sentences while employing the Turing test to ascertain cohesion of Human - Human and Human - Machine comparison. The implementation of popularity-based summarization algorithm is available as a Java program.
机译:电子新闻媒体始终使用特定的语言框架进行有效的知识传递和意见形成。由于徽标的机器表示以及它们的派生形式(例如表意文字和汉字)通常会枚举大量符号,因此特定文本序列的信息内容会在各种范围范围内互连上下文模式。在这里,我们关注的是正弦图的枚举形式,不仅反映了历史和文化上的特征,而且还反映了教育上的特征。徽标在视觉上通过设计和通过在重叠范围内的使用来调用相互的功能关系。在这里,我们研究了最初用于在线新闻发布的文本的过程摘要以及其可用性的首选评估方法。分析了汉字电子新闻句子的内部和外部的相互相似性模式,以便于在考虑字符原理的同时方便句子提取以进行摘要收录。语言知识表示的传统划分是通过调用类似于软件象形文字的逻辑文字中的旁路路由来实现的,为此设计和使用框架是男女同校的。将机器提取的摘要与人类选择的句子进行比较,同时采用图灵测试来确定人与人之间以及人与机器之间的比较的内聚性。基于流行度的摘要算法的实现可作为Java程序获得。

著录项

  • 来源
  • 会议地点 Aizu-Wakamatsu(JP);Aizu-Wakamatsu(JP)
  • 作者

    Nozomi Mikami; Lukas Pichl;

  • 作者单位

    International Christian University Osawa 3-10-2, Mitaka, Tokyo, 181-8585, Japan;

    International Christian University Osawa 3-10-2, Mitaka, Tokyo, 181-8585, Japan;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 TP311.13;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号