首页> 外文会议>Culture and Computing 2011 >Term Extraction from Japanese Ancient Writings Using Probability of Character N-grams

【24h】

Term Extraction from Japanese Ancient Writings Using Probability of Character N-grams

机译：使用字符N-gram的概率从日本古代著作中提取术语

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Currently, there are no tools available to separate ancient Japanese sentence into words. Therefore, it is difficult to extract archaic Japanese terms from Japanese ancient writings. In this paper, we propose a method of term extraction for ancient Japanese documents. We calculate the likelihood of character n-grams to be a word, and extract character n-grams with higher likelihood as archaic Japanese terms. We conducted experiments of term separation using the term likelihood by the proposed method.

机译：当前，没有可用的工具将古日语句子分成单词。因此，很难从日本古代著作中提取古老的日语术语。在本文中，我们提出了一种用于古代日本文献的术语提取方法。我们计算字符n-gram为一个单词的可能性，并提取具有较高可能性的字符n-gram作为古日语术语。我们通过提出的方法使用术语似然进行了术语分离实验。

著录项

来源
《Culture and Computing 2011》|2011年|p.183-184|共2页
会议地点
作者
Kimura Fuminori; Yoshimura Mamoru; Maeda Akira;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类机器翻译;
关键词
Japanese ancient writings; character n-gram; term extraction; term likelihood;

机译：日本古代著作;字符n-gram;术语提取;术语似然;
入库时间 2022-08-26 15:06:38

相似文献

外文文献
中文文献
专利

1. Detection of Wrong Character Using Probability Transitional Patterns of Both-Direction N-gram Probabilities [J] . Takehiro KAWATA, Atsuyoshi NAKAMURA, Jun TOYAMA, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2003,第295期

机译：使用双向N-gram概率的概率转换模式检测错误字符
2. Detection of Wrong Character Using Probability Transitional Patterns of Both-Direction N-gram Probabilities [J] . Takehiro KAWATA, Atsuyoshi NAKAMURA, Jun TOYAMA, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2003,第295期

机译：使用两方向N-GRAM概率的概率过渡模式检测错误字符
3. Japanese Term Extraction Toward French-Japanese Bilingual Term Extraction on Wind Power Generation Domain [J] . Teruo KOYAMA, Shouzaburo MINAMOTO, Koichi TAKEUCHI, 電子情報通信学会技術研究報告 . 2012,第367期

机译：面向风能发电领域法日双语术语的日语术语提取
4. Term Extraction from Japanese Ancient Writings Using Probability of Character N-grams [C] . Kimura Fuminori, Yoshimura Mamoru, Maeda Akira International Conference on Culture and Computing . 2011

机译：使用字符N-GRAM的概率从日本古代着作中提取
5. Kana in the eighth century: An ancient Japanese writing system. [D] . Case, Theresa Leyden. 2000

机译：八世纪的假名：一种古老的日本文字系统。
6. Parietal Dysgraphia: Characterization of Abnormal Writing Stroke Sequences Character Formation and Character Recall [O] . Yasuhisa Sakurai, Yoshinobu Onuma, Gaku Nakazawa, 2007

机译：顶音障碍：异常笔画序列字符形成和字符召回的表征。
7. While quite a number of English-Japanese dictionaries of business and legal terms have been publishedin Japan, there seem to be very few Japanese-English dictionaries on the market which give clear explanations of thesubtle differences in meaning between the various English equivalents of terms in such specialized fields. Due to alack of clear guidance, Japanese users frequently hesitate over which term to select when writing business letters ordrafting legal documents. We have been engaged, since 2004, in compiling a user-friendly Japanese-Englishdictionary that not only lists distinctively-defined equivalents for business and legal terms, but also includes clearlywritten notes and comments on them. The following are some of the terms we have collected over the last threeyears. The dictionary is not intended to be a perfect commentary on business and legal terminology, but we hope thatit will be of some use in the preparation of business letters and legal documents. [O] . 木宮直仁, 平川平川博 2010

机译：虽然日本出版了大量商业和法律术语的英日词典，但市场上似乎很少有日英词典，这些词典清楚地解释了各种英语词汇之间的意义差异。这些专业领域的术语。由于没有明确的指导，日本用户在撰写商务信函或撰写法律文件时经常对选择哪个术语犹豫不决。自2004年以来，我们一直致力于编写一个用户友好的日语 - 英语 ndictionary，不仅列出了商业和法律术语的明确定义的等价物，而且还包括明确的书面注释和评论。以下是我们在过去三年中收集的一些术语。这本词典并不是对商业和法律术语的完美评论，但我们希望 nit将在商业信函和法律文件的准备中有所作为。

Term Extraction from Japanese Ancient Writings Using Probability of Character N-grams

摘要

著录项

相似文献

相关主题

期刊订阅