Alignment and Extraction of Bilingual Legal Terminology from Context Profiles

机译：从语境概况的双语法律术语的对准和提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, we propose a knowledge-independent method for aligning terms and thus extracting translations from a small, domain-specific corpus consisting of parallel English and Chinese court judgments from Hong Kong. With a sentence-aligned corpus, translation equivalences are suggested by analysing the frequency profiles of parallel concordances. The method overcomes the limitations of conventional statistical methods which require large corpora to be effective, and lexical approaches which depend on existing bilingual dictionaries. Pilot testing on a parallel corpus of about 113K Chinese words and 120K English words gives an encouraging 85% precision and 45% recall. Future work includes fine-tuning the algorithm upon the analysis of the errors, and acquiring a translation lexicon for legal terminology by filtering out general terms.

机译：在这项研究中，我们提出了一种独立的对齐术语的方法，从而提取由香港平行英语和中国法院判决组成的小型域特定语料库的翻译。通过句子对齐的语料库，通过分析并行协调的频率分布来提出翻译等效性。该方法克服了常规统计方法的局限性，这些方法需要大型语料库是有效的，以及依赖于现有双语词典的词汇方法。 Pilot测试在一个大约113k中文单词和120k英语单词的并行语料库中，给出了令人鼓舞的85％精度和45％的召回。未来的工作包括在分析错误时微调算法，并通过过滤概括的条款来获取法律术语的翻译词汇。

著录项

来源
《International conference on computational linguistics post-conference workshops》|2002年||共7页
会议地点
作者
Oi Yee Kwong; Benjamin K. Tsou; Tom B.Y. Lai; Robert W.P. Luk; Lawrence Y.L. Cheung; Francis C.Y. Chik;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. Bilingual terminology extraction from parallel corpora using chunk-based alignment [J] . Lieve Macken, Els Lefever, Veronique Hoste Terminology . 2013,第1期

机译：使用基于块的对齐方式从并行语料库中提取双语术语
2. TermFinder: log-likelihood comparison and phrase- based statistical machine translation models for bilingual terminology extraction [J] . Haque Rejwanul, Penkale Sergio, Way Andy Language Resources and Evaluation . 2018,第2期

机译：TermFinder：对数似然比较和基于短语的统计机器翻译模型，用于双语术语提取
3. Bilingual terminology extraction using multi-level termhood [J] . Chengzhi Zhang, Dan Wu The Electronic Library . 2012,第2期

机译：使用多级术语来提取双语术语
4. Alignment and Extraction of Bilingual Legal Terminology from Context Profiles [C] . Oi Yee Kwong, Benjamin K. Tsou, Tom B.Y. Lai, International conference on computational linguistics post-conference workshops . 2002

机译：从语境概况的双语法律术语的对准和提取
5. Automatic Compilation of Bilingual Terminologies From Comparable Corpora [D] . Kontonatsios, Georgios. 2015

机译：自动汇编来自可比语料库的双语术语
6. Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data [O] . Sami K Kilpinen, Kalle A Ojala, Olli P Kallioniemi 2011

机译：来自测试样品的基因表达谱与参考数据库的比对：用于微阵列数据上下文特定解释的新方法
7. Alignment and extraction of bilingual legal terminology from context profiles [O] . Kwong, OY, T'Sou, BK, Lai, TBY, 2002

机译：从上下文配置文件中对齐和提取双语法律术语

Alignment and Extraction of Bilingual Legal Terminology from Context Profiles

摘要

著录项

相似文献

相关主题

期刊订阅