Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them

机译：比较专注的爬虫和从中获得的翻译资源的质量

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Comparable corpora have been used as an alternative for parallel corpora as resources for computational tasks that involve domain-specific natural language processing. One way to gather documents related to a specific topic of interest is to traverse a portion of the web graph in a targeted way, using focused crawling algorithms. In this paper, we compare several focused crawling algorithms using them to collect comparable corpora on a specific domain. Then, we compare the evaluation of the focused crawling algorithms to the performance of linguistic processes executed after training with the corresponding generated corpora. Also, we propose a novel approach for focused crawling, exploiting the expressive power of multiword expressions.

机译：可比较的Corpora已被用作并行Corpora作为涉及域特定的自然语言处理的计算任务的资源的替代方案。收集与感兴趣的特定话题相关的文档的一种方法是使用聚焦爬行算法以目标方式遍历一部分Web图。在本文中，我们比较了几种聚焦爬行算法，使用它们收集特定域上的可比较。然后，我们将聚焦爬行算法的评估进行比较与相应生成的语料库训练后执行的语言过程的性能。此外，我们提出了一种重点爬行的新方法，利用多个表达式的表现力。

著录项

来源
《9th International conference on language resources and evaluation》|2014年||共7页
会议地点
作者
B. R. Laranjeira; V. P. Moreira; A. Villavicencio; C. Ramisch; M. J. Finatto;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词
Focused Crawling; Comparable Corpora; Machine Translation;

机译：重点爬行;可比较的小组;机器翻译;

相似文献

外文文献
中文文献
专利

1. The evaluation of text quality: expert-focused and reader-focused methods compared [J] . Lentz L., De Jong M. IEEE Transactions on Professional Communication . 1997,第3期

机译：文本质量评估：比较以专家为中心和以读者为中心的方法
2. Translational Research - Focusing on Quality of Pediatric Nursing Care and Reducing Safety Risks for Children and Families [J] . Christian Becky J. Journal of pediatric nursing . 2019,第期

机译：翻译研究 - 专注于儿科护理保健的质量，减少儿童和家庭的安全风险
3. THE TRADE-OFF BETWEEN QUANTITY AND QUALITY. COMPARING A LARGE CRAWLED CORPUS AND A SMALL FOCUSED CORPUS FOR MEDICAL TERMINOLOGY EXTRACTION [J] . Hoste Veronique, Vanopstal Klaar, Terryn Ayla Rigouts, Nature reviews neuroscience . 2019,第2期

机译：数量和质量之间的权衡。比较大型爬行的语料库和用于医学术语提取的小型专注语料库
4. Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them [C] . B. R. Laranjeira, V. P. Moreira, A. Villavicencio, 9th International conference on language resources and evaluation . 2014

机译：比较重点爬虫的质量和从中获得的翻译资源
5. Translation Quality Assessment of Non-Literary Translation: Comparing Two Arabic Translations of Edward Said's Orientalism [D] . Hedaya, Samy. 2017

机译：非文学翻译的翻译质量评估：比较爱德华·赛义德的东方主义的两种阿拉伯语翻译
6. Translating Evidence for Low Back Pain Management into a Consumer-Focussed Resource for Use in Community Pharmacies: A Cluster-Randomised Controlled Trial [O] . Helen Slater, Andrew M. Briggs, Kim Watkins, -1

机译：将腰痛管理的证据转化为以消费者为中心的资源以用于社区药房：一项集群随机对照试验
7. A Focused Crawler in order to Get Semantic Web Resources (CSR) [O] . Barbosa Santillán Liliana Ibeth, Campos Quirarte Juana Elizabeth, Castro Munguía Aldo 2013

机译：集中抓取工具以获取语义Web资源（CSR）

Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them

摘要

著录项

相似文献

相关主题

期刊订阅