Building Representative Corpora from Illiterate Communities: A Review of Challenges and Mitigation Strategies for Developing Countries

Abstract

Most well-established data collection methods currently adopted in NLP depend on the assumption of speaker literacy. Consequently, the collected corpora largely fail to represent swathes of the global population, which tend to be some of the most vulnerable and marginalised people in society, and often live in rural developing areas. Such underrepresented groups are thus not only ignored when making modeling and system design decisions, but also prevented from benefiting from development outcomes achieved through data-driven NLP. This paper aims to address the under-representation of illiterate communities in NLP corpora: we identify potential biases and ethical issues that might arise when collecting data from rural communities with high illiteracy rates in Low-Income Countries, and propose a set of practical mitigation strategies to help future work.

Bibliographic details

Source
Conference of the European Chapter of the Association for Computational Linguistics | 2021 | pp. 2176-2189 | 14 pages
Authors
Stephanie Hirmer; Alycia Leonard; Josephine Tumwesige; Costanza Conforti
Original format: PDF
