International Joint Conference on Neural Networks

Learning Private Neural Language Modeling with Attentive Aggregation

Abstract

Mobile keyboard suggestion is typically regarded as a word-level language modeling problem. Centralized machine learning techniques require the collection of massive amounts of user data for training, which may raise privacy concerns regarding users' sensitive data. Federated learning (FL) offers a promising approach to learning private language models for intelligent personalized keyboard suggestion by training models on distributed clients rather than on a central server. To obtain a global model for prediction, existing FL algorithms simply average the client models, ignoring the importance of each client during model aggregation, and they perform no optimization on the central server toward a well-generalized global model. To solve these problems, we propose a novel model aggregation scheme with an attention mechanism that considers the contribution of each client model to the global model, together with an optimization step during server-side aggregation. Our attentive aggregation method minimizes the weighted distance between the server model and the client models, iteratively updating the server parameters while attending to that distance. Experiments on two popular language modeling datasets and a social media dataset show that the proposed method outperforms its counterparts in terms of perplexity and communication cost in most comparison settings.
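The server-side update described in the abstract can be sketched in code. Below is a minimal PyTorch sketch under assumptions of ours: attention weights are taken as a softmax over the layer-wise parameter distance between the server model and each client model, and the server takes a step of size epsilon toward the attention-weighted clients. The function name attentive_aggregate, the L2 distance, and the epsilon hyperparameter are illustrative assumptions, not necessarily the paper's exact algorithm.

import torch
import torch.nn.functional as F

def attentive_aggregate(server_params, client_params_list, epsilon=1.0):
    # Sketch of attention-weighted server aggregation (assumed formulation).
    # server_params: dict of name -> tensor for the global (server) model.
    # client_params_list: list of such dicts, one per participating client.
    # epsilon: server-side step size; an assumed hyperparameter.
    new_params = {}
    for name, w_server in server_params.items():
        # Layer-wise distance between the server parameter and each client's.
        dists = torch.stack(
            [torch.norm(w_server - client[name]) for client in client_params_list]
        )
        # Attention weights over the clients via a softmax of the distances.
        alphas = F.softmax(dists, dim=0)
        # Step toward the clients, weighted by attention: a descent step on
        # the weighted distance between the server and client models.
        step = sum(
            a * (w_server - client[name])
            for a, client in zip(alphas, client_params_list)
        )
        new_params[name] = w_server - epsilon * step
    return new_params

# Example with two clients holding slightly different parameters.
server = {"w": torch.zeros(3)}
clients = [{"w": torch.ones(3)}, {"w": -0.5 * torch.ones(3)}]
server = attentive_aggregate(server, clients, epsilon=1.0)

Note that with uniform weights (alpha_k = 1/K) and epsilon = 1, this update reduces to plain averaging of the client models, i.e. the federated-averaging baseline the abstract contrasts against; the attention weights are what let aggregation account for each client's contribution.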
