Tree Transformer: Integrating Tree Structures into Self-Attention

Abstract

Pre-training a Transformer on large-scale raw text and fine-tuning it on the desired task have achieved state-of-the-art results on diverse NLP tasks. However, it is unclear what the learned attention captures. The attention computed by attention heads seems not to match human intuitions about hierarchical structures. This paper proposes Tree Transformer, which adds an extra constraint to the attention heads of the bidirectional Transformer encoder in order to encourage them to follow tree structures. The tree structures can be automatically induced from raw text by our proposed "Constituent Attention" module, which is simply implemented by self-attention between adjacent words. With a training procedure identical to BERT's, the experiments demonstrate the effectiveness of Tree Transformer in terms of inducing tree structures, better language modeling, and learning more explainable attention scores.
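The abstract only describes Constituent Attention at a high level: adjacent words attend to each other, the resulting link probabilities form a soft constituent prior, and that prior gates the regular self-attention weights. The sketch below illustrates one way such a mechanism could look; it is not the authors' released implementation, and the helper names (`constituent_prior`, `tree_attention`), tensor shapes, and the exact link-probability formula are assumptions drawn from this description.

```python
# Minimal sketch of a constituent-prior-gated attention layer (assumption,
# not the official Tree Transformer code).
import torch
import torch.nn.functional as F

def constituent_prior(q, k, prev_a=None):
    """q, k: (batch, seq_len, d). Returns (C, a) where C[b, i, j] is a soft
    probability that positions i..j belong to the same constituent and
    a[b, i] is the link strength between adjacent words i and i+1."""
    b, n, d = q.shape
    # neighboring attention: each word scores only its immediate neighbors
    s_right = (q[:, :-1] * k[:, 1:]).sum(-1) / d ** 0.5    # word i -> i+1
    s_left = (q[:, 1:] * k[:, :-1]).sum(-1) / d ** 0.5     # word i+1 -> i
    neg = torch.full((b, 1), float("-inf"))                # pad sentence ends
    left = torch.cat([neg, s_left], dim=1)                 # word i's left score
    right = torch.cat([s_right, neg], dim=1)               # word i's right score
    p = F.softmax(torch.stack([left, right], dim=-1), dim=-1)
    # mutual link probability between adjacent words i and i+1
    a = (p[:, :-1, 1] * p[:, 1:, 0]).sqrt()                # (b, n-1)
    if prev_a is not None:                                 # constituents may only
        a = prev_a + (1.0 - prev_a) * a                    # grow across layers
    # constituent prior: C[i, j] = product of link strengths between i and j
    cum = F.pad(torch.cumsum(torch.log(a + 1e-9), dim=1), (1, 0))   # (b, n)
    upper = torch.exp(cum.unsqueeze(1) - cum.unsqueeze(2)).triu(1)
    C = upper + upper.transpose(1, 2) + torch.eye(n)       # symmetric, diag = 1
    return C, a

def tree_attention(q, k, v, prev_a=None):
    """Scaled dot-product attention gated elementwise by the constituent prior."""
    C, a = constituent_prior(q, k, prev_a)
    scores = q @ k.transpose(1, 2) / q.shape[-1] ** 0.5
    attn = C * F.softmax(scores, dim=-1)                   # elementwise gate
    attn = attn / (attn.sum(-1, keepdim=True) + 1e-9)      # renormalize rows
    return attn @ v, a

# toy usage: one layer over a batch of 2 sentences of 5 tokens each
q, k, v = (torch.randn(2, 5, 16) for _ in range(3))
out, a = tree_attention(q, k, v)
print(out.shape, a.shape)   # torch.Size([2, 5, 16]) torch.Size([2, 4])
```

Passing `a` from one layer into the next as `prev_a` is what would make the induced constituents grow monotonically with depth, so lower layers capture small phrases and higher layers merge them into larger spans.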
