首页> 外文会议>Workshop on NLP for Similar Languages, Varieties and Dialects >Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets
【24h】

Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets

机译:结合深度学习和串核,以便瑞士德国推文本地化

获取原文

摘要

In this work, we introduce the methods proposed by the UnibucKernel team in solving the Social Media Variety Geolocation task featured in the 2020 VarDial Evaluation Campaign. We address only the second subtask, which targets a data set composed of nearly 30 thousand Swiss German Jodels. The dialect identification task is about accurately predicting the latitude and longitude of test samples. We frame the task as a double regression problem, employing a variety of machine learning approaches to predict both latitude and longitude. From simple models for regression, such as Support Vector Regression, to deep neural networks, such as Long Short-Term Memory networks and character-level convolutional neural networks, and, finally, to ensemble models based on meta-learners, such as XGBoost, our interest is focused on approaching the problem from a few different perspectives, in an attempt to minimize the prediction error. With the same goal in mind, we also considered many types of features, from high-level features, such as BERT embeddings, to low-level features, such as characters n-grams, which are known to provide good results in dialect identification. Our empirical results indicate that the handcrafted model based on string kernels outperforms the deep learning approaches. Nevertheless, our best performance is given by the ensemble model that combines both handcrafted and deep learning models.
机译:在这项工作中,我们介绍了Unibucknel团队在解决2020年的Vardial评估活动中的社交媒体品种地理定位任务方面提出的方法。我们只解决了第二个子任务,它针对一个由近3万瑞士德国杰德尔组成的数据集。方言识别任务是准确地预测测试样本的纬度和经度。我们将任务框架作为双重回归问题,采用各种机器学习方法来预测纬度和经度。从简单模型进行回归,如支持向量回归,深度神经网络,如长的短期内存网络和字符级卷积神经网络,而且最后,基于元学习者的集合模型,如XGBoost,我们的兴趣是专注于从几个不同的角度来接近问题,以便最大限度地减少预测误差。考虑到同样的目标,我们还考虑了许多类型的特征,从高级功能(如BERT Embeddings)到低级功能,如字符N-GRAM,已知在方言识别中提供良好的结果。我们的经验结果表明,基于串核的手工制作模型优于深度学习方法。尽管如此,我们的最佳性能是由集合模型给出的,这些模型结合了手工制作和深度学习模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号