首页> 外文期刊>Journal of medical Internet research >Social Reminiscence in Older Adults’ Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning
【24h】

Social Reminiscence in Older Adults’ Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning

机译:老年人日常谈话中的社会悔改:使用自然语言处理和机器学习自动检测

获取原文
           

摘要

Background Reminiscence is the act of thinking or talking about personal experiences that occurred in the past. It is a central task of old age that is essential for healthy aging, and it serves multiple functions, such as decision-making and introspection, transmitting life lessons, and bonding with others. The study of social reminiscence behavior in everyday life can be used to generate data and detect reminiscence from general conversations. Objective The aims of this original paper are to (1) preprocess coded transcripts of conversations in German of older adults with natural language processing (NLP), and (2) implement and evaluate learning strategies using different NLP features and machine learning algorithms to detect reminiscence in a corpus of transcripts. Methods The methods in this study comprise (1) collecting and coding of transcripts of older adults’ conversations in German, (2) preprocessing transcripts to generate NLP features (bag-of-words models, part-of-speech tags, pretrained German word embeddings), and (3) training machine learning models to detect reminiscence using random forests, support vector machines, and adaptive and extreme gradient boosting algorithms. The data set comprises 2214 transcripts, including 109 transcripts with reminiscence. Due to class imbalance in the data, we introduced three learning strategies: (1) class-weighted learning, (2) a meta-classifier consisting of a voting ensemble, and (3) data augmentation with the Synthetic Minority Oversampling Technique (SMOTE) algorithm. For each learning strategy, we performed cross-validation on a random sample of the training data set of transcripts. We computed the area under the curve (AUC), the average precision (AP), precision, recall, as well as F1 score and specificity measures on the test data, for all combinations of NLP features, algorithms, and learning strategies. Results Class-weighted support vector machines on bag-of-words features outperformed all other classifiers (AUC=0.91, AP=0.56, precision=0.5, recall=0.45, F1=0.48, specificity=0.98), followed by support vector machines on SMOTE-augmented data and word embeddings features (AUC=0.89, AP=0.54, precision=0.35, recall=0.59, F1=0.44, specificity=0.94). For the meta-classifier strategy, adaptive and extreme gradient boosting algorithms trained on word embeddings and bag-of-words outperformed all other classifiers and NLP features; however, the performance of the meta-classifier learning strategy was lower compared to other strategies, with highly imbalanced precision-recall trade-offs. Conclusions This study provides evidence of the applicability of NLP and machine learning pipelines for the automated detection of reminiscence in older adults’ everyday conversations in German. The methods and findings of this study could be relevant for designing unobtrusive computer systems for the real-time detection of social reminiscence in the everyday life of older adults and classifying their functions. With further improvements, these systems could be deployed in health interventions aimed at improving older adults’ well-being by promoting self-reflection and suggesting coping strategies to be used in the case of dysfunctional reminiscence cases, which can undermine physical and mental health.
机译:背景reminiscence是思考或谈论过去发生的个人经历的行为。这是一个对健康老化至关重要的老年的核心任务,它提供多种功能,例如决策和内省,传递寿命,与他人粘合。在日常生活中的社会remincence行为的研究可用于生成数据并从一般对话中检测refinisce。目标本原文的目标是(1)预处理德国老年人的谈话的预处理编码成绩单,具有自然语言处理(NLP),(2)使用不同的NLP功能和机器学习算法来实现和评估学习策略来检测回忆在成绩单的语料库中。方法本研究中的方法包括(1)收集和编码德语中的老年成人对话的成绩单,(2)预处理成绩单以产生NLP功能(文字袋式模型,言语款式的标签,预先磨碎的德国单词嵌入式),和(3)培训机器学习模型以使用随机林,支持向量机和自适应和极端梯度升压算法来检测Reminiscence。数据集包括2214个转录物,其中包括109个转录物,其具有重次remincence。由于数据中的课程不平衡,我们推出了三种学习策略:(1)类加权学习,(2)由投票合奏组成的元分类器,(3)与合成少数群体过采样技术(SMOTE)的数据增强算法。对于每个学习策略,我们对培训数据集的随机样本进行了交叉验证。我们计算了曲线(AUC)的区域,平均精度(AP),精度,回忆,以及测试数据的所有组合,算法和学习策略的所有组合都是对测试数据的分数和特异性措施。结果类加权支持向量机上的袋式特征优于所有其他分类器(AUC = 0.91,AP = 0.56,精度= 0.5,召回= 0.45,F1 = 0.48,特异性= 0.98),然后支持向量机Smote-Augmented数据和Word Embeddings功能(AUC = 0.89,AP = 0.54,精度= 0.35,召回= 0.59,F1 = 0.44,特异性= 0.94)。对于Meta分类器策略,在Word Embeddings和单词袋上培训的自适应和极端渐变升压算法优于所有其他分类器和NLP功能;然而,与其他策略相比,Meta分类器学习策略的性能较低,具有高度不平衡的精密召回权衡权衡。结论本研究规定了NLP和机器学习管道适用性的证据,以便在德语中的老年人日常谈话中自动检测回忆。本研究的方法和调查结果可能与设计不引人注目的计算机系统,以便在老年人日常生活中实时检测社会remincence,并对他们的功能进行分类。通过进一步的改进,这些系统可以通过促进自我反思和建议在功能失调的闭合案件的情况下,通过促进自我反思和建议所使用的应对策略来部署在旨在改善老年人的幸福干预措施的健康干预措施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号