
Summarization of meetings using word clouds

Abstract

In this study, parsimonious language models were used to construct word clouds of the proceedings of the European Parliament. Multiple design choices had to be made and are discussed. Important features are stemming during tokenization, the inclusion of bigrams in the word cloud, and multilingualism. In addition, the original parsimonious language models were extended with an additional term that dampens unigrams that already occur in the word cloud. This algorithm was tested in a small user study, using proceedings of the University of Amsterdam Science faculty's student council. Members of this council were asked to state their preference among multiple word clouds constructed using either parsimonious language models or simple Term Frequencies (TF) with stop words. Respondents preferred the word clouds constructed using parsimonious language models by 68% to 29% (p < 0.05, two-tailed paired t-test). Besides the system design, further technical findings, the social significance of applying word clouds to political data, and possibilities for future work are discussed.
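To illustrate the core idea, the following is a minimal sketch of how a basic parsimonious language model can be estimated with expectation-maximization. It is not the system described in the abstract: stemming, bigrams, multilingual handling, and the additional dampening term are all left out because the abstract does not specify them. The function name `parsimonious_lm`, the parameters `lam`, `iterations`, and `threshold`, and their default values are assumptions made for illustration, and the background corpus is assumed to contain the document itself.

```python
from collections import Counter


def parsimonious_lm(doc_tokens, corpus_tokens, lam=0.1, iterations=50, threshold=1e-4):
    """Estimate P(t|D) for one document against a background corpus.

    Assumptions (not taken from the abstract): `lam` is the weight of the
    document model, and `corpus_tokens` includes the document's own tokens.
    """
    tf = Counter(doc_tokens)       # term frequencies in the document
    cf = Counter(corpus_tokens)    # term frequencies in the background corpus
    corpus_total = sum(cf.values())
    doc_total = sum(tf.values())

    # Start from the maximum-likelihood document model.
    p_doc = {t: c / doc_total for t, c in tf.items()}

    for _ in range(iterations):
        # E-step: expected number of occurrences explained by the document
        # model rather than by the background model.
        expected = {}
        for t, p in p_doc.items():
            p_bg = cf[t] / corpus_total
            expected[t] = tf[t] * (lam * p) / ((1 - lam) * p_bg + lam * p)

        # M-step: renormalize the expected counts into probabilities.
        norm = sum(expected.values())
        p_doc = {t: e / norm for t, e in expected.items()}

        # Prune terms with negligible probability, then renormalize.
        kept = {t: p for t, p in p_doc.items() if p >= threshold}
        if not kept:
            break
        norm = sum(kept.values())
        p_doc = {t: p / norm for t, p in kept.items()}

    return p_doc


if __name__ == "__main__":
    doc = "the the the council budget budget the of".split()
    corpus = doc + ["the"] * 200 + ["of"] * 100 + ["vote"] * 20
    model = parsimonious_lm(doc, corpus)
    # Terms with the highest probability would be drawn largest in the cloud.
    for term, prob in sorted(model.items(), key=lambda kv: -kv[1]):
        print(f"{term}\t{prob:.3f}")
```

In this toy input, "the" is the most frequent term in the document, so a plain TF ranking would put it first. The parsimonious estimate instead shifts probability mass toward terms that are frequent in the document but rare in the background corpus, which is the property that makes such models attractive for word clouds.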