首页> 外文会议>International Conference on Information and Communication Technology >Academic Expert Finding in Indonesia using Word Embedding and Document Embedding: A Case Study of Fasilkom UI
【24h】

Academic Expert Finding in Indonesia using Word Embedding and Document Embedding: A Case Study of Fasilkom UI

机译:使用词嵌入和文档嵌入在印尼寻找学术专家:以Fasilkom UI为例

获取原文

摘要

Expertise retrieval covers the problems of expert and expertise finding. In academia, expert finding can be beneficial in finding a research partner or a potential thesis supervisor. This research finds the experts in the Faculty of Computer Science in Universitas Indonesia (Fasilkom UI) using the thesis abstract and metadata of Fasilkom UI students. The methods that are used to represent the query and expertise of the lecturers are the combination of word2vec and doc2vec, which are word embedding and document embedding, respectively. Both embeddings are able to model semantic information, which is necessary for solving the problem of vocabulary mismatch in search problems. Our result shows that representing the expertise query with word2vec leads to better performance than using doc2vec. In addition, we also found that generally, the performance of the embedding models is comparable to the standard retrieval model BM25 in retrieving experts using expertise queries in both Indonesian and English languages.
机译:专长检索涵盖专家和专长查找的问题。在学术界,专家发现对于寻找研究伙伴或潜在论文指导者可能会有所帮助。本研究使用Fasilkom UI学生的论文摘要和元数据寻找印度尼西亚大学(Fasilkom UI)计算机科学学院的专家。用于表示讲师的查询和专业知识的方法是word2vec和doc2vec的组合,分别是词嵌入和文档嵌入。两种嵌入都能够对语义信息进行建模,这对于解决搜索问题中的词汇不匹配问题是必不可少的。我们的结果表明,与使用doc2vec相比,使用word2vec表示专业知识查询可带来更好的性能。此外,我们还发现,在使用印度尼西亚语和英语两种语言的专业知识来检索专家的过程中,嵌入模型的性能通常与标准检索模型BM25相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号