首页> 外文会议>IEEE International Conference on Computer and Communication Systems >Book Genre Classification Based on Titles with Comparative Machine Learning Algorithms
【24h】

Book Genre Classification Based on Titles with Comparative Machine Learning Algorithms

机译:基于书名的图书类型分类与比较机器学习算法

获取原文

摘要

This paper presents algorithmic comparisons for producing a book's genre based on its title. While some titles are easy to interpret, some are irrelevant to the genre that they belong to. Henceforth, we seek to determine the optimal and most accurate method for accomplishing the task. Several data preprocessing steps were implemented, in which word embeddings were created to make the titles operable by the computer. Five different machine learning models were tested throughout the experiment. Each different algorithm was fine-tuned for attaining the best parameter values, while no modifications were conducted on the dataset. The results indicate that the Long Short-Term Memory (LSTM) with a dropout is the top performing architecture among the algorithms, with an accuracy of 65.58%. To the authors' knowledge, no prior study has been done about book genre classification by title, therefore the present study is the current best in the field.
机译:本文介绍了根据书名生成书体裁的算法比较。虽然有些标题易于理解,但有些与它们所属的类型无关。今后,我们寻求确定完成任务的最佳和最准确的方法。实施了几个数据预处理步骤,其中创建了词嵌入,以使标题可由计算机操作。在整个实验中测试了五种不同的机器学习模型。每个不同的算法都经过微调以达到最佳参数值,而没有对数据集进行任何修改。结果表明,具有遗漏的长短期内存(LSTM)是该算法中性能最高的体系结构,准确性为65.58%。据作者所知,尚未进行按标题对书籍体裁分类的研究,因此,本研究是当前该领域中最好的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号