首页> 外文会议>International Conference on Legal Knowledge and Information Systems >Computer-Assisted Creation of Boolean Search Rules for Text Classification in the Legal Domain
【24h】

Computer-Assisted Creation of Boolean Search Rules for Text Classification in the Legal Domain

机译:计算机辅助创建Boolean搜索规则的法律域中的文本分类

获取原文

摘要

In this paper, we present a method of building strong, explainable classifiers in the form of Boolean search rules. We developed an interactive environment called CASE (Computer Assisted Semantic Exploration) which exploits word co-occurrence to guide human annotators in selection of relevant search terms. The system seamlessly facilitates iterative evaluation and improvement of the classification rules. The process enables the human annotators to leverage the benefits of statistical information while incorporating their expert intuition into the creation of such rules. We evaluate classifiers created with our CASE system on 4 datasets, and compare the results to machine learning methods, including SKOPE rules, Random forest, Support Vector Machine, and fastText classifiers. The results drive the discussion on trade-offs between superior compactness, simplicity, and intuitiveness of the Boolean search rules versus the better performance of state-of-the-art machine learning models for text classification.
机译:在本文中,我们提出了一种以布尔搜索规则的形式构建强大,可解释的分类器的方法。我们开发了一个被称为案例(计算机辅助语义探索)的交互式环境,该互动环境利用Word Co-operation来指导人类注释在选择相关的搜索项中。该系统无缝促进迭代评估和改进分类规则。该过程使人类注入者能够利用统计信息的益处,同时将其专家直接纳入这些规则的创建。我们评估在4个数据集上使用我们的案例系统创建的分类器,并将结果与​​机器学习方法进行比较,包括Skope规则,随机林,支持向量机和FastText分类器。结果推动了对布尔搜索规则的卓越紧凑性,简单性和直观性之间的权衡讨论,而最先进的机器学习模型进行文本分类的更好性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号