Extraction Method of Judicial Language Entities Based On Regular Expression

机译：基于正则表达的司法语言实体提取方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the coming of the era of rule of law and intelligence, natural language processing technology plays a pivotal role. At present, a large number of unstructured judicial texts rely on manual processing and archiving. In order to make better use of them and achieve professional application, this paper proposes the goal of analyzing the structure of judgments, extracting the judicial language entities, and describing cases in the form of entity circulation map. As the text carrier of unstructured public events, the judicial document is of better standard format, finely crafted and easy processing, and becomes the research object of this paper. Through the survey of the development of named entity recognition technology, testing and contrasting the use of extraction tool, GATE, as well as considering the cost and effectiveness in the judicial field, this paper put forward a rule-based regular expression method for entity recognition. The scrapy crawler framework is used to obtain judgments classified from China Judgments Online website, so as to realize the task of analyzing the structure of judgments and extracting the judicial language entities.

机译：随着法治时代的来源，自然语言加工技术发挥了关键作用。目前，大量非结构化的司法文本依赖于手动处理和归档。为了更好地利用它们并实现专业应用，本文提出了分析判断结构，提取司法语言实体的结构，并描述实体循环地图形式的案例。作为非结构化公共事件的文本载体，司法文件具有更好的标准格式，精细制作和简单的处理，成为本文的研究对象。通过对命名实体识别技术的发展，测试和对比使用提取工具，门，以及考虑司法领域的成本和有效性，本文提出了一种基于规则的实体识别的正则表达方法。 SCRAPE履带框架用于获得从中国判断在线网站分类的判决，以实现分析判断结构并提取司法语言实体的任务。

著录项

来源
《International Conference on Intelligent Computing and Signal Processing》|2021年|372-376|共5页
会议地点
作者
JIAO Kainan; LI Xin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Crawlers; Manuals; Tools; Signal processing; Logic gates; Natural language processing; Task analysis;

机译：爬行者;手册;工具;信号处理;逻辑门;自然语言处理;任务分析;
入库时间 2022-08-26 13:57:51

相似文献

外文文献
中文文献
专利

1. Rule Based Chunk Extraction from PDF Documents Using Regular Expressions and Natural Language Processing [J] . Amol Rajaram Karad, Rahul Raghvendra Joshi International journal of computational intelligence research . 2021,第1期

机译：使用正则表达式和自然语言处理从PDF文档的规则的块提取
2. Rule Based Chunk Extraction from PDF Documents Using Regular Expressions and Natural Language Processing [J] . Amol Rajaram Karad, Rahul Raghvendra Joshi International Journal of Applied Engineering Research . 2015,第3期

机译：使用正则表达式和自然语言处理从PDF文档中基于规则的块提取
3. EventScript: An event-processing language based on regular expressions with actions [J] . Cohen NH, Kalleberg KT ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2008,第7期

机译：EventScript：一种基于事件的正则表达式的事件处理语言
4. Enabling Information Extraction by Inference of Regular Expressions from Sample Entities [C] . Falk Brauer, Robert Rieger, Adrian Mocan, ACM international conference on information and knowledge management . 2011

机译：通过从样本实体推断正则表达式来启用信息提取
5. Internet data extraction based on automatic regular expression inference. [D] . Lin, Ye. 2007

机译：基于自动正则表达式推断的Internet数据提取。
6. A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations [O] . Tome Eftimov, Barbara Koroušić Seljak, Peter Korošec -1

机译：基于规则的命名实体识别方法用于基于证据的饮食推荐知识的提取
7. A Japanese-Chinese Cross-Language Entity Linking Method with Entity Disambiguation Based on Document Similarity [O] . Xiang Song, Jialiang Zhou, Fuminori Kimura, 2016

机译：一种基于文档相似性的实体歧义的日语 - 中文跨语言实体链接方法

Extraction Method of Judicial Language Entities Based On Regular Expression

摘要

著录项

相似文献

相关主题

期刊订阅