Tag Recommendation for Open Government Data by Multi-label Classification and Particular Noun Phrase Extraction

Abstract

Open government data (OGD) is statistical data produced and published by governments. Administrators often assign tags to the metadata of OGD. A tag, which consists of a single word or multiple words, describes the data. Tags help users understand the data without actually reading it and also help them search for OGD. However, administrators have to understand the data in detail in order to assign tags. We take two different approaches to assigning appropriate tags to OGD. First, we use a multi-label classification technique to assign tags to OGD from the tags in the training data. Second, we extract particular noun phrases from the metadata of OGD by calculating the difference between the frequency of a noun phrase and the frequencies of the single words within the noun phrase. Experiments using 196,587 datasets on Data.gov show that the prediction accuracy of the multi-label classification method is sufficient to develop a tag recommendation system. The experiments also show that our method for extracting particular noun phrases extracts some infrequent tags of the datasets.
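
The abstract does not give the exact scoring formula for the noun phrase extraction, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes that candidate noun phrases are already available as word tuples, that frequencies are raw counts over tokenized metadata, and that the score is the mean frequency of the constituent words minus the frequency of the phrase. The function names `count_phrase` and `phrase_gaps`, the sample metadata, and the scoring choice are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def count_phrase(tokens, phrase):
    """Count contiguous occurrences of `phrase` (a tuple of words) in `tokens`."""
    n = len(phrase)
    return sum(
        1 for i in range(len(tokens) - n + 1) if tuple(tokens[i:i + n]) == phrase
    )


def phrase_gaps(documents, candidates):
    """Map each candidate phrase to (mean word frequency - phrase frequency).

    Assumption: a small gap means the words occur mostly inside this phrase,
    which this sketch treats as a sign of a "particular" noun phrase.
    """
    word_freq = Counter(tok for doc in documents for tok in doc)
    gaps = {}
    for phrase in candidates:
        phrase_freq = sum(count_phrase(doc, phrase) for doc in documents)
        if phrase_freq == 0:
            continue  # the phrase never occurs, so there is nothing to score
        mean_word_freq = sum(word_freq[w] for w in phrase) / len(phrase)
        gaps[phrase] = mean_word_freq - phrase_freq
    return gaps


# Made-up metadata descriptions, tokenized by whitespace for illustration.
documents = [
    "annual bald eagle survey results".split(),
    "bald eagle nesting sites by county".split(),
    "annual survey of county road data".split(),
    "road data and annual traffic survey".split(),
]
candidates = [("bald", "eagle"), ("annual", "survey"), ("road", "data")]

gaps = phrase_gaps(documents, candidates)
ranked = sorted(gaps, key=gaps.get)  # smallest gap first
for phrase in ranked:
    print(" ".join(phrase), gaps[phrase])
```

In this toy input, "bald" and "eagle" never occur apart, so their gap is zero, while "annual" and "survey" mostly occur separately and get a larger gap. A real system would also need a noun phrase chunker and a threshold on the gap, neither of which is specified in the abstract.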

Bibliographic record

Source
International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management | 2018 | 1 (CD-ROM) | 9 pages
Authors
Yasuhiro Yamada; Tetsuya Nakatoh
Original format: PDF
Chinese Library Classification: G354-53
Keywords
Open government data; E-Government; Tag recommendation; Multi-label classification; Metadata
