首页> 外文会议>IEEE International Conference on Semantic Computing >Extracting Semantics from Census-based Reference Data

【24h】

Extracting Semantics from Census-based Reference Data

机译：从基于人口普查的参考数据中提取语义

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present preliminary findings in extracting semantics from reference data generated by the United States Census Bureau. US Census reference data is based upon surveys designed to collect demographics and other socioeconomic factors by geographical regions. These data sets contain thousands of variables; this complexity makes the reference data difficult to learn, query, and integrate into analyses. Researchers often avoid working directly with US Census reference data and instead work with census-derived extracts capturing a much smaller subset of records. We propose to use natural language processing to extract the semantics of census-based reference data and to map census variables to known ontologies. This semantic processing reduces the large volume of variables into more manageable sets of conceptual variables that can be organized by meaning and semantic type.

机译：我们提取从美国普查局产生的参考数据中提取语义的初步发现。美国人口普查参考数据基于调查，旨在通过地理区域收集人口统计数据和其他社会经济因素。这些数据集包含数千个变量;这种复杂性使参考数据难以学习，查询和集成到分析中。研究人员经常避免与我们人口普查参考数据直接工作，而是与人口普查派生提取物一起工作，捕获更小的记录子集。我们建议使用自然语言处理来提取基于人口普查的参考数据的语义，并将人口普查变量映射到已知本体。该语义处理将大量变量减少到更可管理的概念变量集中，这些变量可以通过含义和语义类型组织。

著录项

来源
《IEEE International Conference on Semantic Computing 》|2021年|88-89|共2页
会议地点
作者
Daniel R. Harris; Nima Seyedtalebi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Conferences; Semantics; Ontologies; Natural language processing; Complexity theory; Data mining;

机译：会议;语义;本体;自然语言处理;复杂性理论;数据挖掘;

相似文献

外文文献
中文文献
专利

1. 汉语双关语理解中可供性提取对双关语双重语义提取的促进作用 [J] . 廖巧云, 胡权, 高梦婷, 中国应用语言学：英文版 . 2021 ,第001期
2. An assessment of the utility of LiDAR data in extracting base-year floorspace and a comparison with the census-based approach [J] . Sajad Shiravi, Ming Zhong, Seyed Ahad Beykaei, Environment and Planning . 2015 ,第4期

机译：评估LiDAR数据在提取基准年占地面积中的实用性，并与基于普查的方法进行比较
3. SETL: A programmable semantic extract-transform-load framework for semantic data warehouses [J] . Deb Nath Rudra Pratap, Hose Katja, Pedersen Torben Bach, Information Systems . 2017 ,第auga期

机译：SETL：语义数据仓库的可编程语义提取-转换-加载框架
4. Web of Data and Web of Entities: Identity and Reference in Interlinked Data in the Semantic Web [J] . Paolo Bouquet, Heiko Stoermer, Massimiliano Vignolo Knowledge Technology & Policy . 2012 ,第1期

机译：数据网和实体网：语义网中互连数据中的标识和引用
5. Extracting Metadata of Scientific References in Patents Based on Combination of Representation Learning and Machine Learning [C] . Jinzhu Zhang, Yiming Hu Annual Meeting of the Association for Information Science and Technology . 2019

机译：基于代表学习和机器学习的组合提取专利科学参考的元数据
6. Extracting quantitative information from nonnumeric marketing data: An augmented latent semantic analysis approach. [D] . Arroniz, Inigo. 2007

机译：从非数字营销数据中提取定量信息：一种增强的潜在语义分析方法。
7. Overcoming the absence of socioeconomic data in medical records: validation and application of a census-based methodology. [O] . N Krieger 1992

机译：克服病历中缺乏社会经济数据的方法：基于普查方法的验证和应用。
8. SETL: A programmable semantic extract-transform-load framework for semantic data warehouses [O] . Rudra Pratap Deb Nath, Katja Hose, Torben Bach Pedersen, 2017

机译：SETL：语义数据仓库的可编程语义提取 - 转换框架
9. Domain Independent Framework for Extracting Linked Semantic Data from Tables. [R] . Mulwad, V., Finin, T., Joshi, A. 2012

机译：用于从表中提取链接语义数据的域独立框架。

Extracting Semantics from Census-based Reference Data

摘要

著录项

相似文献

相关主题

期刊订阅