Conference: IEEE International Conference on Semantic Computing

Extracting Semantics from Census-based Reference Data


Abstract

We present preliminary findings in extracting semantics from reference data generated by the United States Census Bureau. US Census reference data is based upon surveys designed to collect demographics and other socioeconomic factors by geographical regions. These data sets contain thousands of variables; this complexity makes the reference data difficult to learn, query, and integrate into analyses. Researchers often avoid working directly with US Census reference data and instead work with census-derived extracts capturing a much smaller subset of records. We propose to use natural language processing to extract the semantics of census-based reference data and to map census variables to known ontologies. This semantic processing reduces the large volume of variables into more manageable sets of conceptual variables that can be organized by meaning and semantic type.
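
As a rough illustration of reducing many variables to a smaller set of conceptual variables, the sketch below groups variable labels by concept using simple keyword matching. It is not the authors' method: the concept lexicon `CONCEPT_TERMS`, the variable identifiers, and the labels are all hypothetical, and a real system would presumably draw its terms from an established ontology and use fuller natural language processing than whole-word matching.

```python
import re
from collections import defaultdict

# Hypothetical concept lexicon (an assumption for illustration only);
# a real pipeline would take these terms from a known ontology.
CONCEPT_TERMS = {
    "sex": {"male", "female"},
    "age": {"years", "age"},
    "income": {"income", "earnings"},
}

# Hypothetical variable identifiers and labels, not actual census variables.
VARIABLES = {
    "V001": "Total population, male, under 5 years",
    "V002": "Total population, female, 65 years and over",
    "V003": "Median household income in the past 12 months",
}


def tokenize(label):
    """Lowercase a label and return its set of alphabetic word tokens."""
    return set(re.findall(r"[a-z]+", label.lower()))


def map_to_concepts(label):
    """Return the sorted concepts whose terms share a token with the label."""
    tokens = tokenize(label)
    return sorted(c for c, terms in CONCEPT_TERMS.items() if tokens & terms)


def group_by_concept(variables):
    """Group variable identifiers under every concept their label matches."""
    groups = defaultdict(list)
    for var_id, label in variables.items():
        for concept in map_to_concepts(label):
            groups[concept].append(var_id)
    return dict(groups)


if __name__ == "__main__":
    for concept, var_ids in group_by_concept(VARIABLES).items():
        print(concept, var_ids)
```

Matching on whole-word tokens rather than substrings keeps "female" from being counted as "male"; a variable may still belong to several concepts at once, which is why the grouping maps each concept to a list of variables.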
