【24h】

Streaming Analysis of Discourse Participants

机译:话语参与者的流分析

获取原文

摘要

Inferring attributes of discourse participants has been treated as a batch-processing task: data such as all tweets from a given author are gathered in bulk, processed, analyzed for a particular feature, then reported as a result of academic interest. Given the sources and scale of material used in these efforts, along with potential use cases of such analytic tools, discourse analysis should be reconsidered as a streaming challenge. We show that under certain common formulations, the batch-processing analytic framework can be decomposed into a sequential series of updates, using as an example the task of gender classification. Once in a streaming framework, and motivated by large data sets generated by social media services, we present novel results in approximate counting, showing its applicability to space efficient streaming classification.
机译:推断话语参与者的属性已被视为批处理任务:批量收集诸如来自给定作者的所有推文之类的数据,进行处理,针对特定功能进行分析,然后根据学术兴趣进行报告。考虑到这些工作中使用的材料的来源和规模,以及此类分析工具的潜在用例,话语分析应被视为一项艰巨的挑战。我们表明,在某些常用公式的约束下,批处理分析框架可以分解为一系列连续更新,以性别分类任务为例。一旦处于流传输框架中,并受社交媒体服务生成的大数据集的激励,我们在近似计数中呈现新颖的结果,表明其适用于空间高效的流分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号