【24h】

Question identification on Turkish tweets

机译:土耳其推文上的问题识别

获取原文

摘要

Question identification is a field Natural Language Processing and also Information Extraction. The aim of work is detecting Turkish tweets which are including question expressions. The application contains three stages: applying some pre-processing steps to data set for cleaning unnecessary data like Retweet, determining candidate tweets via a rule-based method and extracting tweets which are really include questions using Conditional Random Fields. For this purpose one million tweets were collected and labeled. Tweets are ungrammatical data type. According to results; the model developed has been largely successful on tweets. Additionally, it is a first study about identifying questions on Turkish tweets.
机译:问题识别是现场自然语言处理以及信息提取。工作的目标是检测土耳其推文,包括问题表达。该应用程序包含三个阶段:将某些预处理步骤应用于用于清除转发等不必要数据的数据集,通过基于规则的方法确定候选推文,并将其提取的推文使用条件随机字段确定真正包含问题。为此,收集并标记了一百万条推文。推文是非语法数据类型。根据结果​​;开发的模型在很大程度上在推文上成功。此外,它是关于识别土耳其推文的问题的第一次研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号