首页> 外文期刊>WSEAS Transactions on Communications >Generating a Set of Rules to Determine The Gender of a Speaker of a Japanese Sentence
【24h】

Generating a Set of Rules to Determine The Gender of a Speaker of a Japanese Sentence

机译:生成一组规则以确定日语句子说话者的性别

获取原文
获取原文并翻译 | 示例
           

摘要

Some work has been reported on the problem of automatically determining the gender of a document's author as a part of researches to extract features of a document's author. Japanese language has expressions called masculine/feminine expression, and they can often indicate the gender of a speaker of a conversational sentence. The computer system needs this mechanism in order to make or understand natural Japanese conversational sentences. The authors made a system that determines the suitable gender of a speaker of a single conversational sentence and named it gender-determining system (GDS). It generates a set of rules to determine the more suitable gender of a speaker of a sentence automatically, by decision tree learning. The authors employed six linguistic features for each of two morphemes at the end of a sentence and presence or absence of morphemes whose part of speech is a miscellaneous pronoun or a particle for ending as features of decision tree learning. The authors calculated the accuracy of GDS using the cross validation method and it was approximately 69.3% when human could answer the same problem with approximately 71.7%. The authors showed decision tree learning is more suitable than multiple regression analysis or Bayesian estimation in order to classify the gender of the speaker of Japanese sentences and generate a set of rules to determine them, and selected the suitable features as the inputs of GDS. The set of rules GDS generated indicates, for example, women speak more politely than men in Japan.
机译:已经报道了一些有关自动确定文档作者性别的问题的工作,这是提取文档作者特征的研究的一部分。日语具有称为男性/女性表达的表达,并且它们通常可以指示对话句子的​​说话者的性别。计算机系统需要这种机制才能制作或理解自然的日语会话句子。作者创建了一个系统,该系统可以确定单个对话语句的说话者的合适性别,并将其命名为性别确定系统(GDS)。通过决策树学习,它会生成一组规则,以自动确定更适合说话者的性别。作者对句子结尾处的两个语素中的每个语素采用六个语言特征,并且存在或不存在语素的语素作为决策树学习的特征,其词性是其他代词或质点。作者使用交叉验证方法计算了GDS的准确性,当人类可以以大约71.7%回答相同的问题时,其准确性约为69.3%。作者表明,决策树学习比多元回归分析或贝叶斯估计更适合于对日语句子发音者的性别进行分类并生成一组规则来确定它们,并选择合适的特征作为GDS的输入。 GDS生成的一组规则表明,例如,日本女性说话比男性礼貌。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号