【24h】

Feature Extraction Using Restricted Bootstrapping

机译:使用受限自举进行特征提取

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The bootstrapping method is known as an application of the Page-rank technique for documents and words. The technique calculates the score of the words by mutually propagating the score of the words and the documents. However, sometimes the result is far away from the initial query word. The problem is known as gtopic drifth. This paper proposes to restrict the words to be to the top t words in the process of bootstrapping. The method is simpler than the technique known so far. The method is applied for the real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. It is confirmed that the method prevents the topic drift.
机译:自举方法被称为用于文档和单词的Page-rank技术的应用。该技术通过相互传播单词和文档的分数来计算单词的分数。但是,有时结果离初始查询词很远。这个问题被称为gtopic漂移h。本文提出在引导过程中将单词限制为前t个单词。该方法比迄今为止已知的技术简单。该方法适用于真实的破产信息文件,以提取与查询密切相关的破产原因。证实了该方法防止了主题漂移。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号