首页> 外国专利> Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

机译:通过主题特定的语言模型和主题特定的标签统计信息,通过用户交互进行文本分段和标签分配

摘要

The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.
机译:本发明涉及用于通过利用在带注释的训练数据上训练的统计模型来构造非结构化文本的方法,计算机程序产品,分割系统和用户界面。该方法将文本分割为文本部分,并将标签分配给文本部分作为部分标题。所执行的分割和分配被提供给用户以进行一般检查。另外,向用户提供替代的分段和标签分配,从而能够选择替代的分段和替代标签以及输入用户定义的分段和用户定义的标签。响应于用户引入的修改,发起多个不同的动作,包括对文档的连续部分或整个文档的重新分段和重新标记。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号