首页> 外文会议>European conference on IR research >StyleExplorer: A Toolkit for Textual Writing Style Visualization
【24h】

StyleExplorer: A Toolkit for Textual Writing Style Visualization

机译:StyleExplorer:用于文本写作风格可视化的工具包

获取原文

摘要

The analysis of textual writing styles is a well-studied problem with ongoing and active research in fields like authorship attribution, author profiling, text segmentation or plagiarism detection. While many features have been proposed and shown to be effective to characterize authors or document types in terms of high-dimensional feature vectors, an intuitive, human-friendly view on the computed data is often lacking. For example, machine learning algorithms are able to attribute previously unseen documents to a set of known authors by utilizing those features, but a visualization of the most discriminating features is usually not provided. To this end, we present StyleExplorer, a freely available web tool that is able to extract textual features from documents and to visualize them in multiple variants. Besides analyzing single documents intrinsically, it is also possible to visually compare multiple documents in single views with respect to selected metrics, making it a valuable analysis tool for various tasks in natural language processing as well as for areas in the humanities that work and analyze textual data.
机译:在作者身份归因,作者概况分析,文本分割或窃检测等领域,正在进行且积极的研究是对文本写作风格进行分析的一个经过充分研究的问题。尽管已经提出了许多功能,并且显示出许多功能可以有效地根据高维特征向量来表征作者或文档类型,但通常仍缺乏对计算数据的直观,人性化的视图。例如,机器学习算法能够通过利用那些特征将先前未见过的文档归因于一组已知的作者,但是通常不提供最具区分性的特征的可视化。为此,我们介绍了StyleExplorer,这是一个免费的网络工具,能够从文档中提取文本特征并将其可视化为多种变体。除了本质上分析单个文档之外,还可以相对于所选度量直观地比较单个视图中的多个文档,这使其成为用于自然语言处理中各种任务以及人文领域工作和分析文本的有价值的分析工具。数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号