首页> 外国专利> METHODS AND APPARATUS FOR STUDYING LARGE SETS OF DATA

METHODS AND APPARATUS FOR STUDYING LARGE SETS OF DATA

机译:研究大数据集的方法和装置

摘要

Interactive Methods and apparatus for studying similarities of values in very large data sets. The methods and apparatus employ a dotplot in an interactive graphical user interface to make the relationship between the similarities and the data set visible. A variety of filtering, weighting, and compression techniques make it possible to employ the dot plot with sequences of more than 10,000 tokens and to interactively magnify the dot plot, change weighting and display quantization, and view the underlying data. Also disclosed is a technique which is employed in the apparatus for identifying long sequences of similar tokens. The apparatus is used in the study of large bodies of text and code.
机译:用于研究非常大的数据集中的值的相似性的交互式方法和设备。该方法和设备在交互式图形用户界面中采用点图以使相似性和数据集之间的关系可见。多种过滤,加权和压缩技术使点图具有超过10,000个令牌的序列成为可能,并且可以交互式地放大点图,更改权重和显示量化并查看基础数据。还公开了一种在设备中采用的用于识别相似令牌的长序列的技术。该设备用于研究大量文本和代码。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号