首页> 外国专利> COMPUTE ENGINE ARCHITECTURE TO SUPPORT DATA-PARALLEL LOOPS WITH REDUCTION OPERATIONS

COMPUTE ENGINE ARCHITECTURE TO SUPPORT DATA-PARALLEL LOOPS WITH REDUCTION OPERATIONS

机译:计算引擎架构以减少操作支持数据并行循环

摘要

Techniques involving a compute engine architecture to support data-parallel loops with reduction operations are described. In some embodiments, a hardware processor includes a memory unit and a plurality of processing elements (PEs). Each of the PEs is directly coupled via one or more neighbor-to-neighbor links with one or more neighboring PEs so that each PE can receive a value from a neighboring PE, provide a value to a neighboring PE, or both receive a value from one neighboring PE and also provide a value to another neighboring PE. The hardware processor also includes a control engine coupled with the plurality of PEs that is to cause the plurality of PEs to collectively perform a task to generate one or more output values by each performing one or more iterations of a same subtask of the task.
机译:描述了涉及计算引擎体系结构以支持具有约简操作的数据并行循环的技术。在一些实施例中,硬件处理器包括存储单元和多个处理元件(PE)。每个PE都通过一个或多个邻居到邻居链路直接与一个或多个相邻PE耦合,以便每个PE可以从一个相邻PE接收一个值,向一个相邻PE提供一个值,或者都从一个相邻PE接收一个值一个相邻的PE,并且还向另一个相邻的PE提供值。硬件处理器还包括与多个PE耦合的控制引擎,该控制引擎将通过使每个PE对任务的同一子任务进行一次或多次迭代来共同执行一个任务以生成一个或多个输出值。

著录项

  • 公开/公告号US2018189110A1

    专利类型

  • 公开/公告日2018-07-05

    原文格式PDF

  • 申请/专利权人 INTEL CORPORATION;

    申请/专利号US201615396510

  • 发明设计人 GANESH VENKATESH;DEBORAH MARR;

    申请日2016-12-31

  • 分类号G06F9/50;G06F9/48;

  • 国家 US

  • 入库时间 2022-08-21 12:57:04

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号