机译:TurboDL:用细粒度多流调度改善GPU上的CNN训练
Huazhong Univ Sci & Technol Natl Engn Res Ctr Big Data Technol & Syst Serv Comp Technol & Syst Lab Cluster & Grid Comp Lab Sch Comp Sci & Technol Wuhan 430074 Hubei Peoples R China;
Huazhong Univ Sci & Technol Natl Engn Res Ctr Big Data Technol & Syst Serv Comp Technol & Syst Lab Cluster & Grid Comp Lab Sch Comp Sci & Technol Wuhan 430074 Hubei Peoples R China;
Huazhong Univ Sci & Technol Natl Engn Res Ctr Big Data Technol & Syst Serv Comp Technol & Syst Lab Cluster & Grid Comp Lab Sch Comp Sci & Technol Wuhan 430074 Hubei Peoples R China;
Univ Warwick Dept Comp Sci Coventry CV4 7AL W Midlands England;
Univ Sydney Sch Comp Sci Sydney NSW 2006 Australia;
Training; Graphics processing units; Synchronization; Convolution; Kernel; Resource management; Parallel processing; Deep learning; parallelism optimization; scheduling; GPU;
机译:HGP4CNN:用于培训现代GPU的卷积神经网络的有效平行化框架
机译:FRF:朝着经线调度器友好的STT-RAM / SRAM精细颗粒混合GPGPU注册文件设计
机译:具有3D锚点的端到端CNN和LSTM网络,用于4D微观图像中的有丝分子细胞检测及其在多个GPU上的并行实现
机译:虚拟GPU资源的高效共享和精细调度
机译:现代GPU的提示辅助调度
机译:使用深度多任务CNN的细粒度人脸注释
机译:TurboDL:用细粒度多流调度改善GPU上的CNN训练