Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers

Johann Hauswald; Michael A. Laurenzano; Yunqi Zhang; Cheng Li; Austin Rovinski; Arjun Khurana; Ronald G. Dreslinski; Trevor Mudge; Vinicius Petrucci; Lingjia Tang; Jason Mars; Clarity Lab

首页> 外文期刊>Computer architecture news >Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers

【24h】

Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers

机译：Sirius：开放的端到端语音和视觉个人助理及其对未来仓库规模计算机的启示

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

As user demand scales for intelligent personal assistants (IPAs) such as Apple's Siri, Google's Google Now, and Microsoft's Cortana, we are approaching the computational limits of current datacenter architectures. It is an open question how future server architectures should evolve to enable this emerging class of applications, and the lack of an open-source IPA workload is an obstacle in addressing this question. In this paper, we present the design of Sirius, an open end-to-end IPA web-service application that accepts queries in the form of voice and images, and responds with natural language. We then use this workload to investigate the implications of four points in the design space of future accelerator-based server architectures spanning traditional CPUs, GPUs, manycore throughput co-processors, and FP-GAs. To investigate future server designs for Sirius, we decompose Sirius into a suite of 7 benchmarks (Sirius Suite) comprising the computationally intensive bottlenecks of Sirius. We port Sirius Suite to a spectrum of accelerator platforms and use the performance and power trade-offs across these platforms to perform a total cost of ownership (TCO) analysis of various server design points. In our study, we find that accelerators are critical for the future scalability of IPA services. Our results show that GPU- and FPGA-accelerated servers improve the query latency on average by 10× and 16×. For a given throughput, GPU- and FPGA-accelerated servers can reduce the TCO of datacenters by 2.6 × and 1.4 ×, respectively.

机译：随着用户对智能个人助理（IPA）（例如Apple的Siri，Google的Google Now和Microsoft的Cortana）的需求扩展，我们正在接近当前数据中心体系结构的计算极限。这是一个悬而未决的问题，未来的服务器体系结构应如何发展以支持这种新兴的应用程序类别，而缺乏开源IPA工作负载是解决该问题的障碍。在本文中，我们介绍Sirius的设计，Sirius是一个开放的端到端IPA Web服务应用程序，它接受语音和图像形式的查询，并以自然语言进行响应。然后，我们使用此工作负载来调查未来基于加速器的服务器体系结构的设计空间中四点的含义，这些体系结构跨越传统的CPU，GPU，许多核心吞吐量协处理器和FP-GA。为了研究Sirius的未来服务器设计，我们将Sirius分解为7个基准测试套件（Sirius Suite），其中包括计算密集型Sirius瓶颈。我们将Sirius Suite移植到各种加速器平台上，并使用这些平台之间的性能和功率折衷来对各种服务器设计点进行总拥有成本（TCO）分析。在我们的研究中，我们发现加速器对于IPA服务的未来可扩展性至关重要。我们的结果表明，GPU和FPGA加速的服务器将查询延迟平均提高了10倍和16倍。对于给定的吞吐量，GPU和FPGA加速的服务器可以分别将数据中心的TCO降低2.6×和1.4×。

著录项

来源
《Computer architecture news》 |2015年第1期|223-238|共16页
作者
Johann Hauswald; Michael A. Laurenzano; Yunqi Zhang; Cheng Li; Austin Rovinski; Arjun Khurana; Ronald G. Dreslinski; Trevor Mudge; Vinicius Petrucci; Lingjia Tang; Jason Mars; Clarity Lab;
展开▼
作者单位

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

University of Michigan- Ann Arbor, MI, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
datacenters; warehouse scale computers; emerging workloads; intelligent personal assistants;

机译：数据中心仓库规模的计算机;新出现的工作量;聪明的个人助理;

相似文献

外文文献
中文文献
专利

1. Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers [J] . Hauswald Johann, Laurenzano Michael A., Zhang Yunqi, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2015,第4期

机译：Sirius：开放的端到端语音和视觉个人助理及其对未来仓库规模计算机的启示
2. Designing Future Warehouse-Scale Computers for Sirius, an End-to-End Voice and Vision Personal Assistant [J] . Hauswald Johann, Laurenzano Michael A., Zhang Yunqi, ACM transactions on computer systems . 2016,第1期

机译：为端到端语音和视觉个人助理Sirius设计未来的仓库规模计算机
3. Sirius Implications for Future Warehouse-Scale Computers [J] . Johann Hauswald, Michael A. Laurenzano, Yunqi Zhang, IEEE Micro . 2016,第3期

机译：Sirius对未来仓库规模计算机的启示
4. DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers [C] . Hauswald Johann, Kang Yiping, Laurenzano Michael A., 42th Annual International Symposium on Computer Architecture . 2015

机译：DjiNN和Tonic：DNN即服务及其对未来仓库规模计算机的影响
5. Voices with Vision: Writing Black, Feminist Futures in Twentieth Century African America [D] . Alexander, Phoenix 2019

机译：具有愿景的声音：在二十世纪非洲美国写黑，女权主义期货
6. Computer Applications in Medical Care. Computers in Nursing. Computer Uses in Nursing Information Systems: Future Perspectives: Visions of the Future for Nursing Information Systems: A Panel Discussion [O] . Judy G. Ozbolt 1983

机译：医疗保健中的计算机应用。护理中的计算机。护理信息系统中的计算机使用：未来观点：护理信息系统的未来愿景：小组讨论
7. Personal Information Disclosure via Voice Assistants: The Personalization–Privacy Paradox [O] . Debajyoti Pal, Chonlameth Arpnikanondt, Mohammad Abdur Razzaque 2020

机译：通过语音助理个人信息披露：个性化 - 隐私悖论

Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers

摘要

著录项

相似文献

相关主题

期刊订阅