What Can We Learn from Four Years of Data Center Hardware Failures?

机译：从四年的数据中心硬件故障中我们可以学到什么？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hardware failures have a big impact on the dependability of large-scale data centers. We present studies on over 290,000 hardware failure reports collected over the past four years from dozens of data centers with hundreds of thousands of servers. We examine the dataset statistically to discover failure characteristics along the temporal, spatial, product line and component dimensions. We specifically focus on the correlations among different failures, including batch and repeating failures, as well as the human operators' response to the failures. We reconfirm or extend findings from previous studies. We also find many new failure and recovery patterns that are the undesirable by-product of the state-of-the-art data center hardware and software design.

机译：硬件故障对大型数据中心的可靠性有很大影响。我们对过去四年来从数十个具有数十万台服务器的数据中心收集的290,000多个硬件故障报告进行了研究。我们对统计数据进行统计检查，以发现沿时间，空间，产品线和组件尺寸的故障特征。我们特别关注不同故障之间的相关性，包括批量和重复性故障，以及操作人员对故障的响应。我们重新确认或扩展先前研究的结果。我们还发现许多新的故障和恢复模式，这些都是最新的数据中心硬件和软件设计所不希望的副产品。

著录项

来源
《Annual IEEE/IFIP International Conference on Dependable Systems and Networks》|2017年|25-36|共12页
会议地点
作者
Guosai Wang; Lifei Zhang; Wei Xu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Hardware; Servers; Frequency modulation; Maintenance engineering; Companies; Electric breakdown;

机译：硬件;服务器;频率调制;维护工程;公司;电气故障;

相似文献

外文文献
中文文献
专利

1. Paul B. Carroll, Chunka. Mui, Billion Dollar Lessons: What You Can Learn from Inexcusable business failures of The Last 25 Years, The Penguin Croup (2008) [J] . Afie M. Badawy Journal of engineering and technology management . 2009,第3期

机译：保罗·卡洛尔（Paul B.梅，十亿美元的经验教训：您可以从过去25年的不可原谅的商业失败中学到什么，《企鹅大游行》（Penguin Croup，2008年）
2. Widespread Use of High-dose Ceftriaxone Therapy for Uncomplicated Gonorrhea Without Reported Ceftriaxone Treatment Failure: Results From 5 Years of Multicenter Surveillance Data in China (vol 70, pg 99, 2020) [J] . Han Y., Yin Y., Dai X. Clinical infectious diseases . 2019,第12期

机译：广泛使用高剂量头孢菌治疗，无需报告的Ceftriaxone治疗失败：结果来自中国五年多中心监测数据（Vol 70，PG 99,2020）
3. Attributes of Performance-centered Systems: What Can We Learn from Five Years of EPSS/PCD Competition Award Winners? [J] . CRAIG MARION Technical communication . 2002,第4期

机译：以性能为中心的系统的属性：从五年的EPSS / PCD竞赛获奖者中学到什么？
4. What Can We Learn from Four Years of Data Center Hardware Failures? [C] . Guosai Wang, Lifei Zhang, Wei Xu Annual IEEE/IFIP International Conference on Dependable Systems and Networks . 2017

机译：我们可以从四年的数据中心硬件故障中学到什么？
5. Large Scale Studies of Memory, Storage, and Network Failures in a Modern Data Center [D] . Meza, Justin J. 2018

机译：现代数据中心的内存，存储和网络故障的大规模研究
6. Crash course in EPaCCS (Electronic Palliative Care Coordination Systems): 8 years of successes and failures in patient data sharing to learn from [O] . Mila Petrova, Julia Riley, Julian Abel, -1

机译：EPaCCS（电子姑息治疗协调系统）速成课程：8年的患者数据共享成功与失败经验可供学习
7. P981Lvot area measurement using gated ct data reclassifies aortic stenosis severity as graded by echocardiographyP982Paradoxical low-flow low-gradient aortic stenosis: an intermediate state between moderate and severe aortic stenosis?P983Can rheumatic significant mitral stenosis be a cause of paradoxical low gradient, low flow, in patients with severe aortic stenosis? an echocardiographic and outcome studyP984Clinical and hemodynamic comparison of isolated versus combined aortic and mitral stenosisP985Echocardiographic end-diastolic velocity in the proximal descending aorta should be interpreted with caution when the ascending aorta is dilated: insights from cardiovascular magnetic resonanceP987Prevalence of atrial mitral regurgitation in patients with severe mitral regurgitationP988Role of 2D/3D echocardiography in the risk stratification of endocardial lead-related tricuspid regurgitation: a single-centre study among?241 patientsP989When TEE is needed in patients with staphylococcus aureus bacteremia for the assessment of risk profile of infective endocarditis?P990Appropriateness criteria to echocardiograms for suspected infective endocarditis: experience of a tertiary referral centerP991Independent predictors of outcome in infective endocarditisP992The role of transesophageal cardiography in clinical course and prognosis of complicated infective endocarditis in critically ill patients: our 15 years experienceP993Left bundle branch block atypical pattern as a prognostic determinant in patients taken to TAVIP994Efficacy of long-term ivabradine therapy in severe systolic chronic heart failure patients with and without type 2 diabetes mellitusP995Relations between left ventricular reverse remodeling and serum markers of extracellular matrix fibrosis in dilated cardiomyopathyP996The healthy left ventricle accommodates an increasing vortex formation time for volume transfer in diastolic filling :Implications for heart failureP997Evolutionary changes of pulmonary artery pressure after left ventricular assist device implantP998Functional correlates and prognostic value of coronary flow velocity reserve by vasodilator stress echocardiography in hypertrophic cardiomyopathyP999Quantification of myocardial performance in patients with non-obstructive versus latent-obstructive hypertrophic cardiomyopathyP1000Lifelong arrhythmic risk stratification in arrhythmogenic right ventricular cardiomyopathy: distribution of events and impact of periodical reassessmentP1001Impact of fibrosis visualized by CMR in vectorcardiogram recordings of patients with suspected arrhythmogenic cardiomyopathyP1002Determinants of the beneficial effect of aldosterone antagonism on exercise capacity in heart failure with reduced ejection fractionP1003Myocardial strain values in patients with acute myocarditis and preserved ejection fraction. A magnetic resonance feature tracking studyP1004Detection of subclinical left ventricular dysfunction by speckle tracking echocardiography in patients with myocarditis without prominent wall motion abnormalitiesP1005Aborted sudden cardiac death patients aged <50 years show only mild alterations on cardiac magnetic resonance imagingP1006Relationships between subepicardial and subendocardial longitudinal strain with late gadolinium enhancement in uncomplicated hypertensive patients [O] . L. Moderato, C. Di Nora, A. Soufiani, 2016

机译：P981LVOT区域测量使用门控CT数据重新分类主动脉狭窄的严重程度，以超声心动图7982分类为分类，如二醇的低流量低梯度主动脉狭窄：中度和严重主动脉狭窄之间的中间状态？P983CAN风湿显着二尖瓣狭窄是矛盾的低梯度，低流量的原因在严重主动脉狭窄的患者中？超声心动图和结合分离的主动脉和二尖瓣术和二尖瓣狭窄的血液动力学比较的超声心动图和血液动力学比较在近期下降主动脉中应当谨慎地解释升高的主动脉：从心血管磁共振的洞察中的心血管磁共振PREValence在严重的患者中的洞察中解释二尖瓣regurgitationP988 rool 2D / 3D超声心动图在内膜内铅相关三尖瓣反流的风险分层：241例患者中的单一学习，在葡萄球菌的患者中需要TEE，用于评估感染性心内炎的风险概况？P990姑息度标准怀疑感染心内膜炎的超声心动图：第三节推荐中心的经验，感染endocardisp992在感染性Endocardisap999中的临床过程中的作用和复杂感染的预后的作用生病患者的心内膜炎：我们的15年经验训练束分支块的非典型模式作为患者的预后决定因素，以TaviP994患者在严重的收缩期慢性心力衰竭患者中患者，无型糖尿病患者左心室反向重塑和血清基质纤维化的血清标志物在扩张心肌脑肿瘤中，健康的左心室容纳舒张填充中体积转移的增加的涡旋形成时间：对左心室辅助装置Implantp998函数相关和冠状动脉速率储备的肺动脉压的肺动脉压的影响。血管扩张器应力超声心动图在肥厚性心肌病型499中，非阻塞性患者心肌表现与潜在阻塞性肥厚性心肌病的患者患者患者患者患者血小板治疗1000Lifelong心律失常风险Strati心律病学右心室心肌病的发动机：CMR患者血管瘤术治疗患者血管动脉瘤患者血管诊断患者血管心目记录中CMR的纤维化术治疗的事件和影响患有急性心肌炎和保存的喷射分数。磁共振特征跟踪STOPYP1004DETTECTECTECTET通过突出壁运动患者的斑点左心室功能障碍的亚临床左心室功能障碍，没有突出的壁运动异常，P1005aborted突发的心脏死亡患者<50岁的突然性心脏死亡患者只显示心脏磁共振术中的轻度改变，钆和肾外腺纵向应变之间的心脏磁共振成像P1006相关性简单的高血压患者增强

What Can We Learn from Four Years of Data Center Hardware Failures?

摘要

著录项

相似文献

相关主题

期刊订阅