Please note that this seminar is now WEBEX participation only!
Memory Errors in Production Systems – Insights from the Field
Speaker: Dr. Sudhanva Gurumurthi, Principal Member of Technical Staff, AMD
Friday, August 28, 2020 at 12PM – 1PM PDT
Abstract: Memory reliability is important for the correct operation of computing systems. While technology scaling has paved the way for improvements in the capacity and energy-efficiency of memory, the reliability aspects of such scaling must be well characterized and addressed in the design of computer hardware. AMD has collected and analyzed memory reliability data from several production systems running in data centers. This data spans several generations of DRAM technologies, as well as SRAM. This talk will first explain how bit-cell reliability can impact on the design and use of computing hardware and highlight the importance of studying memory faults from commercial hardware in the field. The talk will then present memory reliability data and insights from AMD's field studies and discuss their implications from the viewpoint architecting resilient systems.
Speaker Bio: Sudhanva Gurumurthi is a Principal Member of the Technical Staff at AMD, where he leads advanced development in Reliability, Availability, and Serviceability (RAS). He used to be an Associate Professor with tenure in the Computer Science Department at the University of Virginia. Sudhanva is a recipient of the NSF CAREER Award, a Google Focused Research Award, two Google Faculty Research Awards, and other NSF and industry awards. He is a Senior Member of the IEEE and the ACM.
Subscribe or Invite your friends to sign up for our mailing list and get to hear about exciting electron-device relevant talks. We promise no spam and try to minimize email. You can unsubscribe easily.
To unsubscribe from the EDS-CHAP-SCV list, click the following link: https://listserv.ieee.org/cgi-bin/wa?SUBED1=EDS-CHAP-SCV&A=1