Expect the Unexpected and Build Resilience with AWS (PEX116)

Title

AWS re:Invent 2023 - Expect the unexpected and build resilience with AWS (PEX116)

Summary

  • The talk covered AWS resilience best practices and how AWS Resilience partners can help customers meet their resilience goals.
  • The speaker emphasized the importance of planning for unexpected failures and designing systems to recover with minimal impact on end-users.
  • AWS applies a shared responsibility model to resilience: AWS is responsible for the resilience of the cloud infrastructure, while customers and partners are responsible for the resilience of the applications they run on it.
  • Downtime can be costly, with impacts including loss of revenue, brand damage, productivity disruption, and regulatory penalties.
  • Possible causes of downtime range from more probable events like code deployments and configuration errors to less likely scenarios like natural disasters.
  • AWS has developed mechanisms, processes, and frameworks to guide resilience, including the resilience lifecycle framework.
  • The resilience lifecycle framework involves setting objectives, designing for high availability or disaster recovery, evaluating and testing, operating, and learning from incidents to improve.
  • AWS offers a suite of services to support building resilient workloads.
  • Partners can benefit by enhancing customer satisfaction, differentiating their offerings, and increasing revenue opportunities by qualifying for RFPs that require high resilience levels.
  • Attendees were invited to visit the AWS partner resilience booth and to attend related sessions on resilience at re:Invent.
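
The "setting objectives" stage and the cost-of-downtime point above can be made concrete with a little arithmetic. The following is a minimal sketch, not something presented in the talk: it converts an availability objective into a yearly downtime budget and multiplies that by an hourly cost. The function names and the cost-per-hour figure are hypothetical, chosen only for illustration.

```python
HOURS_PER_YEAR = 365 * 24  # 8,760 hours; leap years are ignored in this sketch


def downtime_budget_hours(availability_pct: float) -> float:
    """Return the hours of downtime per year that an availability objective allows."""
    if not 0.0 <= availability_pct <= 100.0:
        raise ValueError("availability_pct must be between 0 and 100")
    return HOURS_PER_YEAR * (1.0 - availability_pct / 100.0)


def downtime_cost(availability_pct: float, cost_per_hour: float) -> float:
    """Estimate the yearly cost if the whole downtime budget is consumed."""
    return downtime_budget_hours(availability_pct) * cost_per_hour


if __name__ == "__main__":
    # Hypothetical figure: assume one hour of downtime costs 10,000 (any currency).
    assumed_cost_per_hour = 10_000.0
    for target in (99.0, 99.9, 99.99):
        hours = downtime_budget_hours(target)
        cost = downtime_cost(target, assumed_cost_per_hour)
        print(f"{target}% availability: {hours:.2f} h/year, up to {cost:,.0f} per year")
```

Under these assumptions, a 99.9% objective leaves roughly 8.76 hours of downtime per year. Comparing that budget against a realistic hourly cost is one way to decide how much to invest in high availability or disaster recovery.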

Insights

  • Resilience in cloud computing is not just about the robustness of the infrastructure but also involves application design, deployment practices, and operational procedures.
  • Under the AWS shared responsibility model, AWS ensures the resilience of the physical infrastructure, while customers and partners must actively design and manage their applications to be resilient.
  • The cost of downtime is significant and varies by industry, highlighting the need for businesses to invest in resilience to avoid financial and reputational damage.
  • The AWS resilience lifecycle framework is a structured approach that helps organizations systematically improve their resilience posture.
  • AWS's suite of services and the Amazon Builders' Library provide tools and best practices for partners and customers to build resilient applications.
  • The talk suggests a growing market for AWS partners who specialize in resilience, indicating a business opportunity for service providers in this niche.
  • The availability of over 100 sessions on resilience at re:Invent underscores the importance of this topic and AWS's commitment to educating customers and partners on resilience best practices.