Achieving Amazon S3 Data Lake Resilience at LexisNexis (STG101)

Title

AWS re:Invent 2023 - Achieving Amazon S3 data lake resilience at LexisNexis (STG101)

Summary

  • Woon Jung, co-founder and CTO of Clumio, and Mark Sider, a senior consulting software engineer at LexisNexis, discuss the importance of S3 backup and data resiliency.
  • AWS provides infrastructure resiliency, but data resiliency is the customer's responsibility.
  • Reasons for S3 backup include operational recovery, protection against cyberattacks, and compliance requirements.
  • LexisNexis has a serverless architecture with a high rate of content change, requiring robust data resiliency solutions.
  • Clumio's architecture integrates with customer AWS accounts to provide continuous backup and meet stringent recovery point objective (RPO) and recovery time objective (RTO) requirements.
  • Clumio's Instant Access feature allows for rapid restoration of data, significantly reducing RTO.
  • LexisNexis tested Clumio's solution, restoring 26 billion records in under three hours.
  • Clumio's solution is cost-effective and customizable, with the ability to integrate with AWS services like CloudFront and Athena.
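
To put the LexisNexis restore test in perspective, the figures above imply a minimum average throughput. The sketch below is only back-of-the-envelope arithmetic on the two numbers in the summary (26 billion records, under three hours). The function name is hypothetical, and it assumes a uniform restore rate, which the talk summary does not state.

```python
def min_restore_rate(records: int, hours: float) -> float:
    """Average records per second needed to restore `records` within `hours`."""
    return records / (hours * 3600)


RECORDS = 26_000_000_000  # figure quoted in the summary
WINDOW_HOURS = 3          # "under three hours", so the real rate was at least this high

rate = min_restore_rate(RECORDS, WINDOW_HOURS)
print(f"{rate:,.0f} records/second")  # prints "2,407,407 records/second"
```

Because the restore finished in under three hours, roughly 2.4 million records per second is a lower bound on the sustained average, not a measured figure.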

Insights

  • Data resiliency in the cloud is a shared responsibility: AWS ensures infrastructure uptime, while customers must protect their own data.
  • LexisNexis's serverless architecture, which runs without a VPC, is highly scalable and uses Amazon CloudFront for disaster recovery resilience.
  • Clumio's backup solution is designed to be serverless, scalable, and capable of handling high data change rates, which is critical for LexisNexis's large data lake.
  • Clumio's Instant Access feature shortens disaster recovery by letting businesses access backup data almost immediately, without waiting for a full restore.
  • Clumio's ability to quickly adapt and customize its solution to meet specific customer needs, such as optional tagging and selective RPO for certain operations, demonstrates a customer-centric approach.
  • The integration of Clumio's backup solution with Amazon CloudFront enables businesses to maintain service continuity during data restoration, which is crucial for customer trust and revenue protection.
  • Clumio's backup solution not only addresses the need for rapid data recovery but also offers cost savings through intelligent backup management and integration with existing AWS services.