3 Phased Approach to Delivering a Lakehouse with Data Mesh Ant106

Title

AWS re:Invent 2023 - 3-phased approach to delivering a lakehouse with data mesh (ANT106)

Summary

  • The speaker has extensive experience in data analytics and security, having worked for companies like Nike, Zendesk, and at the NSA.
  • They advocate for understanding the past and present to define the future, respecting operational systems while driving transformation.
  • The talk emphasizes the need for better decisions, faster access to resources, and customer demands for real-time experiences.
  • The speaker discusses the evolution from traditional data warehouses to cloud data warehouses and the emergence of the lakehouse concept, crediting Databricks for coining the term.
  • The lakehouse model is praised for promoting open architecture and flexibility in data management.
  • Data mesh is introduced as a way to empower business users and simplify data engineering tasks.
  • The speaker outlines a three-phased approach to implementing a lakehouse with data mesh:
    1. Unified data access through virtualization.
    2. Modern data stack integration for improved performance and cost savings.
    3. Enterprise mesh or fabric for open architecture and flexibility.
  • Customer examples are provided to illustrate the benefits of this approach, including cost savings, performance improvements, and enabling new capabilities.
  • The speaker concludes by encouraging attendees to try out Dremio's free Lakehouse option and to discuss further at their booth.

Insights

  • The transition from traditional data warehouses to cloud-based solutions like Snowflake and Databricks has been driven by the need for scalability, cloud agnosticism, and separation of storage and compute.
  • The lakehouse architecture is gaining traction because it combines the best features of data lakes and data warehouses, emphasizing openness and flexibility.
  • Data mesh is a concept that decentralizes data ownership and governance, allowing for more agile and responsive data management.
  • The speaker's three-phased approach suggests that organizations should start with data virtualization, then move to modernizing their data stack, and finally aim for an open and flexible enterprise mesh.
  • Real-world examples from companies like MSK, TransUnion, 7.11, and Amazon demonstrate the tangible benefits of adopting a lakehouse with data mesh, including significant cost savings and operational efficiencies.
  • The speaker's experience and the customer stories shared indicate a strong market validation for the lakehouse and data mesh approach, suggesting it could become a standard practice in data management and analytics.