From Rdbms to Nosql Prt314

Title

AWS re:Invent 2022 - From RDBMS to NoSQL (PRT314)

Summary

  • Rick Houlihan, Director of Developer Relations for Strategic Accounts at MongoDB, discusses the transition from relational databases (RDBMS) to NoSQL.
  • Amazon's journey from Oracle RDBMS to NoSQL is highlighted, emphasizing the need for efficiency, cost reduction, and database freedom.
  • The talk covers the process Amazon went through during its migration project, Rolling Stone, which involved deprecating 3,000 Oracle server instances.
  • Houlihan explains the mathematical reasoning behind choosing NoSQL over RDBMS, focusing on data modeling, relationships, and time complexity.
  • The session includes a breakdown of how to model relationships in NoSQL databases and the importance of understanding access patterns for developers.
  • Houlihan also discusses the history of data processing, the limitations of Moore's Law, and the advantages of NoSQL databases for both OLTP and OLAP workloads.
  • The talk concludes with a detailed example of data modeling for an insurance company portal, demonstrating the use of MongoDB's features to optimize for performance and cost.

Insights

  • Amazon's decision to move away from Oracle RDBMS was driven by the high cost of infrastructure and the need for a more efficient and scalable solution.
  • NoSQL databases offer a way to model data that reduces time complexity and infrastructure costs, especially for high-velocity access patterns.
  • The transition to NoSQL requires a deep understanding of data modeling and access patterns, which can be significantly different from those used in RDBMS.
  • NoSQL databases are not just for non-relational data; they can efficiently handle relational data and complex relationships through proper data modeling techniques.
  • The talk emphasizes the importance of developer education and enablement in successfully adopting NoSQL technologies, as well as the use of tools like MongoDB's Schema Bench for performance testing.
  • Houlihan's approach to data modeling in NoSQL databases involves optimizing for the most common access patterns while accommodating less frequent ones, using patterns like extended reference and outlier patterns.
  • The session underscores the importance of using MongoDB's default index (_id) effectively and the concept of minimum unit of cost for optimizing write operations.