Jpmorgan Chase One Data Platform for Reporting Analytics and Ml Fsi317

Title

AWS re:Invent 2023 - JPMorgan Chase: One data platform for reporting, analytics, and ML (FSI317)

Summary

  • JPMorgan Chase invests $15 billion annually in technology and employs 60,000 engineers.
  • The Asset Management division manages $2.9 trillion and offers over 600 investment strategies.
  • The AMIQ data and analytics platform supports various users and use cases within the organization.
  • The cloud migration was motivated by the need for scalability, velocity, and versatility.
  • The architecture uses AWS components like S3, EMR, and Redshift, along with internally developed capabilities.
  • Data ingestion and quality are critical, with a focus on supporting various formats and sources.
  • Data transformation and reporting are designed for scalability and diversity of data structures.
  • The platform is built with resiliency in mind to handle edge cases and failures.
  • A rules engine was developed to simplify the creation of data transformation pipelines.
  • Data distribution uses Redshift and OpenSearch to serve different consumption use cases.
  • The platform is made reusable for other teams within JPMorgan, like the Private Bank.
  • Cost management and serverless adoption are emphasized for efficiency.
  • The platform has been live for three years, cutting costs in half and enabling faster delivery to business.
  • It has led to $10 billion in additional sales annually through new products and strategies.

Insights

  • JPMorgan Chase's significant investment in technology underscores the importance of data and analytics in the financial industry.
  • The AMIQ platform's ability to support a diverse range of users and use cases demonstrates the flexibility and scalability of cloud-based solutions.
  • The choice of AWS components and the development of internal capabilities highlight the need for a balance between using off-the-shelf services and custom solutions.
  • The focus on data ingestion and quality reflects the critical nature of reliable data for business intelligence and analytics.
  • The use of a rules engine for data transformation pipelines indicates a trend towards low-code solutions that empower both engineers and business users.
  • The platform's design for resiliency and the handling of edge cases is crucial for maintaining uptime and reliability in large-scale systems.
  • The reusability of the platform for other teams within the organization shows the value of designing modular and extensible systems.
  • The emphasis on cost management and the move towards serverless architecture reveal a strategic approach to optimizing cloud resources and expenses.
  • The platform's impact on the business, including significant cost savings and additional sales, illustrates the tangible benefits of investing in a robust data platform.