How Fetch Built World Class Ml Models to Power Their Business Seg301

Title

AWS re:Invent 2023 - How Fetch built world-class ML models to power their business (SEG301)

Summary

  • Fetch is a premier D&B customer of AWS, offering a consumer rewards app that processes millions of receipts daily.
  • The presentation covers the technical aspects of machine learning, focusing on model development and deployment in the cloud.
  • Fetch transitioned from using third-party components to developing an in-house machine learning team to handle document scanning and fraud detection.
  • The company emphasizes the importance of team structure, with cross-functional, independent project teams for better integration and deployment of ML models.
  • Key projects include fraud detection, document understanding, and product intelligence.
  • The presentation highlights the use of Streamlit for building full-stack demos, the importance of domain-driven design, and the use of shadow pipelines for safe deployment.
  • Scaling with model servers is discussed, emphasizing the need for auto-scaling based on requests per second.
  • The importance of data quality, annotation, and stakeholder engagement is stressed for successful ML projects.
  • Fetch's future directions include improving personalization, discovery, recommendation systems, stateful ML services, and generative AI.

Insights

  • Fetch's transition to an in-house ML team reflects a broader trend of companies seeking greater control and customization over their core business processes.
  • The use of Streamlit for prototyping and the emphasis on domain-driven design suggest a pragmatic approach to ML development, focusing on business needs rather than just technical capabilities.
  • Shadow pipelines and the separation of ML endpoints from backend services demonstrate a mature approach to deploying ML models, ensuring minimal disruption to existing systems.
  • The discussion on scaling challenges with model servers and the recommendation to auto-scale by requests per second provide practical advice for ML engineers facing similar deployment issues.
  • The emphasis on data quality and annotation highlights the often-underestimated importance of data preparation in ML projects, which can significantly impact model performance.
  • Fetch's future focus on personalization and recommendation systems indicates a strategic move to leverage their unique dataset to enhance user experience and business value.
  • The mention of generative AI and vector databases suggests Fetch is keeping pace with cutting-edge ML technologies, potentially offering innovative solutions to complex data tasks.