Vertex AI


  Project Description
  Blog Post

Technologies

  • Vertex AI SDK for Python (google-cloud-aiplatform)
  • Codefresh Pipelines
  • GitHub Actions
  • FastAPI
  • Docker

Summary

Evaluated and prototyped Vertex AI as a platform for serving machine learning models in production, focusing on automated model deployment, online inference performance, and infrastructure scalability. The findings informed the decision to move to Anyscale, which reduced projected infrastructure costs by roughly $40K per month.
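
As an illustration of what measuring online inference performance can look like, here is a minimal sketch of a latency harness. It is not the code used in the evaluation: measure_latency and fake_predict are hypothetical names, and fake_predict is a local stand-in for a call to a deployed endpoint (for example, a thin wrapper around the SDK's Endpoint.predict).

```python
import math
import statistics
import time
from typing import Callable, Sequence


def measure_latency(predict: Callable[[dict], object],
                    payloads: Sequence[dict],
                    warmup: int = 3) -> dict:
    """Time each call to `predict` and summarize latency in milliseconds."""
    if not payloads:
        raise ValueError("payloads must not be empty")

    # Warm-up calls are not timed, so cold-start cost does not skew the summary.
    for payload in payloads[:warmup]:
        predict(payload)

    samples_ms = []
    for payload in payloads:
        start = time.perf_counter()
        predict(payload)
        samples_ms.append((time.perf_counter() - start) * 1000.0)

    samples_ms.sort()
    # Nearest-rank 95th percentile over the sorted samples.
    p95_index = max(0, math.ceil(0.95 * len(samples_ms)) - 1)
    return {
        "count": len(samples_ms),
        "p50_ms": statistics.median(samples_ms),
        "p95_ms": samples_ms[p95_index],
        "max_ms": samples_ms[-1],
    }


def fake_predict(payload: dict) -> dict:
    """Hypothetical stand-in for a remote prediction call (assumption)."""
    time.sleep(0.001)  # simulate a small, fixed amount of work
    return {"prediction": len(payload)}


if __name__ == "__main__":
    report = measure_latency(fake_predict, [{"x": i} for i in range(20)])
    print(report)
```

Replacing fake_predict with a function that calls a real endpoint would give p50/p95 figures for that endpoint. Sequential timing like this says nothing about throughput under concurrent load, which would need a separate test.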

What did I do?

  • Assessed and de-risked ML inference platforms by evaluating Vertex AI for production use.
  • Identified key limitations for large-scale inference.
  • Authored a report that led the company to pivot to Anyscale before full deployment, avoiding a costly platform mismatch and saving ~$40K per month in infrastructure costs.