Limitations and Challenges in Vertex AI for Industrial-Scale ML – Batch Predictions
15 May 2024
This post is part of a series on Vertex AI and the limitations I encountered while using it to serve ML models at production scale. If you haven’t read the first post on Online Predictions, that’s a good starting point.
In this second post, I’ll focus on batch predictions, a core component of many real-world ML workflows, especially in industrialized R&D contexts like drug discovery. Think screening >10M molecules every other week. When inference isn’t time-sensitive, batch jobs are supposed to reduce overhead and costs.
In theory, batch predictions on Vertex AI should make that easy. In practice, they came with a surprising number of constraints and quirks.
Let’s walk through them.
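Before getting into the details, here’s roughly what the happy path looks like with the Vertex AI Python SDK. This is a minimal sketch: the project, region, model ID, bucket paths, machine type, and replica counts below are all placeholders, not values from my actual setup.

```python
from google.cloud import aiplatform

# Placeholder project, region, and model resource name -- substitute your own.
aiplatform.init(project="my-project", location="europe-west1")
model = aiplatform.Model("projects/my-project/locations/europe-west1/models/1234567890")

# Launch an asynchronous batch prediction job over a JSONL file in GCS.
batch_job = model.batch_predict(
    job_display_name="molecule-screening",
    gcs_source="gs://my-bucket/inputs/molecules.jsonl",
    gcs_destination_prefix="gs://my-bucket/outputs/",
    instances_format="jsonl",
    predictions_format="jsonl",
    machine_type="n1-standard-4",
    starting_replica_count=1,
    max_replica_count=10,
    sync=False,  # return immediately; wait on the job later
)

batch_job.wait()  # block until the job finishes
print(batch_job.state, batch_job.output_info)
```

On paper this is appealing: you point the job at a file in Cloud Storage, Vertex AI spins up workers, writes the predictions back to your bucket, and tears everything down, with no endpoint to keep alive. The quirks start once you go beyond this toy-sized happy path.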