A scalable machine learning model serving infrastructure on Kubernetes using the Kserve framework. It is part of a broader project aimed to modernize machine learning at Wikimedia.