Search Model Serving Using PyTorch and TorchServe

Pankaj Takawale
Walmart Global Tech Blog
10 min read · Jan 23, 2023

Search Model Serving GPU Metrics

Walmart Search has begun adopting deep learning across the search ecosystem to improve search relevance. For our pilot use case, we served the computationally intensive BERT Base model at runtime, with the objective of achieving low latency and high throughput.

We built a highly scalable model serving platform to enable fast runtime inferencing using TorchServe for…

Pankaj Takawale

Leading Machine Learning Model Serving Platform & Query Understanding at Walmart Search. Passionate about engineering excellence on large distributed systems.