Search Model Serving Using PyTorch and TorchServe
10 min read · Jan 23, 2023
Walmart Search has begun adopting deep learning across the search ecosystem to improve search relevance. For our pilot use case, we served the computationally intensive BERT Base model at runtime, with the goal of achieving low latency and high throughput.
We built a highly scalable model serving platform to enable fast runtime inferencing using TorchServe for…