RetailLarge E-Commerce Player

Recommendation Engine at Scale

12 weeks

Recommendation Engine at Scale

Challenge

The e‑commerce team had a basic “customers also bought” setup. They wanted personalised recommendations on homepage, category, and cart—with sub-200ms latency at peak (Diwali-level traffic)—and the ability to A/B test models without engineering bottlenecks.

Solution

We built a recommendation service that combines collaborative filtering, item embeddings, and business rules. We served it from a low-latency API with caching and fallbacks so the site never blocks on recommendations. We plugged in an experimentation framework so product and data science can ship new models and measure impact without touching the core pipeline.

What we did

Designed real-time and batch pipelines for embeddings and signals
Built recommendation API with caching and sub-200ms p99 latency
Integrated A/B testing and metrics for click-through and conversion
Handed over MLOps and model refresh process to client team

Results

18%
lift in click-through on recommendations
<200ms
p99 latency at peak traffic
2x
experiments per quarter
We went from generic blocks to personalised recommendations in three months. Latency held through Diwali and our data science team can now ship experiments without waiting on engineering.
Vikram ReddyHead of Data & ML

Start a project

Explore more work or get in touch.