Dev.to Machine Learning | 3h ago | Business & Industry | Products & Services

Reranking 565K Products Using Deep Learning at SeeStocks

SeeStocks, a price comparison engine, built a multi-stage deep learning pipeline to rerank over 565,000 products and improve relevance on their category pages.

Why it matters

This case study demonstrates how deep learning can be effectively applied to improve product search and discovery in large-scale ecommerce platforms.

Key Points

1. Implemented a 3-stage pipeline: candidate retrieval, cross-encoder reranking, and business rules/diversity
2. Leveraged vision-language models, taxonomic distance, and price distribution to score product relevance
3. Faced challenges with flat product taxonomies and built a hierarchical disambiguation layer to improve classification
4. Deployed the pipeline in production, reporting significant improvements in relevance, misclassification rate, and user engagement
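
The summary does not say how taxonomic distance is computed. One common definition, assumed here purely for illustration, is the number of edges on the path between two categories in a category tree. The tree contents and the names PARENT, path_to_root, and taxonomic_distance below are hypothetical and are not taken from SeeStocks.

```python
from typing import Dict, List, Optional

# Hypothetical toy taxonomy stored as child -> parent (None marks the root).
PARENT: Dict[str, Optional[str]] = {
    "root": None,
    "electronics": "root",
    "phones": "electronics",
    "phone_cases": "phones",
    "laptops": "electronics",
    "home": "root",
    "kitchen": "home",
}


def path_to_root(node: str) -> List[str]:
    """Return the chain of categories from `node` up to the root, inclusive."""
    path: List[str] = []
    current: Optional[str] = node
    while current is not None:
        path.append(current)
        current = PARENT[current]
    return path


def taxonomic_distance(a: str, b: str) -> int:
    """Count the edges on the tree path between categories `a` and `b`."""
    # Map every ancestor of `a` (including `a`) to its distance from `a`.
    steps_from_a = {node: i for i, node in enumerate(path_to_root(a))}
    # Walk up from `b`; the first shared ancestor is the lowest common one.
    for steps_from_b, node in enumerate(path_to_root(b)):
        if node in steps_from_a:
            return steps_from_a[node] + steps_from_b
    raise ValueError("categories do not share a root")
```

Under this definition a product filed under "phone_cases" is closer to "phones" than to "kitchen", which is the kind of signal a flat taxonomy cannot provide.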

Details

SeeStocks, a Spanish price comparison platform, manages a catalog of over 565,000 products across multiple retailers. To surface the most relevant products first on its category pages, the team built a multi-stage deep learning pipeline. The first stage runs approximate nearest neighbor search against pre-computed category embeddings to retrieve a broad set of candidate products. The second stage reranks those candidates with a cross-encoder model that weighs visual similarity, taxonomic distance, title-category coherence, and price distribution. The final stage applies business rules such as deduplication, retailer diversity, and freshness decay. A key challenge was the limitations of flat product taxonomies, which the team addressed by building a hierarchical disambiguation layer. The full pipeline runs on a single GPU server with under 200ms end-to-end latency, and the team reports significant improvements in relevance, misclassification rate, and user engagement metrics.
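
The three stages described above can be sketched in a few dozen lines. This is a minimal illustration under stated assumptions, not the SeeStocks implementation: exhaustive cosine search stands in for the approximate nearest neighbor index, `score_fn` is a placeholder for the cross-encoder, and the `Product` fields, the 30-day half-life, and the per-retailer cap are all hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Set, Tuple


@dataclass(frozen=True)
class Product:
    product_id: str
    retailer: str
    dedup_key: str                 # hypothetical: e.g. a normalized title
    embedding: Tuple[float, ...]
    age_days: float                # days since the listing was last refreshed


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def retrieve(category_vec: Sequence[float], catalog: List[Product], k: int) -> List[Product]:
    """Stage 1: top-k by cosine similarity (exhaustive stand-in for ANN search)."""
    ranked = sorted(catalog, key=lambda p: cosine(category_vec, p.embedding), reverse=True)
    return ranked[:k]


def rerank(candidates: List[Product],
           score_fn: Callable[[Product], float]) -> List[Tuple[float, Product]]:
    """Stage 2: score each candidate; score_fn is a placeholder for the cross-encoder."""
    scored = [(score_fn(p), p) for p in candidates]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


def apply_business_rules(scored: List[Tuple[float, Product]],
                         half_life_days: float = 30.0,
                         max_per_retailer: int = 2) -> List[Product]:
    """Stage 3: freshness decay, then deduplication and a per-retailer cap."""
    # Assumed decay form: the score halves every `half_life_days`.
    decayed = sorted(
        ((s * 0.5 ** (p.age_days / half_life_days), p) for s, p in scored),
        key=lambda pair: pair[0],
        reverse=True,
    )
    seen_keys: Set[str] = set()
    per_retailer: Dict[str, int] = {}
    result: List[Product] = []
    for _, product in decayed:
        if product.dedup_key in seen_keys:
            continue  # the same item was already kept
        if per_retailer.get(product.retailer, 0) >= max_per_retailer:
            continue  # this retailer already fills its share of the page
        seen_keys.add(product.dedup_key)
        per_retailer[product.retailer] = per_retailer.get(product.retailer, 0) + 1
        result.append(product)
    return result
```

Keeping the cheap retrieval step separate from the expensive scoring step is what lets a pipeline like this score only a small candidate set per category rather than the whole catalog.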


