Enhancing Relevance of Embedding-based Retrieval at Walmart

Read original: arXiv:2408.04884 - Published 8/16/2024 by Juexin Lin, Sachin Yadav, Feng Liu, Nicholas Rossi, Praveen R. Suram, Satya Chembolu, Prijith Chandran, Hrushikesh Mohapatra, Tony Lee, Alessandro Magnani, Ciya Liao

Overview

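  • Presents methods for improving the relevance of embedding-based retrieval (EBR) in Walmart's product search
  • Attributes relevance degradation to false positives and negatives in the training data and to unhandled query misspellings
  • Introduces a Relevance Reward Model (RRM), typo-aware training, and semi-positive generation to better match user queries with relevant products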

Plain English Explanation

The paper discusses ways to improve the accuracy and relevance of product search at Walmart, a major e-commerce retailer. Specifically, it explores methods to enhance the performance of semantic search, which relies on embedding models to match user queries with related products.

Semantic search aims to understand the intent and meaning behind a user's search, rather than just looking for an exact keyword match. This can help surface more relevant results, even if the query doesn't perfectly match the product description. However, the researchers found that standard embedding-based retrieval models sometimes struggle to identify the most pertinent products.
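The matching step described above can be illustrated with a minimal sketch. It assumes queries and products have already been turned into vectors by some encoder; the three-dimensional vectors, product names, and the `retrieve` helper are hypothetical and are not taken from the paper.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def retrieve(query_vec, product_vecs, k=2):
    """Return the k product ids whose embeddings are most similar to the query."""
    scored = [(cosine(query_vec, vec), pid) for pid, vec in product_vecs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [pid for _, pid in scored[:k]]

# Hypothetical toy embeddings; a real system would use a trained encoder.
PRODUCTS = {
    "running shoes": [0.9, 0.1, 0.0],
    "trail sneakers": [0.8, 0.2, 0.1],
    "coffee maker": [0.0, 0.1, 0.9],
}
QUERY = [0.85, 0.15, 0.05]  # stands in for the embedding of "jogging footwear"

top = retrieve(QUERY, PRODUCTS, k=2)
```

Neither footwear product shares a keyword with "jogging footwear", yet both rank above the coffee maker because their vectors point in a similar direction. Production systems typically replace the exhaustive scan with an approximate nearest-neighbor index.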

To address this, the paper introduces several techniques to improve the relevance of the search results. According to the abstract, these include a Relevance Reward Model (RRM) built from human relevance feedback, which is used both to remove noise from the training data and to guide the retrieval model during training. The researchers also train the model to cope with misspelled queries (typo-aware training) and use a technique they call semi-positive generation.

By enhancing the embedding-based retrieval system in these ways, the paper demonstrates how Walmart can deliver more relevant search results, ultimately improving the shopping experience for its customers.
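One cause of poor results named in the paper's abstract is misspelled queries, which it addresses with typo-aware training. The abstract does not say how the misspellings are produced, so the following is only a sketch of one plausible approach: augmenting training pairs with randomly corrupted copies of each query. The `add_typo` and `augment` helpers and the `typo_rate` parameter are hypothetical.

```python
import random

def add_typo(query, rng):
    """Apply one random character-level edit: swap, drop, or duplicate."""
    if len(query) < 2:
        return query
    i = rng.randrange(len(query) - 1)
    edit = rng.choice(["swap", "drop", "duplicate"])
    if edit == "swap":
        # Exchange the characters at positions i and i + 1.
        return query[:i] + query[i + 1] + query[i] + query[i + 2:]
    if edit == "drop":
        # Remove the character at position i.
        return query[:i] + query[i + 1:]
    # Duplicate the character at position i.
    return query[:i] + query[i] + query[i:]

def augment(pairs, typo_rate=0.5, seed=0):
    """Return the original (query, product) pairs plus misspelled copies.

    Each misspelled query keeps the product label of its clean original,
    so the model sees that both spellings should retrieve the same item.
    """
    rng = random.Random(seed)
    out = list(pairs)
    for query, product in pairs:
        if rng.random() < typo_rate:
            out.append((add_typo(query, rng), product))
    return out
```

Calling `augment([("wireless headphones", "p1")], typo_rate=1.0)` returns the clean pair followed by one corrupted variant that still points at `"p1"`.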

Technical Explanation

The paper builds on Walmart's earlier EBR system, whose initial launch yielded significant gains in relevance and add-to-cart rates. The authors nevertheless observed numerous instances of relevance degradation, which they attribute to false positives and false negatives in the training data and to the model's inability to handle query misspellings.

To enhance the relevance of their embedding-based retrieval, the researchers present several components:

  1. Relevance Reward Model (RRM): A model based on human relevance feedback. It is used to remove noise from the training data, and it is distilled into the EBR model through a multi-objective loss.

  2. Typo-Aware Training: A training technique aimed at query misspellings, which the authors identify as one cause of relevance degradation.

  3. Semi-Positive Generation: A further technique the authors list for increasing the performance of the EBR model.
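Filtering by a relevance score, as described in the list above, can be sketched as follows. This is a minimal illustration and not the paper's model: `token_overlap_score` is a toy stand-in for a learned relevance scorer, and the threshold value is an arbitrary assumption.

```python
def filter_by_relevance(pairs, score_fn, threshold=0.5):
    """Keep only (query, product) pairs whose score meets the threshold."""
    return [(q, p) for q, p in pairs if score_fn(q, p) >= threshold]

def token_overlap_score(query, product):
    """Toy scorer: fraction of query tokens that appear in the product title.

    A hypothetical placeholder for a learned relevance model.
    """
    q_tokens = query.lower().split()
    p_tokens = set(product.lower().split())
    if not q_tokens:
        return 0.0
    return sum(t in p_tokens for t in q_tokens) / len(q_tokens)

pairs = [
    ("red running shoes", "Red Running Shoes for Men"),
    ("red running shoes", "Red Wine Glasses"),
]
kept = filter_by_relevance(pairs, token_overlap_score, threshold=0.5)
```

Here the wine glasses share only the word "red" with the query, so that pair falls below the threshold and is dropped. The same pattern applies whether the pairs are noisy training examples or retrieved candidates.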

The effectiveness of these techniques is demonstrated through offline relevance evaluation, online A/B tests, and deployment to live production. The researchers also discuss several potential limitations and areas for future research, such as incorporating more advanced language modeling approaches and exploring alternative fine-tuning strategies.
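The paper's abstract mentions distilling a Relevance Reward Model into the retrieval model through a multi-objective loss, but does not give the formula. The sketch below shows one common way such a loss can be put together, under stated assumptions: a softmax cross-entropy term that rewards ranking the engaged product first, plus a mean-squared-error term that pulls the retrieval model's scores toward the teacher's scores. The function names and the `alpha` weight are hypothetical.

```python
import math

def softmax_cross_entropy(scores, positive_index):
    """Negative log-probability of the positive product under a softmax."""
    max_s = max(scores)  # subtract the max for numerical stability
    log_sum = max_s + math.log(sum(math.exp(s - max_s) for s in scores))
    return log_sum - scores[positive_index]

def distillation_loss(student_scores, teacher_scores):
    """Mean squared error between student scores and teacher scores."""
    n = len(student_scores)
    return sum((s - t) ** 2 for s, t in zip(student_scores, teacher_scores)) / n

def multi_objective_loss(student_scores, positive_index, teacher_scores, alpha=0.5):
    """Weighted sum of the retrieval term and the distillation term.

    alpha is an assumed mixing weight: 0 uses only the retrieval term,
    1 uses only the distillation term.
    """
    retrieval = softmax_cross_entropy(student_scores, positive_index)
    distill = distillation_loss(student_scores, teacher_scores)
    return (1 - alpha) * retrieval + alpha * distill
```

The design intent of a combined loss like this is that engagement labels and the teacher's relevance judgments correct each other: a clicked but irrelevant product is penalized by the distillation term even though the retrieval term rewards it.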

Critical Analysis

The paper presents a well-designed study of how to enhance the relevance of embedding-based product search at Walmart. The proposed techniques, such as the Relevance Reward Model and typo-aware training, target concrete failure modes (noisy training labels and misspelled queries) that large-scale e-commerce platforms face.

One potential area for further exploration is the integration of more advanced language models, such as transformer-based architectures, which have shown impressive performance in tasks like semantic search and text understanding. Combining the strengths of embedding-based retrieval with the capabilities of large language models could lead to even more accurate and contextual search results.

Additionally, the paper could have discussed the potential ethical implications of the proposed techniques, particularly algorithmic bias and the impact on user privacy and data governance. As embedding-based systems become more widely adopted, it is crucial to address these concerns.

Overall, the paper presents a valuable contribution to the field of e-commerce search, offering practical insights and techniques that can be leveraged by other online retailers to enhance the relevance and quality of their product search experiences.

Conclusion

This paper introduces several techniques to improve the relevance of embedding-based product search at Walmart, a major e-commerce retailer. By using a Relevance Reward Model to clean the training data and guide the retrieval model, and by adding typo-aware training and semi-positive generation, the researchers report improved retrieval relevance in offline evaluation and online A/B tests.

The proposed methods demonstrate the potential for enhancing semantic search capabilities in large-scale e-commerce platforms, ultimately leading to a more seamless and satisfactory shopping experience for customers. As the field of information retrieval continues to evolve, this work offers a valuable blueprint for leveraging advanced techniques to deliver more relevant and contextual search outcomes.



Related Papers

Enhancing Relevance of Embedding-based Retrieval at Walmart

Juexin Lin, Sachin Yadav, Feng Liu, Nicholas Rossi, Praveen R. Suram, Satya Chembolu, Prijith Chandran, Hrushikesh Mohapatra, Tony Lee, Alessandro Magnani, Ciya Liao

Embedding-based neural retrieval (EBR) is an effective search retrieval method in product search for tackling the vocabulary gap between customer search queries and products. The initial launch of our EBR system at Walmart yielded significant gains in relevance and add-to-cart rates [1]. However, despite EBR generally retrieving more relevant products for reranking, we have observed numerous instances of relevance degradation. Enhancing retrieval performance is crucial, as it directly influences product reranking and affects the customer shopping experience. Factors contributing to these degradations include false positives/negatives in the training data and the inability to handle query misspellings. To address these issues, we present several approaches to further strengthen the capabilities of our EBR model in terms of retrieval relevance. We introduce a Relevance Reward Model (RRM) based on human relevance feedback. We utilize RRM to remove noise from the training data and distill it into our EBR model through a multi-objective loss. In addition, we present the techniques to increase the performance of our EBR model, such as typo-aware training, and semi-positive generation. The effectiveness of our EBR is demonstrated through offline relevance evaluation, online AB tests, and successful deployments to live production. [1] Alessandro Magnani, Feng Liu, Suthee Chaidaroon, Sachin Yadav, Praveen Reddy Suram, Ajit Puthenputhussery, Sijie Chen, Min Xie, Anirudh Kashi, Tony Lee, et al. 2022. Semantic retrieval at walmart. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3495-3503.

8/16/2024

Event-enhanced Retrieval in Real-time Search

Yanan Zhang, Xiaoling Bai, Tianhua Zhou

The embedding-based retrieval (EBR) approach is widely used in mainstream search engine retrieval systems and is crucial in recent retrieval-augmented methods for eliminating LLM illusions. However, existing EBR models often face the semantic drift problem and insufficient focus on key information, leading to a low adoption rate of retrieval results in subsequent steps. This issue is especially noticeable in real-time search scenarios, where the various expressions of popular events on the Internet make real-time retrieval heavily reliant on crucial event information. To tackle this problem, this paper proposes a novel approach called EER, which enhances real-time retrieval performance by improving the dual-encoder model of traditional EBR. We incorporate contrastive learning to accompany pairwise learning for encoder optimization. Furthermore, to strengthen the focus on critical event information in events, we include a decoder module after the document encoder, introduce a generative event triplet extraction scheme based on prompt-tuning, and correlate the events with query encoder optimization through comparative learning. This decoder module can be removed during inference. Extensive experiments demonstrate that EER can significantly improve the real-time search retrieval performance. We believe that this approach will provide new perspectives in the field of information retrieval. The codes and dataset are available at https://github.com/open-event-hub/Event-enhanced_Retrieval .

4/10/2024

Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval

Ben Chen, Huangyu Dai, Xiang Ma, Wen Jiang, Wei Ning

Semantic relevance calculation is crucial for e-commerce search engines, as it ensures that the items selected closely align with customer intent. Inadequate attention to this aspect can detrimentally affect user experience and engagement. Traditional text-matching techniques are prevalent but often fail to capture the nuances of search intent accurately, so neural networks now have become a preferred solution to processing such complex text matching. Existing methods predominantly employ representation-based architectures, which strike a balance between high traffic capacity and low latency. However, they exhibit significant shortcomings in generalization and robustness when compared to interaction-based architectures. In this work, we introduce a robust interaction-based modeling paradigm to address these shortcomings. It encompasses 1) a dynamic length representation scheme for expedited inference, 2) a professional terms recognition method to identify subjects and core attributes from complex sentence structures, and 3) a contrastive adversarial training protocol to bolster the model's robustness and matching capabilities. Extensive offline evaluations demonstrate the superior robustness and effectiveness of our approach, and online A/B testing confirms its ability to improve relevance in the same exposure position, resulting in more clicks and conversions. To the best of our knowledge, this method is the first interaction-based approach for large e-commerce search relevance calculation. Notably, we have deployed it for the entire search traffic on alibaba.com, the largest B2B e-commerce platform in the world.

6/5/2024

Hierarchical Structured Neural Network for Retrieval

Kaushik Rangadurai, Siyang Yuan, Minhui Huang, Yiqun Liu, Golnaz Ghasemiesfeh, Yunchen Pu, Xinfeng Xie, Xingfeng He, Fangzhou Xu, Andrew Cui, Vidhoon Viswanathan, Yan Dong, Liang Xiong, Lin Yang, Liang Wang, Jiyan Yang, Chonglin Sun

Embedding Based Retrieval (EBR) is a crucial component of the retrieval stage in (Ads) Recommendation System that utilizes Two Tower or Siamese Networks to learn embeddings for both users and items (ads). It then employs an Approximate Nearest Neighbor Search (ANN) to efficiently retrieve the most relevant ads for a specific user. Despite the recent rise to popularity in the industry, they have a couple of limitations. Firstly, Two Tower model architecture uses a single dot product interaction which despite their efficiency fail to capture the data distribution in practice. Secondly, the centroid representation and cluster assignment, which are components of ANN, occur after the training process has been completed. As a result, they do not take into account the optimization criteria used for retrieval model. In this paper, we present Hierarchical Structured Neural Network (HSNN), a deployed jointly optimized hierarchical clustering and neural network model that can take advantage of sophisticated interactions and model architectures that are more common in the ranking stages while maintaining a sub-linear inference cost. We achieve 6.5% improvement in offline evaluation and also demonstrate 1.22% online gains through A/B experiments. HSNN has been successfully deployed into the Ads Recommendation system and is currently handling major portion of the traffic. The paper shares our experience in developing this system, dealing with challenges like freshness, volatility, cold start recommendations, cluster collapse and lessons deploying the model in a large scale retrieval production system.

8/14/2024