Generative Retrieval with Preference Optimization for E-commerce Search

Read original: arXiv:2407.19829 - Published 7/30/2024 by Mingming Li, Huimu Wang, Zuxu Chen, Guangtao Nie, Yiming Qiu, Binbin Wang, Guoyu Tang, Lin Liu, Jingwei Zhuo

Overview

This paper presents a framework called "generative retrieval with preference optimization" for e-commerce search.
The key idea is to have an autoregressive model generate identifiers of relevant items directly from the query, rather than relying on traditional keyword matching.
The model is aligned with human preferences using click data, and a constraint-based beam search turns the generated identifiers into the final retrieved items.

Plain English Explanation

When you're shopping online, the search results you see often don't perfectly match what you're looking for. Generative Retrieval with Preference Optimization is a new approach that aims to change that.

Instead of just matching your search terms to product descriptions, this model learns which products shoppers tend to choose for a given search. It does this with a kind of artificial intelligence called a "generative model," which writes out short phrases that identify products.

The generative model is trained on product titles and on click data, which records the items shoppers selected for past searches. This allows it to learn patterns and relationships that a traditional search engine might miss. When you search for something, the model doesn't just look for matching keywords. It generates the key phrases of the products you are most likely to want, and the system uses those phrases to look up real items.

The result is a search experience in which the top results better reflect what shoppers click on, and the generated phrases help show why each item was retrieved. This could make online shopping more efficient by helping you find what you're looking for more quickly.

Technical Explanation

The Generative Retrieval with Preference Optimization framework trains an autoregressive model to generate item identifiers from a query, then uses those identifiers to retrieve items for e-commerce search.

The key components are:

Multi-Span Identifiers: Each raw item title is represented by a multi-span identifier, that is, several short spans, rather than by the full title string. Generating these spans from a brief query is simpler than generating a long, noisy title whose word order carries little meaning.
Preference Optimization: The generative model is trained not just to match queries, but to align with human preferences. Click data supplies the preference signal, so the model learns to favor the items users choose.
Constrained Beam Search: During search, the model generates identifier spans under a constraint-based beam search. The key spans it identifies are used to retrieve the final items, which also makes the results easier to interpret.
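The paper's abstract says the final items are generated through a constraint-based beam search. The sketch below illustrates the general technique only, under stated assumptions: identifiers are stored in a prefix trie so the decoder can never emit a sequence that matches no catalog item. The names `build_trie`, `constrained_beam_search`, and `toy_scorer`, the `<eos>` marker, and the tiny catalog are all hypothetical, and the toy scorer stands in for the trained autoregressive model.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

END = "<eos>"  # hypothetical end-of-identifier marker


def build_trie(identifiers: Sequence[Sequence[str]]) -> Dict[str, dict]:
    """Store every valid identifier (a sequence of spans) in a prefix trie."""
    root: Dict[str, dict] = {}
    for ident in identifiers:
        node = root
        for tok in list(ident) + [END]:
            node = node.setdefault(tok, {})
    return root


def allowed_next(trie: Dict[str, dict], prefix: Sequence[str]) -> List[str]:
    """Return the spans that may legally follow `prefix` according to the trie."""
    node = trie
    for tok in prefix:
        if tok not in node:
            return []
        node = node[tok]
    return list(node.keys())


def constrained_beam_search(
    score_fn: Callable[[Tuple[str, ...], List[str]], Dict[str, float]],
    trie: Dict[str, dict],
    beam_size: int = 2,
    max_len: int = 8,
) -> List[Tuple[Tuple[str, ...], float]]:
    """Beam search in which each step may only choose trie-allowed spans."""
    beams: List[Tuple[Tuple[str, ...], float]] = [((), 0.0)]
    finished: List[Tuple[Tuple[str, ...], float]] = []
    for _ in range(max_len):
        candidates = []
        for prefix, logp in beams:
            options = allowed_next(trie, prefix)
            if not options:
                continue
            step_scores = score_fn(prefix, options)  # log-probs over options
            for tok in options:
                total = logp + step_scores[tok]
                if tok == END:
                    finished.append((prefix, total))
                else:
                    candidates.append((prefix + (tok,), total))
        if not candidates:
            break
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_size]
    return sorted(finished, key=lambda c: c[1], reverse=True)


def toy_scorer(query: str) -> Callable[[Tuple[str, ...], List[str]], Dict[str, float]]:
    """Stand-in for the model: favors spans that appear in the query."""
    words = set(query.lower().split())

    def score(prefix: Tuple[str, ...], options: List[str]) -> Dict[str, float]:
        logits = {tok: (2.0 if tok in words else 0.0) for tok in options}
        log_z = math.log(sum(math.exp(v) for v in logits.values()))
        return {tok: v - log_z for tok, v in logits.items()}

    return score


# Hypothetical catalog: each item is identified by a short sequence of spans.
catalog = [("wireless", "mouse"), ("wireless", "keyboard"), ("gaming", "mouse")]
results = constrained_beam_search(toy_scorer("wireless mouse"), build_trie(catalog))
for spans, logp in results:
    print(spans, round(logp, 3))
```

Because every step is filtered through the trie, each returned sequence corresponds to a real catalog entry, and the spans themselves indicate why an item was retrieved.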

This stands in contrast to traditional keyword-based search, which may miss important nuances in user intent and product relationships. By combining generative modeling with preference optimization, the system aims to return results that are more relevant and easier to interpret.
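This summary does not state the exact training objective used for preference alignment. As an illustration only, the sketch below uses a DPO-style pairwise loss, which is one common way to align a generative model with preference pairs; it is an assumption here, not the paper's confirmed method. Each pair is assumed to consist of a clicked item and a non-clicked item for the same query, and the function name `dpo_style_loss`, its arguments, and the `beta` default are hypothetical.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_style_loss(
    logp_clicked: float,
    logp_skipped: float,
    ref_logp_clicked: float,
    ref_logp_skipped: float,
    beta: float = 0.1,
) -> float:
    """Pairwise preference loss for one (clicked, skipped) item pair.

    logp_* are sequence log-probabilities of each item's identifier under the
    model being trained; ref_logp_* are the same quantities under a frozen
    reference model. The loss falls as the trained model raises the clicked
    item's probability relative to the skipped one, compared with the reference.
    """
    margin = beta * (
        (logp_clicked - ref_logp_clicked) - (logp_skipped - ref_logp_skipped)
    )
    # -log(sigmoid(margin)) equals softplus(-margin)
    return softplus(-margin)


# Hypothetical log-probabilities for a single query.
prefers_clicked = dpo_style_loss(-0.5, -3.0, -1.0, -1.0)
prefers_skipped = dpo_style_loss(-3.0, -0.5, -1.0, -1.0)
print(prefers_clicked < prefers_skipped)
```

In a real training loop the log-probabilities would come from the autoregressive model and the loss would be averaged over many click-derived pairs.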

The authors report competitive performance in experiments on a real-world e-commerce dataset, and online A/B tests show improved conversion.

Critical Analysis

The Generative Retrieval with Preference Optimization approach presents an innovative solution to the problem of e-commerce search. By incorporating user preferences and leveraging generative modeling, it addresses some of the key limitations of traditional search engines.

However, this summary does not cover potential limitations or caveats of the approach in depth. For example, the reliance on multi-span identifiers built from item titles may not generalize well to all e-commerce domains, and optimizing toward click data could lead to filter bubbles or reinforce existing biases.

Additionally, the offline evaluation described uses a single real-world dataset, alongside online A/B tests, and it would be valuable to see how the approach performs across a wider range of scenarios and user populations.

Further research could also investigate ways to make the preference optimization process more transparent and controllable, to ensure that the system is behaving ethically and aligning with user values.

Conclusion

Generative Retrieval with Preference Optimization adapts generative retrieval to e-commerce search by generating multi-span item identifiers, aligning the model with click data, and retrieving items through constrained beam search.

By shifting the focus from simple keyword matching to preference-aligned generation, this approach has the potential to improve the online shopping experience, making it easier for users to find the products they want.

While the paper raises some questions that warrant further exploration, the core ideas behind this work are highly promising and could have far-reaching implications for the future of e-commerce and information retrieval more broadly.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Generative Retrieval with Preference Optimization for E-commerce Search

Mingming Li, Huimu Wang, Zuxu Chen, Guangtao Nie, Yiming Qiu, Binbin Wang, Guoyu Tang, Lin Liu, Jingwei Zhuo

Generative retrieval introduces a groundbreaking paradigm to document retrieval by directly generating the identifier of a pertinent document in response to a specific query. This paradigm has demonstrated considerable benefits and potential, particularly in representation and generalization capabilities, within the context of large language models. However, it faces significant challenges in E-commerce search scenarios, including the complexity of generating detailed item titles from brief queries, the presence of noise in item titles with weak language order, issues with long-tail queries, and the interpretability of results. To address these challenges, we have developed an innovative framework for E-commerce search, called generative retrieval with preference optimization. This framework is designed to effectively learn and align an autoregressive model with target data, subsequently generating the final item through constraint-based beam search. By employing multi-span identifiers to represent raw item titles and transforming the task of generating titles from queries into the task of generating multi-span identifiers from queries, we aim to simplify the generation process. The framework further aligns with human preferences using click data and employs a constrained search method to identify key spans for retrieving the final item, thereby enhancing result interpretability. Our extensive experiments show that this framework achieves competitive performance on a real-world dataset, and online A/B tests demonstrate the superiority and effectiveness in improving conversion gains.

7/30/2024

Hi-Gen: Generative Retrieval For Large-Scale Personalized E-commerce Search

Yanjing Wu, Yinfu Feng, Jian Wang, Wenji Zhou, Yunan Ye, Rong Xiao, Jun Xiao

Leveraging generative retrieval (GR) techniques to enhance search systems is an emerging methodology that has shown promising results in recent years. In GR, a text-to-text model maps string queries directly to relevant document identifiers (docIDs), dramatically simplifying the retrieval process. However, when applying most GR models in large-scale E-commerce for personalized item search, we must face two key problems in encoding and decoding. (1) Existing docID generation methods ignore the encoding of efficiency information, which is critical in E-commerce. (2) The positional information is important in decoding docIDs, while prior studies have not adequately discriminated the significance of positional information or well exploited the inherent interrelation among these positions. To overcome these problems, we introduce an efficient Hierarchical encoding-decoding Generative retrieval method (Hi-Gen) for large-scale personalized E-commerce search systems. Specifically, we first design a representation learning model using metric learning to learn discriminative feature representations of items to capture semantic relevance and efficiency information. Then, we propose a category-guided hierarchical clustering scheme that makes full use of the semantic and efficiency information of items to facilitate docID generation. Finally, we design a position-aware loss to discriminate the importance of positions and mine the inherent interrelation between different tokens at the same position. This loss boosts the performance of the language model used in the decoding stage. Besides, we propose two variants of Hi-Gen (Hi-Gen-I2I and Hi-Gen-Cluster) to support online real-time large-scale recall in the online serving process. Hi-Gen gets 3.30% and 4.62% improvements over SOTA for Recall@1 on the public and industry datasets, respectively.

9/9/2024

Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning

Zihua Si, Zhongxiang Sun, Jiale Chen, Guozhang Chen, Xiaoxue Zang, Kai Zheng, Yang Song, Xiao Zhang, Jun Xu, Kun Gai

The retrieval phase is a vital component in recommendation systems, requiring the model to be effective and efficient. Recently, generative retrieval has become an emerging paradigm for document retrieval, showing notable performance. These methods enjoy merits like being end-to-end differentiable, suggesting their viability in recommendation. However, these methods fall short in efficiency and effectiveness for large-scale recommendations. To obtain efficiency and effectiveness, this paper introduces a generative retrieval framework, namely SEATER, which learns SEmAntic Tree-structured item identifiERs via contrastive learning. Specifically, we employ an encoder-decoder model to extract user interests from historical behaviors and retrieve candidates via tree-structured item identifiers. SEATER devises a balanced k-ary tree structure of item identifiers, allocating semantic space to each token individually. This strategy maintains semantic consistency within the same level, while distinct levels correlate to varying semantic granularities. This structure also maintains consistent and fast inference speed for all items. Considering the tree structure, SEATER learns identifier tokens' semantics, hierarchical relationships, and inter-token dependencies. To achieve this, we incorporate two contrastive learning tasks with the generation task to optimize both the model and identifiers. The infoNCE loss aligns the token embeddings based on their hierarchical positions. The triplet loss ranks similar identifiers in desired orders. In this way, SEATER achieves both efficiency and effectiveness. Extensive experiments on three public datasets and an industrial dataset have demonstrated that SEATER outperforms state-of-the-art models significantly.

7/9/2024

Evaluating Generative Ad Hoc Information Retrieval

Lukas Gienapp, Harrisen Scells, Niklas Deckers, Janek Bevendorff, Shuai Wang, Johannes Kiesel, Shahbaz Syed, Maik Fröbe, Guido Zuccon, Benno Stein, Matthias Hagen, Martin Potthast

Recent advances in large language models have enabled the development of viable generative retrieval systems. Instead of a traditional document ranking, generative retrieval systems often directly return a grounded generated text as a response to a query. Quantifying the utility of the textual responses is essential for appropriately evaluating such generative ad hoc retrieval. Yet, the established evaluation methodology for ranking-based ad hoc retrieval is not suited for the reliable and reproducible evaluation of generated responses. To lay a foundation for developing new evaluation methods for generative retrieval systems, we survey the relevant literature from the fields of information retrieval and natural language processing, identify search tasks and system architectures in generative retrieval, develop a new user model, and study its operationalization.

5/24/2024