CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval

Read original: arXiv:2301.04413 - Published 7/8/2024 by Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarski, Laure Soulier

🤷

Overview

Conversational search is a challenging task that aims to retrieve relevant documents based on the current query and the full conversation history.
Previous methods have focused on a multi-stage ranking approach that relies on query reformulation, which can lead to suboptimal retrieval.
Other approaches have used a fully neural IR first-stage, but they are either zero-shot or rely on full learning-to-rank based on a dataset with pseudo-labels.

Plain English Explanation

Conversational search is the process of retrieving information from a database or the internet based on a series of back-and-forth questions and answers, rather than a single query. This is a difficult task because the system needs to understand the context of the entire conversation, not just the most recent question.

Previous methods have tried to solve this problem by breaking it into multiple steps. First, they would try to reformulate the user's query based on the conversation history. Then, they would use that reformulated query to search for relevant documents. The problem with this approach is that the query reformulation step might not always lead to the best search results.

Other approaches have tried to use a fully neural information retrieval (IR) system for the first stage. This means the system would try to understand the user's intent and match it to relevant documents without the need for query reformulation. However, these neural IR systems either can't handle new types of queries they haven't seen before (zero-shot) or they rely on a large dataset of pre-labeled examples to train the system, which can be difficult to obtain.

In this paper, the researchers propose a new, lightweight technique to train a first-stage ranker using a sparse representation method called SPLADE. By using SPLADE's efficient sparse representations, they are able to build a competitive conversational search system when combined with a second-stage ranker based on T5Mono.

Technical Explanation

The key innovation in this paper is the use of SPLADE, a sparse representation method, to train a first-stage ranker for conversational search. SPLADE is able to encode query-document relationships in a compact, efficient way, which allows the researchers to train a lightweight first-stage model.

The overall system works as follows:

The SPLADE-based first-stage ranker takes the current query and the conversation history and produces a set of potentially relevant documents.
These documents are then re-ranked by a second-stage model based on T5Mono, which can better capture the nuances of the conversation context.

By relying on SPLADE's sparse representations, the researchers were able to achieve competitive results on the TREC CAsT 2020 and 2021 conversational search benchmarks, without the need for complex query reformulation or large datasets of pre-labeled examples.

Critical Analysis

The researchers acknowledge that their approach has some limitations. For example, the SPLADE-based first-stage ranker may not be able to capture all the contextual information in the conversation history, which could lead to suboptimal initial document retrieval.

Additionally, the use of a separate second-stage ranker adds complexity to the overall system, which could impact its efficiency and deployment in real-world applications.

The researchers also note that their approach is sensitive to the quality of the SPLADE representations, which could be an area for further optimization and research.

Despite these limitations, the researchers' innovative use of SPLADE sparse representations to build a lightweight first-stage ranker is a promising step forward in the field of conversational search.

Conclusion

This paper presents an interesting approach to the challenging problem of conversational search. By leveraging SPLADE's efficient sparse representations, the researchers were able to build a competitive first-stage ranker without the need for complex query reformulation or large datasets of pre-labeled examples.

While the approach has some limitations, it represents an important step forward in the field and could inspire further research into lightweight, yet effective, methods for conversational search. As conversational interfaces become more prevalent, techniques like the one described in this paper will be crucial for providing users with relevant and contextual information.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤷

CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval

Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarski, Laure Soulier

Conversational search is a difficult task as it aims at retrieving documents based not only on the current user query but also on the full conversation history. Most of the previous methods have focused on a multi-stage ranking approach relying on query reformulation, a critical intermediate step that might lead to a sub-optimal retrieval. Other approaches have tried to use a fully neural IR first-stage, but are either zero-shot or rely on full learning-to-rank based on a dataset with pseudo-labels. In this work, leveraging the CANARD dataset, we propose an innovative lightweight learning technique to train a first-stage ranker based on SPLADE. By relying on SPLADE sparse representations, we show that, when combined with a second-stage ranker based on T5Mono, the results are competitive on the TREC CAsT 2020 and 2021 tracks.

7/8/2024

Contextualization with SPLADE for High Recall Retrieval

Eugene Yang

High Recall Retrieval (HRR), such as eDiscovery and medical systematic review, is a search problem that optimizes the cost of retrieving most relevant documents in a given collection. Iterative approaches, such as iterative relevance feedback and uncertainty sampling, are shown to be effective under various operational scenarios. Despite neural models demonstrating success in other text-related tasks, linear models such as logistic regression, in general, are still more effective and efficient in HRR since the model is trained and retrieves documents from the same fixed collection. In this work, we leverage SPLADE, an efficient retrieval model that transforms documents into contextualized sparse vectors, for HRR. Our approach combines the best of both worlds, leveraging both the contextualization from pretrained language models and the efficiency of linear models. It reduces 10% and 18% of the review cost in two HRR evaluation collections under a one-phase review workflow with a target recall of 80%. The experiment is implemented with TARexp and is available at https://github.com/eugene-yang/LSR-for-TAR.

5/8/2024

Two-Step SPLADE: Simple, Efficient and Effective Approximation of SPLADE

Carlos Lassance, Herv'e Dejean, St'ephane Clinchant, Nicola Tonellotto

Learned sparse models such as SPLADE have successfully shown how to incorporate the benefits of state-of-the-art neural information retrieval models into the classical inverted index data structure. Despite their improvements in effectiveness, learned sparse models are not as efficient as classical sparse model such as BM25. The problem has been investigated and addressed by recently developed strategies, such as guided traversal query processing and static pruning, with different degrees of success on in-domain and out-of-domain datasets. In this work, we propose a new query processing strategy for SPLADE based on a two-step cascade. The first step uses a pruned and reweighted version of the SPLADE sparse vectors, and the second step uses the original SPLADE vectors to re-score a sample of documents retrieved in the first stage. Our extensive experiments, performed on 30 different in-domain and out-of-domain datasets, show that our proposed strategy is able to improve mean and tail response times over the original single-stage SPLADE processing by up to $30times$ and $40times$, respectively, for in-domain datasets, and by 12x to 25x, for mean response on out-of-domain datasets, while not incurring in statistical significant difference in 60% of datasets.

4/23/2024

SPLATE: Sparse Late Interaction Retrieval

Thibault Formal, St'ephane Clinchant, Herv'e D'ejean, Carlos Lassance

The late interaction paradigm introduced with ColBERT stands out in the neural Information Retrieval space, offering a compelling effectiveness-efficiency trade-off across many benchmarks. Efficient late interaction retrieval is based on an optimized multi-step strategy, where an approximate search first identifies a set of candidate documents to re-rank exactly. In this work, we introduce SPLATE, a simple and lightweight adaptation of the ColBERTv2 model which learns an ``MLM adapter'', mapping its frozen token embeddings to a sparse vocabulary space with a partially learned SPLADE module. This allows us to perform the candidate generation step in late interaction pipelines with traditional sparse retrieval techniques, making it particularly appealing for running ColBERT in CPU environments. Our SPLATE ColBERTv2 pipeline achieves the same effectiveness as the PLAID ColBERTv2 engine by re-ranking 50 documents that can be retrieved under 10ms.

4/23/2024