PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking

Read original: arXiv:2205.11245 - Published 8/29/2024 by Yixuan Qiao, Hao Chen, Jun Wang, Tuozhen Liu, Xianbin Ye, Xin Tang, Rui Fang, Peng Gao, Wenfeng Xie, Guotong Xie

🤿

Overview

This paper describes the PASH team's participation in the TREC 2021 Deep Learning Track.
They used a combination of sparse and dense retrieval methods in the recall stage.
In the multi-stage ranking phase, they used point-wise and pair-wise ranking strategies based on a model pre-trained on general knowledge and document-level data.
They also introduced the generative model T5 to further enhance performance compared to their previous TREC 2020 submission.

Plain English Explanation

The PASH team participated in a competition called the TREC 2021 Deep Learning Track. In the first step, called the recall stage, they used a mix of two different retrieval methods - one that focuses on finding relevant documents quickly, and another that does a more in-depth search.

In the next step, the ranking phase, they used two different ranking strategies one after the other. The first was a point-wise approach, which scores each document individually. The second was a pair-wise approach, which compares pairs of documents. These ranking models were pre-trained on general knowledge as well as the specific documents in the competition.

Compared to their previous entry in TREC 2020, the PASH team also incorporated a new generative model called T5 to further improve their overall performance in the competition.

Technical Explanation

In the recall stage, the PASH team combined a sparse retrieval method and a dense retrieval method to generate an initial set of potentially relevant documents.

For the multi-stage ranking phase, they used both point-wise and pair-wise ranking strategies. The point-wise model scored each document individually, while the pair-wise model compared pairs of documents. These ranking models were pre-trained on general knowledge as well as the specific corpus of documents in the competition.

To further enhance performance compared to their TREC 2020 submission, the PASH team introduced the generative model T5 into their system.

Critical Analysis

The paper does not provide many details on the specific architectures, hyperparameters, or training procedures used for the various models. Without this information, it is difficult to fully evaluate the technical merits of their approach.

Additionally, the paper does not discuss any potential limitations or caveats of their methods. It would be helpful to understand if there are any specific scenarios or datasets where their approach may struggle, or any areas for further research and improvement.

While the introduction of the T5 model seems to have improved performance, the paper does not provide a thorough analysis of how much this contribution impacted the overall results compared to the other components of their system.

Conclusion

The PASH team's submission to the TREC 2021 Deep Learning Track combined several state-of-the-art retrieval and ranking techniques, including sparse, dense, point-wise, and pair-wise methods. The addition of the T5 generative model further enhanced their performance compared to their previous entry.

However, the paper lacks some technical details and a more critical analysis of the strengths, weaknesses, and potential areas for improvement of their approach. Overall, their work demonstrates the value of integrating multiple advanced AI models to tackle complex information retrieval tasks.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking

Yixuan Qiao, Hao Chen, Jun Wang, Tuozhen Liu, Xianbin Ye, Xin Tang, Rui Fang, Peng Gao, Wenfeng Xie, Guotong Xie

This paper describes the PASH participation in TREC 2021 Deep Learning Track. In the recall stage, we adopt a scheme combining sparse and dense retrieval method. In the multi-stage ranking phase, point-wise and pair-wise ranking strategies are used one after another based on model continual pre-trained on general knowledge and document-level data. Compared to TREC 2020 Deep Learning Track, we have additionally introduced the generative model T5 to further enhance the performance.

8/29/2024

Generative Relevance Feedback and Convergence of Adaptive Re-Ranking: University of Glasgow Terrier Team at TREC DL 2023

Andrew Parry, Thomas Jaenich, Sean MacAvaney, Iadh Ounis

This paper describes our participation in the TREC 2023 Deep Learning Track. We submitted runs that apply generative relevance feedback from a large language model in both a zero-shot and pseudo-relevance feedback setting over two sparse retrieval approaches, namely BM25 and SPLADE. We couple this first stage with adaptive re-ranking over a BM25 corpus graph scored using a monoELECTRA cross-encoder. We investigate the efficacy of these generative approaches for different query types in first-stage retrieval. In re-ranking, we investigate operating points of adaptive re-ranking with different first stages to find the point in graph traversal where the first stage no longer has an effect on the performance of the overall retrieval pipeline. We find some performance gains from the application of generative query reformulation. However, our strongest run in terms of P@10 and nDCG@10 applied both adaptive re-ranking and generative pseudo-relevance feedback, namely uogtr_b_grf_e_gb.

5/3/2024

Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG

Gabriel de Souza P. Moreira, Ronay Ak, Benedikt Schifferer, Mengyao Xu, Radek Osmulski, Even Oldridge

Ranking models play a crucial role in enhancing overall accuracy of text retrieval systems. These multi-stage systems typically utilize either dense embedding models or sparse lexical indices to retrieve relevant passages based on a given query, followed by ranking models that refine the ordering of the candidate passages by its relevance to the query. This paper benchmarks various publicly available ranking models and examines their impact on ranking accuracy. We focus on text retrieval for question-answering tasks, a common use case for Retrieval-Augmented Generation systems. Our evaluation benchmarks include models some of which are commercially viable for industrial applications. We introduce a state-of-the-art ranking model, NV-RerankQA-Mistral-4B-v3, which achieves a significant accuracy increase of ~14% compared to pipelines with other rerankers. We also provide an ablation study comparing the fine-tuning of ranking models with different sizes, losses and self-attention mechanisms. Finally, we discuss challenges of text retrieval pipelines with ranking models in real-world industry applications, in particular the trade-offs among model size, ranking accuracy and system requirements like indexing and serving latency / throughput.

9/14/2024

SC-Rec: Enhancing Generative Retrieval with Self-Consistent Reranking for~Sequential Recommendation

Tongyoung Kim, Soojin Yoon, Seongku Kang, Jinyoung Yeo, Dongha Lee

Language Models (LMs) are increasingly employed in recommendation systems due to their advanced language understanding and generation capabilities. Recent recommender systems based on generative retrieval have leveraged the inferential abilities of LMs to directly generate the index tokens of the next item, based on item sequences within the user's interaction history. Previous studies have mostly focused on item indices based solely on textual semantic or collaborative information. However, although the standalone effectiveness of these aspects has been demonstrated, the integration of this information has remained unexplored. Our in-depth analysis finds that there is a significant difference in the knowledge captured by the model from heterogeneous item indices and diverse input prompts, which can have a high potential for complementarity. In this paper, we propose SC-Rec, a unified recommender system that learns diverse preference knowledge from two distinct item indices and multiple prompt templates. Furthermore, SC-Rec adopts a novel reranking strategy that aggregates a set of ranking results, inferred based on different indices and prompts, to achieve the self-consistency of the model. Our empirical evaluation on three real-world datasets demonstrates that SC-Rec considerably outperforms the state-of-the-art methods for sequential recommendation, effectively incorporating complementary knowledge from varied outputs of the model.

8/20/2024