LLM-based Weak Supervision Framework for Query Intent Classification in Video Search

Read original: arXiv:2409.08931 - Published 9/16/2024 by Farnoosh Javadi, Phanideep Gampa, Alyssa Woo, Xingxing Geng, Hang Zhang, Jose Sepulveda, Belhassen Bayar, Fei Wang

LLM-based Weak Supervision Framework for Query Intent Classification in Video Search

Overview

This blog post provides a plain English summary and technical explanation of the research paper "Instructions for *ACL Proceedings".
The paper discusses guidelines and best practices for authors submitting papers to conferences organized by the Association for Computational Linguistics (ACL).
Key topics covered include paper formatting, citation styles, figures and tables, and other submission requirements.

Plain English Explanation

The paper outlines the instructions and guidelines that authors must follow when submitting a paper to conferences organized by the Association for Computational Linguistics (ACL). These conferences are major events in the field of natural language processing and computational linguistics.

The instructions cover important aspects of paper formatting, such as the required font size, line spacing, and margins. Authors are also provided guidance on properly citing references and including figures and tables in their submissions. Additionally, the paper discusses requirements around the paper's structure, including the expected sections (e.g. introduction, related work, approach).

Following these instructions is crucial for authors, as it ensures their paper is presented in a consistent and professional manner, making it easier for reviewers to evaluate the work. Adhering to the guidelines also helps maintain the high quality standards of ACL conferences.

Technical Explanation

The paper provides a comprehensive set of instructions for authors submitting papers to ACL conferences. It covers key formatting requirements, such as:

The instructions also cover submission requirements, such as the need for a title page, abstract, and author information. Additionally, the paper discusses ethical considerations, such as avoiding plagiarism and ensuring appropriate attribution.

Critical Analysis

The paper provides a thorough and well-organized set of instructions for authors submitting to ACL conferences. The guidelines cover a wide range of formatting and structural elements, ensuring a consistent presentation across all accepted papers.

One potential limitation is the lack of examples or visual aids to illustrate the formatting requirements. Including sample figures, tables, or citation styles could further improve the clarity and usability of the instructions for authors.

Additionally, the paper does not address how the instructions may evolve over time or how authors can stay informed of any changes. Providing information on the review and update process for these guidelines could be valuable for the research community.

Conclusion

The "Instructions for *ACL Proceedings" paper serves as an essential resource for authors seeking to submit their work to ACL conferences. By outlining the formatting, structure, and submission requirements in detail, the paper helps ensure a high level of quality and consistency in the published proceedings. Adhering to these guidelines is critical for authors who want to have their research considered and accepted by these prestigious computational linguistics events.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

LLM-based Weak Supervision Framework for Query Intent Classification in Video Search

Farnoosh Javadi, Phanideep Gampa, Alyssa Woo, Xingxing Geng, Hang Zhang, Jose Sepulveda, Belhassen Bayar, Fei Wang

Streaming services have reshaped how we discover and engage with digital entertainment. Despite these advancements, effectively understanding the wide spectrum of user search queries continues to pose a significant challenge. An accurate query understanding system that can handle a variety of entities that represent different user intents is essential for delivering an enhanced user experience. We can build such a system by training a natural language understanding (NLU) model; however, obtaining high-quality labeled training data in this specialized domain is a substantial obstacle. Manual annotation is costly and impractical for capturing users' vast vocabulary variations. To address this, we introduce a novel approach that leverages large language models (LLMs) through weak supervision to automatically annotate a vast collection of user search queries. Using prompt engineering and a diverse set of LLM personas, we generate training data that matches human annotator expectations. By incorporating domain knowledge via Chain of Thought and In-Context Learning, our approach leverages the labeled data to train low-latency models optimized for real-time inference. Extensive evaluations demonstrated that our approach outperformed the baseline with an average relative gain of 113% in recall. Furthermore, our novel prompt engineering framework yields higher quality LLM-generated data to be used for weak supervision; we observed 47.60% improvement over baseline in agreement rate between LLM predictions and human annotations with respect to F1 score, weighted according to the distribution of occurrences of the search queries. Our persona selection routing mechanism further adds an additional 3.67% increase in weighted F1 score on top of our novel prompt engineering framework.

9/16/2024

💬

Leveraging Large Language Models for Knowledge-free Weak Supervision in Clinical Natural Language Processing

Enshuo Hsu, Kirk Roberts

The performance of deep learning-based natural language processing systems is based on large amounts of labeled training data which, in the clinical domain, are not easily available or affordable. Weak supervision and in-context learning offer partial solutions to this issue, particularly using large language models (LLMs), but their performance still trails traditional supervised methods with moderate amounts of gold-standard data. In particular, inferencing with LLMs is computationally heavy. We propose an approach leveraging fine-tuning LLMs and weak supervision with virtually no domain knowledge that still achieves consistently dominant performance. Using a prompt-based approach, the LLM is used to generate weakly-labeled data for training a downstream BERT model. The weakly supervised model is then further fine-tuned on small amounts of gold standard data. We evaluate this approach using Llama2 on three different n2c2 datasets. With no more than 10 gold standard notes, our final BERT models weakly supervised by fine-tuned Llama2-13B consistently outperformed out-of-the-box PubMedBERT by 4.7% to 47.9% in F1 scores. With only 50 gold standard notes, our models achieved close performance to fully fine-tuned systems.

6/12/2024

💬

Optimizing Language Model's Reasoning Abilities with Weak Supervision

Yongqi Tong, Sizhe Wang, Dawei Li, Yifan Wang, Simeng Han, Zi Lin, Chengsong Huang, Jiaxin Huang, Jingbo Shang

While Large Language Models (LLMs) have demonstrated proficiency in handling complex queries, much of the past work has depended on extensively annotated datasets by human experts. However, this reliance on fully-supervised annotations poses scalability challenges, particularly as models and data requirements grow. To mitigate this, we explore the potential of enhancing LLMs' reasoning abilities with minimal human supervision. In this work, we introduce self-reinforcement, which begins with Supervised Fine-Tuning (SFT) of the model using a small collection of annotated questions. Then it iteratively improves LLMs by learning from the differences in responses from the SFT and unfinetuned models on unlabeled questions. Our approach provides an efficient approach without relying heavily on extensive human-annotated explanations. However, current reasoning benchmarks typically only include golden-reference answers or rationales. Therefore, we present textsc{PuzzleBen}, a weakly supervised benchmark that comprises 25,147 complex questions, answers, and human-generated rationales across various domains, such as brainteasers, puzzles, riddles, parajumbles, and critical reasoning tasks. A unique aspect of our dataset is the inclusion of 10,000 unannotated questions, enabling us to explore utilizing fewer supersized data to boost LLMs' inference capabilities. Our experiments underscore the significance of textsc{PuzzleBen}, as well as the effectiveness of our methodology as a promising direction in future endeavors. Our dataset and code will be published soon on texttt{Anonymity Link}.

5/8/2024

📉

LLM-based query paraphrasing for video search

Jiaxin Wu, Chong-Wah Ngo, Wing-Kwong Chan, Sheng-Hua Zhong

Text-to-video retrieval answers user queries through search by concepts and embeddings. Limited by the size of the concept bank and the amount of training data, answering queries in the wild is not always effective due to the out-of-vocabulary problem. Furthermore, neither concept-based nor embedding-based search can perform reasoning to consolidate the search results for complex queries mixed with logical and spatial constraints. To address these problems, we leverage large language models (LLM) to paraphrase the query by text-to-text (T2T), text-to-image (T2I), and image-to-text (I2T) transformations. These transformations rephrase abstract concepts into simple words to address the out-of-vocabulary problem. Furthermore, the complex relationship in a query can be decoupled into simpler sub-queries, yielding better retrieval performance when fusing the search results of these sub-queries. To address the LLM hallucination problem, this paper also proposes a novel consistency-based verification strategy to filter the paraphrased queries that are factually incorrect. Extensive experiments are conducted for ad-hoc video search and known-item search on the TRECVid datasets. We provide empirical insights into how traditionally difficult-to-answer queries can be resolved by query paraphrasing.

7/18/2024