Instruction Tuning With Loss Over Instructions

2405.14394

Published 5/24/2024 by Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani

✅

Abstract

Instruction tuning plays a crucial role in shaping the outputs of language models (LMs) to desired styles. In this work, we propose a simple yet effective method, Instruction Modelling (IM), which trains LMs by applying a loss function to the instruction and prompt part rather than solely to the output part. Through experiments across 21 diverse benchmarks, we show that, in many scenarios, IM can effectively improve the LM performance on both NLP tasks (e.g., MMLU, TruthfulQA, and HumanEval) and open-ended generation benchmarks (e.g., MT-Bench and AlpacaEval). Remarkably, in the most advantageous case, IM boosts model performance on AlpacaEval 1.0 by over 100%. We identify two key factors influencing the effectiveness of IM: (1) The ratio between instruction length and output length in the training data; and (2) The number of training examples. We observe that IM is especially beneficial when trained on datasets with lengthy instructions paired with brief outputs, or under the Superficial Alignment Hypothesis (SAH) where a small amount of training examples are used for instruction tuning. Further analysis substantiates our hypothesis that the improvement can be attributed to reduced overfitting to instruction tuning datasets. Our work provides practical guidance for instruction tuning LMs, especially in low-resource scenarios.

Create account to get full access

Overview

Instruction tuning is a crucial technique for shaping the outputs of language models (LMs) to desired styles.
This paper proposes a simple yet effective method called Instruction Modelling (IM), which trains LMs by applying a loss function to the instruction and prompt part, rather than solely to the output part.
Experiments across 21 diverse benchmarks show that IM can effectively improve LM performance on both NLP tasks and open-ended generation benchmarks, with up to 100% boost in some cases.
The effectiveness of IM is influenced by two key factors: the ratio between instruction length and output length in the training data, and the number of training examples.

Plain English Explanation

Language models (LMs) are powerful AI systems that can generate human-like text. However, their outputs can sometimes be unpredictable or not aligned with the desired style or intent. Instruction tuning is a technique that helps shape the LM's outputs to be more in line with specific instructions or prompts.

In this paper, the researchers propose a new method called Instruction Modelling (IM), which trains the LM to pay more attention to the instruction or prompt part of the training data, rather than just focusing on the generated output. This helps the LM better understand and follow the desired instructions, leading to improved performance on a wide range of tasks, from answering questions to generating creative content.

The researchers found that IM is particularly effective when the training data has long instructions paired with short outputs, or when there are only a small number of training examples available (the Superficial Alignment Hypothesis). This suggests that IM helps the LM avoid overfitting to the limited training data and instead focus on understanding and following the instructions more effectively.

Overall, this research provides practical guidance for using instruction tuning to improve the performance of language models, especially in situations where training data is scarce or the desired output style is quite different from the standard language used in the training data.

Technical Explanation

The paper proposes a new method called Instruction Modelling (IM) for training language models (LMs) to better follow instructions and generate desired outputs. In contrast to standard language modelling approaches that focus solely on predicting the next token in the output, IM applies a loss function to the instruction and prompt part of the training data, in addition to the output.

Through experiments across 21 diverse benchmarks, the researchers demonstrate that IM can significantly improve LM performance on both NLP tasks (e.g., MMLU, TruthfulQA, and HumanEval) and open-ended generation benchmarks (e.g., MT-Bench and AlpacaEval 1.0). In the most advantageous case, IM boosts model performance on AlpacaEval 1.0 by over 100%.

The researchers identify two key factors that influence the effectiveness of IM:

The ratio between instruction length and output length in the training data: IM is more beneficial when the instructions are longer than the outputs.
The number of training examples: IM is especially useful under the Superficial Alignment Hypothesis (SAH), where a small amount of training examples are used for instruction tuning.

Further analysis suggests that the improvement from IM can be attributed to reduced overfitting to the instruction tuning datasets, which helps the LM better understand and follow the desired instructions.

Critical Analysis

The paper provides a robust experimental evaluation of the proposed Instruction Modelling (IM) approach, demonstrating its effectiveness across a diverse range of benchmarks. However, the authors acknowledge several limitations and areas for further research:

Generalization to other model architectures: The experiments were conducted using the GPT-3 language model. It would be valuable to evaluate the performance of IM on other LM architectures to assess its broader applicability.
Long-term effects of instruction tuning: The paper focuses on immediate task performance, but it would be interesting to investigate the long-term effects of instruction tuning on the LM's overall behavior and alignment with human values and preferences.
Interaction with other fine-tuning techniques: The paper does not explore the potential synergies or conflicts between IM and other fine-tuning approaches, such as multitask learning or target-specific fine-tuning. Investigating these interactions could lead to more robust and effective fine-tuning strategies.
Broader social implications: While the paper focuses on the technical aspects of instruction tuning, it would be valuable to consider the potential societal impacts, both positive and negative, of language models that are increasingly aligned with specific instructions or prompts.

Overall, this paper provides a valuable contribution to the field of language model fine-tuning and offers practical guidance for improving the performance of LMs through instruction-based training. However, further research is needed to fully understand the broader implications and limitations of this approach.

Conclusion

This paper introduces a novel method called Instruction Modelling (IM) for training language models (LMs) to better follow instructions and generate desired outputs. Through extensive experiments, the researchers demonstrate that IM can significantly improve LM performance on a wide range of tasks, including both NLP benchmarks and open-ended generation.

The key factors influencing the effectiveness of IM are the ratio between instruction length and output length in the training data, as well as the number of training examples. IM is particularly beneficial when the instructions are longer than the outputs, and when only a small number of training examples are available (the Superficial Alignment Hypothesis).

This research provides practical guidance for using instruction tuning to enhance the capabilities of language models, especially in low-resource scenarios or when the desired output style differs from the standard language used in the training data. As language models continue to play a prominent role in various applications, techniques like IM will be crucial for aligning their outputs with human preferences and values.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

💬

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning

Xuansheng Wu, Wenlin Yao, Jianshu Chen, Xiaoman Pan, Xiaoyang Wang, Ninghao Liu, Dong Yu

Large Language Models (LLMs) have achieved remarkable success, where instruction tuning is the critical step in aligning LLMs with user intentions. In this work, we investigate how the instruction tuning adjusts pre-trained models with a focus on intrinsic changes. Specifically, we first develop several local and global explanation methods, including a gradient-based method for input-output attribution, and techniques for interpreting patterns and concepts in self-attention and feed-forward layers. The impact of instruction tuning is then studied by comparing the explanations derived from the pre-trained and instruction-tuned models. This approach provides an internal perspective of the model shifts on a human-comprehensible level. Our findings reveal three significant impacts of instruction tuning: 1) It empowers LLMs to recognize the instruction parts of user prompts, and promotes the response generation constantly conditioned on the instructions. 2) It encourages the self-attention heads to capture more word-word relationships about instruction verbs. 3) It encourages the feed-forward networks to rotate their pre-trained knowledge toward user-oriented tasks. These insights contribute to a more comprehensive understanding of instruction tuning and lay the groundwork for future work that aims at explaining and optimizing LLMs for various applications. Our code and data are publicly available at https://github.com/JacksonWuxs/Interpret_Instruction_Tuning_LLMs.

4/5/2024

cs.CL cs.AI cs.LG

Contrastive Instruction Tuning

Tianyi Lorena Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen

Instruction tuning has been used as a promising approach to improve the performance of large language models (LLMs) on unseen tasks. However, current LLMs exhibit limited robustness to unseen instructions, generating inconsistent outputs when the same instruction is phrased with slightly varied forms or language styles. This behavior indicates LLMs' lack of robustness to textual variations and generalizability to unseen instructions, potentially leading to trustworthiness issues. Accordingly, we propose Contrastive Instruction Tuning, which maximizes the similarity between the hidden representations of semantically equivalent instruction-instance pairs while minimizing the similarity between semantically different ones. To facilitate this approach, we augment the existing FLAN collection by paraphrasing task instructions. Experiments on the PromptBench benchmark show that CoIN consistently improves LLMs' robustness to unseen instructions with variations across character, word, sentence, and semantic levels by an average of +2.5% in accuracy. Code is available at https://github.com/luka-group/CoIN.

6/7/2024

cs.CL cs.AI cs.LG

Towards Robust Instruction Tuning on Multimodal Large Language Models

Wei Han, Hui Chen, Soujanya Poria

Fine-tuning large language models (LLMs) on multi-task instruction-following data has been proven to be a powerful learning paradigm for improving their zero-shot capabilities on new tasks. Recent works about high-quality instruction-following data generation and selection require amounts of human labor to conceive model-understandable instructions for the given tasks and carefully filter the LLM-generated data. In this work, we introduce an automatic instruction augmentation method named INSTRAUG in multimodal tasks. It starts from a handful of basic and straightforward meta instructions but can expand an instruction-following dataset by 30 times. Results on two popular multimodal instructionfollowing benchmarks MULTIINSTRUCT and InstructBLIP show that INSTRAUG can significantly improve the alignment of multimodal large language models (MLLMs) across 12 multimodal tasks, which is even equivalent to the benefits of scaling up training data multiple times.

6/17/2024

cs.CL cs.AI

💬

BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing

Hieu Tran, Zhichao Yang, Zonghai Yao, Hong Yu

To enhance the performance of large language models (LLMs) in biomedical natural language processing (BioNLP) by introducing a domain-specific instruction dataset and examining its impact when combined with multi-task learning principles. We created the BioInstruct, comprising 25,005 instructions to instruction-tune LLMs(LLaMA 1 & 2, 7B & 13B version). The instructions were created by prompting the GPT-4 language model with three-seed samples randomly drawn from an 80 human curated instructions. We employed Low-Rank Adaptation(LoRA) for parameter-efficient fine-tuning. We then evaluated these instruction-tuned LLMs on several BioNLP tasks, which can be grouped into three major categories: question answering(QA), information extraction(IE), and text generation(GEN). We also examined whether categories(e.g., QA, IE, and generation) of instructions impact model performance. Comparing with LLMs without instruction-tuned, our instruction-tuned LLMs demonstrated marked performance gains: 17.3% in QA, 5.7% in IE, and 96% in Generation tasks. Our 7B-parameter instruction-tuned LLaMA 1 model was competitive or even surpassed other LLMs in the biomedical domain that were also fine-tuned from LLaMA 1 with vast domain-specific data or a variety of tasks. Our results also show that the performance gain is significantly higher when instruction fine-tuning is conducted with closely related tasks. Our findings align with the observations of multi-task learning, suggesting the synergies between two tasks. The BioInstruct dataset serves as a valuable resource and instruction tuned LLMs lead to the best performing BioNLP applications.

6/10/2024

cs.CL cs.AI