Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Read original: arXiv:2404.02823 - Published 4/4/2024 by Haoran Sun, Lixin Liu, Junjie Li, Fengyu Wang, Baohua Dong, Ran Lin, Ruohui Huang

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Overview

This paper introduces Conifer, a method for improving the complex constrained instruction-following ability of large language models.
The authors demonstrate that by fine-tuning large language models on a diverse set of constrained instructions, the models can better understand and follow complex instructions.
Conifer outperforms existing instruction-following approaches and shows strong generalization to unseen instructions.

Plain English Explanation

The paper discusses a technique called Conifer that helps large language models, such as GPT-3, become better at following complex instructions. These models are trained on a huge amount of text data, which allows them to understand and generate human-like language. However, they can struggle with specific, multi-step instructions that come with various constraints.

The Conifer approach fine-tunes these large models on a diverse set of instructions that involve different types of constraints, such as time limits, resource restrictions, or specific formatting requirements. By exposing the models to this wide range of constrained instructions during training, they become better at understanding and accurately following complex, multi-step instructions, even ones they haven't seen before.

The researchers show that Conifer outperforms other existing methods for instruction-following, and the models trained with Conifer can generalize well to new instructions they haven't encountered during the fine-tuning process. This is an important capability, as being able to understand and follow precise instructions is crucial for many real-world applications of language AI, such as task completion, content creation, and interactive assistants.

Technical Explanation

The paper introduces Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models. The authors propose a fine-tuning approach, called Conifer, that helps large language models like GPT-3 better understand and follow complex instructions with various constraints.

The key insight behind Conifer is that by exposing the language models to a diverse set of constrained instructions during fine-tuning, the models can learn to handle the complexities of instruction-following more effectively. The authors curate a dataset of instructions that cover a wide range of constraints, such as time limits, resource restrictions, and specific formatting requirements. They then fine-tune large language models on this dataset, enabling the models to generalize their instruction-following abilities to unseen instructions.

The authors evaluate Conifer on several benchmark datasets for instruction-following, including Instruction-Following Evaluation Through Verbalizer Manipulation and From Language Modeling to Instruction-Following: Understanding. They show that Conifer outperforms existing instruction-following approaches, such as GraphInstruCT: Empowering Large Language Models for Graph Understanding and Instructional Fingerprinting of Large Language Models. The models trained with Conifer demonstrate strong generalization to new, unseen instructions, highlighting the effectiveness of the proposed fine-tuning method.

Critical Analysis

The paper provides a robust evaluation of the Conifer approach, demonstrating its advantages over existing instruction-following methods. However, the authors acknowledge that the Conifer dataset used for fine-tuning may not capture all possible real-world constraints and instructions. There could be additional complexities or edge cases that the models may still struggle with.

Furthermore, the paper does not explore the scalability of the Conifer approach as the size and complexity of the instruction dataset grow. It would be valuable to understand how the performance and training efficiency of the models scale as the dataset becomes larger and more diverse.

Additionally, the paper does not delve into the potential biases or limitations that may be introduced by the Conifer dataset, which could impact the fairness and robustness of the resulting instruction-following capabilities.

Conclusion

The Conifer approach introduced in this paper represents a significant advancement in improving the complex constrained instruction-following ability of large language models. By fine-tuning these models on a diverse set of instructions with various constraints, the authors have demonstrated that the models can better understand and follow precise, multi-step instructions, even ones they haven't encountered before.

This capability is crucial for many real-world applications of language AI, such as task completion, content creation, and interactive assistants. The strong performance of Conifer-trained models on benchmark datasets suggests that this approach could have a meaningful impact on the practical deployment of language models in various industries and domains.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Haoran Sun, Lixin Liu, Junjie Li, Fengyu Wang, Baohua Dong, Ran Lin, Ruohui Huang

The ability of large language models (LLMs) to follow instructions is crucial to real-world applications. Despite recent advances, several studies have highlighted that LLMs struggle when faced with challenging instructions, especially those that include complex constraints, hindering their effectiveness in various tasks. To address this challenge, we introduce Conifer, a novel instruction tuning dataset, designed to enhance LLMs to follow multi-level instructions with complex constraints. Utilizing GPT-4, we curate the dataset by a series of LLM-driven refinement processes to ensure high quality. We also propose a progressive learning scheme that emphasizes an easy-to-hard progression, and learning from process feedback. Models trained with Conifer exhibit remarkable improvements in instruction-following abilities, especially for instructions with complex constraints. On several instruction-following benchmarks, our 7B model outperforms the state-of-the-art open-source 7B models, even exceeds the performance of models 10 times larger on certain metrics. All the code and Conifer dataset are available at https://www.github.com/ConiferLM/Conifer.

4/4/2024

💬

From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models

Qianyu He, Jie Zeng, Qianxi He, Jiaqing Liang, Yanghua Xiao

It is imperative for Large language models (LLMs) to follow instructions with elaborate requirements (i.e. Complex Instructions Following). Yet, it remains under-explored how to enhance the ability of LLMs to follow complex instructions with multiple constraints. To bridge the gap, we initially study what training data is effective in enhancing complex constraints following abilities. We found that training LLMs with instructions containing multiple constraints enhances their understanding of complex instructions, especially those with lower complexity levels. The improvement can even generalize to compositions of out-of-domain constraints. Additionally, we further propose methods addressing how to obtain and utilize the effective training data. Finally, we conduct extensive experiments to prove the effectiveness of our methods in terms of overall performance and training efficiency. We also demonstrate that our methods improve models' ability to follow instructions generally and generalize effectively across out-of-domain, in-domain, and adversarial settings, while maintaining general capabilities.

6/19/2024

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Bosi Wen, Pei Ke, Xiaotao Gu, Lindong Wu, Hao Huang, Jinfeng Zhou, Wenchuang Li, Binxin Hu, Wendy Gao, Jiaxin Xu, Yiming Liu, Jie Tang, Hongning Wang, Minlie Huang

Instruction following is one of the fundamental capabilities of large language models (LLMs). As the ability of LLMs is constantly improving, they have been increasingly applied to deal with complex human instructions in real-world scenarios. Therefore, how to evaluate the ability of complex instruction-following of LLMs has become a critical research problem. Existing benchmarks mainly focus on modeling different types of constraints in human instructions while neglecting the composition of different constraints, which is an indispensable constituent in complex instructions. To this end, we propose ComplexBench, a benchmark for comprehensively evaluating the ability of LLMs to follow complex instructions composed of multiple constraints. We propose a hierarchical taxonomy for complex instructions, including 4 constraint types, 19 constraint dimensions, and 4 composition types, and manually collect a high-quality dataset accordingly. To make the evaluation reliable, we augment LLM-based evaluators with rules to effectively verify whether generated texts can satisfy each constraint and composition. Furthermore, we obtain the final evaluation score based on the dependency structure determined by different composition types. ComplexBench identifies significant deficiencies in existing LLMs when dealing with complex instructions with multiple constraints composition.

7/12/2024

💬

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Yuxin Jiang, Yufei Wang, Xingshan Zeng, Wanjun Zhong, Liangyou Li, Fei Mi, Lifeng Shang, Xin Jiang, Qun Liu, Wei Wang

The ability to follow instructions is crucial for Large Language Models (LLMs) to handle various real-world applications. Existing benchmarks primarily focus on evaluating pure response quality, rather than assessing whether the response follows constraints stated in the instruction. To fill this research gap, in this paper, we propose FollowBench, a Multi-level Fine-grained Constraints Following Benchmark for LLMs. FollowBench comprehensively includes five different types (i.e., Content, Situation, Style, Format, and Example) of fine-grained constraints. To enable a precise constraint following estimation on diverse difficulties, we introduce a Multi-level mechanism that incrementally adds a single constraint to the initial instruction at each increased level. To assess whether LLMs' outputs have satisfied every individual constraint, we propose to prompt strong LLMs with constraint-evolution paths to handle challenging open-ended instructions. By evaluating 13 closed-source and open-source popular LLMs on FollowBench, we highlight the weaknesses of LLMs in instruction following and point towards potential avenues for future work. The data and code are publicly available at https://github.com/YJiangcm/FollowBench.

6/6/2024