Synergy-of-Thoughts: Eliciting Efficient Reasoning in Hybrid Language Models

2402.02563

Published 5/24/2024 by Yu Shang, Yu Li, Fengli Xu, Yong Li

💬

Abstract

Large language models (LLMs) have shown impressive emergent abilities in a wide range of tasks, but still face challenges in handling complex reasoning problems. Previous works like chain-of-thought (CoT) and tree-of-thoughts (ToT) have predominately focused on enhancing accuracy, but overlook the rapidly increasing token cost, which could be particularly problematic for open-ended real-world tasks with huge solution spaces. Motivated by the dual process theory of human cognition, we propose Synergy of Thoughts (SoT) to unleash the synergistic potential of hybrid LLMs for efficient reasoning. By default, SoT uses smaller-scale language models to generate multiple low-cost reasoning thoughts, which resembles the parallel intuitions produced by System 1. If these intuitions exhibit conflicts, SoT will invoke the reflective reasoning of scaled-up language models to emulate the intervention of System 2, which will override the intuitive thoughts and rectify the reasoning process. This framework is model-agnostic and training-free, which can be flexibly implemented with various off-the-shelf LLMs. Experiments on six representative reasoning tasks show that SoT substantially reduces the token cost by 38.3%-75.1%, and simultaneously achieves state-of-the-art reasoning accuracy and solution diversity. Notably, the average token cost reduction on open-ended tasks reaches up to 69.1%. Code repo with all prompts will be released upon publication.

Create account to get full access

Overview

Large language models (LLMs) have impressive abilities, but struggle with complex reasoning tasks
Previous approaches like chain-of-thought and tree-of-thoughts focus on improving accuracy, but don't address the rapidly increasing token costs
The paper proposes a new approach called "Synergy of Thoughts" (SoT) to enable efficient reasoning with hybrid LLMs

Plain English Explanation

The paper explores a new way to enhance the reasoning capabilities of large language models (LLMs). LLMs are powerful AI systems that can perform a wide range of tasks, but they still struggle when it comes to complex problem-solving and reasoning. Previous methods, like chain-of-thought and tree-of-thoughts, have tried to improve the accuracy of LLMs in these areas, but they often come with a significant increase in the number of tokens (basically, the amount of text) required to reach a solution.

The researchers behind this paper were inspired by the "dual process theory" of human cognition, which suggests that we have two types of thinking: a fast, intuitive "System 1" and a slower, more reflective "System 2." The new approach, called "Synergy of Thoughts" (SoT), aims to combine these two types of thinking within an LLM system.

The basic idea is to use smaller, more efficient language models to generate multiple low-cost "intuitive" thoughts, similar to the parallel intuitions produced by System 1. If these initial thoughts conflict or seem problematic, the system will then invoke a larger, more powerful language model to step in and refine the reasoning, similar to how System 2 would override and correct System 1 in human cognition.

This SoT framework is designed to be flexible and can work with a variety of different LLM models. The researchers tested it on several challenging reasoning tasks and found that it could substantially reduce the token cost (by 38-75%) while still achieving state-of-the-art performance in terms of accuracy and solution diversity. This is especially important for open-ended, real-world tasks where the space of possible solutions is vast.

Technical Explanation

The paper proposes a new framework called "Synergy of Thoughts" (SoT) to enhance the reasoning capabilities of large language models (LLMs) in a more efficient manner. The key insight is to leverage the dual process theory of human cognition, which suggests that we have two types of thinking: a fast, intuitive "System 1" and a slower, more reflective "System 2."

In the SoT framework, smaller-scale language models are used to generate multiple low-cost "intuitive" thoughts, resembling the parallel intuitions produced by System 1. If these initial thoughts exhibit conflicts, the system will then invoke a scaled-up language model to emulate the intervention of System 2, which will override the intuitive thoughts and rectify the reasoning process.

This approach is motivated by the observation that previous methods, such as chain-of-thought and tree-of-thoughts, have focused primarily on improving accuracy, but have overlooked the rapidly increasing token cost, which can be particularly problematic for open-ended, real-world tasks with huge solution spaces.

The researchers evaluated the SoT framework on six representative reasoning tasks and found that it can substantially reduce the token cost by 38.3%-75.1% while simultaneously achieving state-of-the-art reasoning accuracy and solution diversity. Notably, the average token cost reduction on open-ended tasks reaches up to 69.1%.

Critical Analysis

The paper presents a novel and promising approach to enhancing the reasoning capabilities of LLMs in a more efficient manner. The key strength of the SoT framework is its ability to leverage the complementary strengths of smaller and larger language models, drawing inspiration from the dual process theory of human cognition.

One potential limitation is that the paper does not provide a deep analysis of the types of reasoning tasks and problem domains where the SoT framework might be most effective. It would be valuable to understand the characteristics of tasks that are particularly well-suited for this approach, as well as any potential limitations or edge cases.

Additionally, the paper does not explore the sensitivity of the SoT framework to the specific choice and configuration of the smaller and larger language models used. It would be interesting to see how the performance and efficiency of the system might vary with different model architectures and sizes.

Another area for further research could be investigating ways to further improve the integration and synchronization between the "intuitive" and "reflective" components of the SoT framework. For example, exploring adaptive mechanisms to dynamically adjust the interplay between the two components based on task complexity or other contextual factors.

Overall, the SoT framework represents a promising step forward in the quest to develop more efficient and capable reasoning systems based on large language models. The insights and techniques presented in this paper could inspire further advancements in this important area of AI research.

Conclusion

The paper introduces a novel framework called "Synergy of Thoughts" (SoT) that aims to enhance the reasoning capabilities of large language models (LLMs) in a more efficient manner. Inspired by the dual process theory of human cognition, the SoT framework leverages the synergistic potential of hybrid LLMs, using smaller-scale models to generate low-cost intuitive thoughts and invoking larger models to refine the reasoning process when needed.

The key advantage of the SoT approach is its ability to substantially reduce the token cost (by 38-75%) while still maintaining state-of-the-art performance in terms of reasoning accuracy and solution diversity. This is particularly important for open-ended, real-world tasks where the solution space is vast and the token budget is a critical constraint.

The insights and techniques presented in this paper represent a significant contribution to the ongoing efforts to develop more efficient and capable reasoning systems based on large language models. As AI systems continue to play an increasingly important role in our lives, advancements like the SoT framework will be crucial in unlocking the full potential of these powerful technologies while addressing the practical challenges of their deployment.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

📊

Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts

Leonardo Ranaldi, Giulia Pucci, Federico Ranaldi, Elena Sofia Ruzzetti, Fabio Massimo Zanzotto

Reasoning methods, best exemplified by the well-known Chain-of-Thought (CoT), empower the reasoning abilities of Large Language Models (LLMs) by eliciting them to solve complex tasks in a step-by-step manner. Although they are achieving significant success, the ability to deliver multi-step reasoning remains limited to English because of the imbalance in the distribution of pre-training data, which makes other languages a barrier. In this paper, we propose Cross-lingual Tree-of-Thoughts (Cross-ToT), a method for aligning Cross-lingual CoT reasoning across languages. The proposed method, through a self-consistent cross-lingual prompting mechanism inspired by the Tree-of-Thoughts approach, provides multi-step reasoning paths in different languages that, during the steps, lead to the final solution. Experimental evaluations show that our method significantly outperforms existing prompting methods by reducing the number of interactions and achieving state-of-the-art performance.

6/24/2024

cs.CL cs.AI

💬

Multimodal Chain-of-Thought Reasoning in Language Models

Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola

Large language models (LLMs) have shown impressive performance on complex reasoning by leveraging chain-of-thought (CoT) prompting to generate intermediate reasoning chains as the rationale to infer the answer. However, existing CoT studies have primarily focused on the language modality. We propose Multimodal-CoT that incorporates language (text) and vision (images) modalities into a two-stage framework that separates rationale generation and answer inference. In this way, answer inference can leverage better generated rationales that are based on multimodal information. Experimental results on ScienceQA and A-OKVQA benchmark datasets show the effectiveness of our proposed approach. With Multimodal-CoT, our model under 1 billion parameters achieves state-of-the-art performance on the ScienceQA benchmark. Our analysis indicates that Multimodal-CoT offers the advantages of mitigating hallucination and enhancing convergence speed. Code is publicly available at https://github.com/amazon-science/mm-cot.

5/21/2024

cs.CL cs.AI cs.CV

📉

Faithful Logical Reasoning via Symbolic Chain-of-Thought

Jundong Xu, Hao Fei, Liangming Pan, Qian Liu, Mong-Li Lee, Wynne Hsu

While the recent Chain-of-Thought (CoT) technique enhances the reasoning ability of large language models (LLMs) with the theory of mind, it might still struggle in handling logical reasoning that relies much on symbolic expressions and rigid deducing rules. To strengthen the logical reasoning capability of LLMs, we propose a novel Symbolic Chain-of-Thought, namely SymbCoT, a fully LLM-based framework that integrates symbolic expressions and logic rules with CoT prompting. Technically, building upon an LLM, SymbCoT 1) first translates the natural language context into the symbolic format, and then 2) derives a step-by-step plan to solve the problem with symbolic logical rules, 3) followed by a verifier to check the translation and reasoning chain. Via thorough evaluations on 5 standard datasets with both First-Order Logic and Constraint Optimization symbolic expressions, SymbCoT shows striking improvements over the CoT method consistently, meanwhile refreshing the current state-of-the-art performances. We further demonstrate that our system advances in more faithful, flexible, and explainable logical reasoning. To our knowledge, this is the first to combine symbolic expressions and rules into CoT for logical reasoning with LLMs. Code is open at https://github.com/Aiden0526/SymbCoT.

6/12/2024

cs.CL

Abstraction-of-Thought Makes Language Models Better Reasoners

Ruixin Hong, Hongming Zhang, Xiaoman Pan, Dong Yu, Changshui Zhang

Abstract reasoning, the ability to reason from the abstract essence of a problem, serves as a key to generalization in human reasoning. However, eliciting language models to perform reasoning with abstraction remains unexplored. This paper seeks to bridge this gap by introducing a novel structured reasoning format called Abstraction-of-Thought (AoT). The uniqueness of AoT lies in its explicit requirement for varying levels of abstraction within the reasoning process. This approach could elicit language models to first contemplate on the abstract level before incorporating concrete details, which is overlooked by the prevailing step-by-step Chain-of-Thought (CoT) method. To align models with the AoT format, we present AoT Collection, a generic finetuning dataset consisting of 348k high-quality samples with AoT reasoning processes, collected via an automated and scalable pipeline. We finetune a wide range of language models with AoT Collection and conduct extensive evaluations on 23 unseen tasks from the challenging benchmark Big-Bench Hard. Experimental results indicate that models aligned to AoT reasoning format substantially outperform those aligned to CoT in many reasoning tasks.

6/19/2024

cs.CL cs.AI