InternLM-Law: An Open Source Chinese Legal Large Language Model

Read original: arXiv:2406.14887 - Published 6/24/2024 by Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin and 2 others

InternLM-Law: An Open Source Chinese Legal Large Language Model

Overview

This paper introduces InternLM-Law, an open-source Chinese legal large language model (LLM) trained on a large corpus of Chinese legal documents.
The goal is to develop a powerful language model that can assist with various legal tasks, such as legal document drafting, legal research, and legal question answering.
The model is trained using state-of-the-art LLM techniques and is designed to capture the unique language and domain-specific knowledge of the Chinese legal system.

Plain English Explanation

InternLM-Law is a new artificial intelligence (AI) system that has been trained on a vast amount of Chinese legal documents. The researchers who developed this system wanted to create a powerful language model that can help with various legal tasks, such as writing legal documents, searching for relevant legal information, and answering legal questions.

The researchers used advanced machine learning techniques to train the InternLM-Law model on this large corpus of legal data. The goal was to capture the unique language and specialized knowledge that is specific to the Chinese legal system. By doing this, the model can now understand and generate text in a way that is tailored to the legal domain, rather than just general language.

This open-source model can be used by legal professionals, researchers, and others who work with Chinese legal materials. It has the potential to significantly improve efficiency and productivity in a variety of legal applications, by automating certain tasks and providing valuable insights and recommendations.

Technical Explanation

The InternLM-Law model is built using state-of-the-art large language model (LLM) techniques. The researchers trained the model on a large corpus of Chinese legal documents, including laws, regulations, court decisions, and other legal materials. This allowed the model to learn the unique vocabulary, syntax, and domain-specific knowledge that is characteristic of the Chinese legal system.

The model's architecture and training process draw inspiration from recent advancements in the field of LLMs, such as the techniques used to develop models like LawGPT, Chinese Tiny LLM, and Legal Documents Drafting. However, the researchers have tailored the model specifically for the Chinese legal domain, which sets it apart from more general-purpose LLMs.

Through extensive experimentation and evaluation, the researchers have demonstrated that InternLM-Law outperforms other LLMs on a variety of Chinese legal tasks, such as legal document summarization, legal question answering, and legal reasoning. The model's strong performance highlights the value of developing domain-specific LLMs that can leverage the unique characteristics of a particular field or language.

Critical Analysis

The researchers acknowledge that while InternLM-Law is a significant step forward, there are still some limitations and areas for further improvement. For example, the model's performance may be impacted by the quality and comprehensiveness of the training data, which can be challenging to obtain and curate for the legal domain.

Additionally, as with many LLMs, there are concerns about potential biases and ethical issues that may arise from the model's use, such as the risk of perpetuating existing biases in the legal system or generating content that could be misused. The researchers emphasize the importance of responsible development and deployment of such powerful AI systems, and encourage further research in this direction.

It is also worth noting that the comparative evaluation of InternLM-Law against other Chinese LLMs, such as those discussed in the survey paper and the comparative evaluation, could provide additional insights and context for understanding the model's strengths and limitations.

Conclusion

The development of InternLM-Law represents an important step forward in the field of domain-specific large language models. By leveraging the unique characteristics of the Chinese legal system, the researchers have created a powerful tool that can assist legal professionals and researchers in a variety of tasks.

While there are still some challenges and areas for improvement, the strong performance of InternLM-Law highlights the potential of tailored LLMs to unlock new capabilities and efficiencies in critical domains like the law. As the field of AI continues to advance, it will be essential to develop models that can effectively capture and apply the specialized knowledge and language of various professions and disciplines.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

InternLM-Law: An Open Source Chinese Legal Large Language Model

Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge

While large language models (LLMs) have showcased impressive capabilities, they struggle with addressing legal queries due to the intricate complexities and specialized expertise required in the legal field. In this paper, we introduce InternLM-Law, a specialized LLM tailored for addressing diverse legal queries related to Chinese laws, spanning from responding to standard legal questions (e.g., legal exercises in textbooks) to analyzing complex real-world legal situations. We meticulously construct a dataset in the Chinese legal domain, encompassing over 1 million queries, and implement a data filtering and processing pipeline to ensure its diversity and quality. Our training approach involves a novel two-stage process: initially fine-tuning LLMs on both legal-specific and general-purpose content to equip the models with broad knowledge, followed by exclusive fine-tuning on high-quality legal data to enhance structured output generation. InternLM-Law achieves the highest average performance on LawBench, outperforming state-of-the-art models, including GPT-4, on 13 out of 20 subtasks. We make InternLM-Law and our dataset publicly available to facilitate future research in applying LLMs within the legal domain.

6/24/2024

💬

LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model

Zhi Zhou, Jiang-Xin Shi, Peng-Xiao Song, Xiao-Wen Yang, Yi-Xuan Jin, Lan-Zhe Guo, Yu-Feng Li

Large language models (LLMs), including both proprietary and open-source models, have showcased remarkable capabilities in addressing a wide range of downstream tasks. Nonetheless, when it comes to practical Chinese legal tasks, these models fail to meet the actual requirements. Proprietary models do not ensure data privacy for sensitive legal cases, while open-source models demonstrate unsatisfactory performance due to their lack of legal knowledge. To address this problem, we introduce LawGPT, the first open-source model specifically designed for Chinese legal applications. LawGPT comprises two key components: legal-oriented pre-training and legal supervised fine-tuning. Specifically, we employ large-scale Chinese legal documents for legal-oriented pre-training to incorporate legal domain knowledge. To further improve the model's performance on downstream legal tasks, we create a knowledge-driven instruction dataset for legal supervised fine-tuning. Our experimental results demonstrate that LawGPT outperforms the open-source LLaMA 7B model. Our code and resources are publicly available at https://github.com/pengxiao-song/LaWGPT and have received 5.7K stars on GitHub.

6/10/2024

LawLuo: A Chinese Law Firm Co-run by LLM Agents

Jingyun Sun, Chengxiao Dai, Zhongze Luo, Yangbo Chang, Yang Li

Large Language Models (LLMs) demonstrate substantial potential in delivering legal consultation services to users without a legal background, attributed to their superior text comprehension and generation capabilities. Nonetheless, existing Chinese legal LLMs limit interaction to a single model-user dialogue, unlike the collaborative consultations typical of law firms, where multiple staff members contribute to a single consultation. This limitation prevents an authentic consultation experience. Additionally, extant Chinese legal LLMs suffer from critical limitations: (1) insufficient control over the quality of instruction fine-tuning data; (2) increased model hallucination resulting from users' ambiguous queries; and (3) a reduction in the model's ability to follow instructions over multiple dialogue turns. In response to these challenges, we propose a novel legal dialogue framework that leverages the collaborative capabilities of multiple LLM agents, termed LawLuo. This framework encompasses four agents: a receptionist, a lawyer, a secretary, and a boss, each responsible for different functionalities, collaboratively providing a comprehensive legal consultation to users. Additionally, we constructed two high-quality legal dialogue datasets, KINLED and MURLED, and fine-tuned ChatGLM-3-6b using these datasets. We propose a legal query clarification algorithm called ToLC. Experimental results demonstrate that LawLuo outperforms baseline LLMs, including GPT-4, across three dimensions: lawyer-like language style, the usefulness of legal advice, and the accuracy of legal knowledge. Our code and datasets are available at https://github.com/NEFUJing/LawLuo.

7/24/2024

LawLLM: Law Large Language Model for the US Legal System

Dong Shu, Haoran Zhao, Xukun Liu, David Demeter, Mengnan Du, Yongfeng Zhang

In the rapidly evolving field of legal analytics, finding relevant cases and accurately predicting judicial outcomes are challenging because of the complexity of legal language, which often includes specialized terminology, complex syntax, and historical context. Moreover, the subtle distinctions between similar and precedent cases require a deep understanding of legal knowledge. Researchers often conflate these concepts, making it difficult to develop specialized techniques to effectively address these nuanced tasks. In this paper, we introduce the Law Large Language Model (LawLLM), a multi-task model specifically designed for the US legal domain to address these challenges. LawLLM excels at Similar Case Retrieval (SCR), Precedent Case Recommendation (PCR), and Legal Judgment Prediction (LJP). By clearly distinguishing between precedent and similar cases, we provide essential clarity, guiding future research in developing specialized strategies for these tasks. We propose customized data preprocessing techniques for each task that transform raw legal data into a trainable format. Furthermore, we also use techniques such as in-context learning (ICL) and advanced information retrieval methods in LawLLM. The evaluation results demonstrate that LawLLM consistently outperforms existing baselines in both zero-shot and few-shot scenarios, offering unparalleled multi-task capabilities and filling critical gaps in the legal domain.

8/1/2024