LiLiuM: eBay's Large Language Models for e-commerce

Read original: arXiv:2406.12023 - Published 6/19/2024 by Christian Herold, Michael Kozielski, Leonid Ekimov, Pavel Petrushkov, Pierre-Yves Vandenbussche, Shahram Khadivi

LiLiuM: eBay's Large Language Models for e-commerce

Overview

This paper presents LiLiuM, eBay's large language models designed for e-commerce applications.
LiLiuM models are trained on eBay's proprietary data to perform various tasks like product search, recommendation, and question answering.
The paper discusses the LiLiuM framework, its different components, and the techniques used to train and deploy the models at scale.

Plain English Explanation

eBay, the popular e-commerce platform, has developed its own large language models called LiLiuM. These models are trained on a vast amount of eBay's own data, including product information, customer reviews, and conversations. The goal is to use these models to power various e-commerce features, such as helping customers find the right products, suggesting relevant items they might be interested in, and answering their questions about products or the platform.

The paper describes the LiLiuM framework and the different components that make it up. It explains the techniques used to train these large language models efficiently and deploy them at scale to serve eBay's customers. The researchers also discuss how they've tailored the models to work well in the e-commerce domain, which may differ from general-purpose language models.

Overall, the LiLiuM project aims to leverage the power of large language models to enhance the customer experience on eBay and make the platform more useful and engaging for shoppers.

Technical Explanation

The paper presents the LiLiuM framework, eBay's large language models for e-commerce. LiLiuM models are trained on eBay's proprietary data, including product information, customer reviews, and conversations, to perform various tasks like product search, recommendation, and question answering.

The paper describes the different components of the LiLiuM framework, including the data preprocessing and cleaning pipelines, the model training and fine-tuning approaches, and the deployment and serving infrastructure. The researchers highlight the techniques used to train the models efficiently at scale, such as distributed training, model parallelism, and mixed-precision computation.

Additionally, the paper discusses how the LiLiuM models are tailored to the e-commerce domain, which may differ from general-purpose language models. This includes incorporating domain-specific knowledge, handling product-related entities and attributes, and adapting the models to e-commerce-specific tasks and user interactions.

The paper also presents experiments and evaluations of the LiLiuM models on various e-commerce-related tasks, demonstrating their effectiveness and performance compared to other baselines.

Critical Analysis

The paper provides a comprehensive overview of the LiLiuM framework and the techniques used to develop large language models for e-commerce applications. The authors highlight the importance of leveraging domain-specific data and tailoring the models to the e-commerce context, which is a crucial aspect often overlooked in general-purpose language models.

However, the paper does not delve into the potential limitations or challenges faced during the development and deployment of the LiLiuM models. For example, it would be valuable to understand how the researchers addressed issues like data bias, model fairness, and the interpretability of the models' decision-making processes, which are important considerations in e-commerce applications.

Additionally, the paper could have provided more insights into the specific use cases and real-world performance of the LiLiuM models, such as their impact on customer satisfaction, conversion rates, or other relevant business metrics. This would help readers better understand the practical implications and benefits of the LiLiuM framework.

Overall, the paper provides a solid technical foundation for the LiLiuM project, but additional research and discussion on the model limitations, fairness, and real-world impact would further strengthen the presentation and contribute to the broader understanding of large language models in e-commerce.

Conclusion

The LiLiuM framework presented in this paper represents eBay's effort to leverage large language models to enhance the e-commerce experience for its customers. By training these models on eBay's proprietary data and tailoring them to the e-commerce domain, the researchers aim to improve product search, recommendation, and question-answering capabilities on the platform.

The technical details provided in the paper outline the key components and techniques used to develop and deploy the LiLiuM models at scale. This work highlights the potential of large language models to be adapted and optimized for specific industry verticals, like e-commerce, where domain-specific knowledge and user interactions can be leveraged to create more effective and engaging customer experiences.

While the paper provides a solid foundation for the LiLiuM project, further research and analysis on model fairness, interpretability, and real-world impact would contribute to a more comprehensive understanding of the potential and limitations of large language models in e-commerce applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

LiLiuM: eBay's Large Language Models for e-commerce

Christian Herold, Michael Kozielski, Leonid Ekimov, Pavel Petrushkov, Pierre-Yves Vandenbussche, Shahram Khadivi

We introduce the LiLiuM series of large language models (LLMs): 1B, 7B, and 13B parameter models developed 100% in-house to fit eBay's specific needs in the e-commerce domain. This gives eBay full control over all aspects of the models including license, data, vocabulary, and architecture. We expect these models to be used as a foundation for fine-tuning and instruction-tuning, eliminating dependencies to external models. The LiLiuM LLMs have been trained on 3 trillion tokens of multilingual text from general and e-commerce domain. They perform similar to the popular LLaMA-2 models on English natural language understanding (NLU) benchmarks. At the same time, we outperform LLaMA-2 on non-English NLU tasks, machine translation and on e-commerce specific downstream tasks. As part of our data mixture, we utilize the newly released RedPajama-V2 dataset for training and share our insights regarding data filtering and deduplication. We also discuss in detail how to serialize structured data for use in autoregressive language modeling. We provide insights on the effects of including code and parallel machine translation data in pre-training. Furthermore, we develop our own tokenizer and model vocabulary, customized towards e-commerce. This way, we can achieve up to 34% speed-up in text generation on eBay-specific downstream tasks compared to LLaMA-2. Finally, in relation to LLM pretraining, we show that checkpoint averaging can further improve over the best individual model checkpoint.

6/19/2024

👀

Investigating LLM Applications in E-Commerce

Chester Palen-Michel, Ruixiang Wang, Yipeng Zhang, David Yu, Canran Xu, Zhe Wu

The emergence of Large Language Models (LLMs) has revolutionized natural language processing in various applications especially in e-commerce. One crucial step before the application of such LLMs in these fields is to understand and compare the performance in different use cases in such tasks. This paper explored the efficacy of LLMs in the e-commerce domain, focusing on instruction-tuning an open source LLM model with public e-commerce datasets of varying sizes and comparing the performance with the conventional models prevalent in industrial applications. We conducted a comprehensive comparison between LLMs and traditional pre-trained language models across specific tasks intrinsic to the e-commerce domain, namely classification, generation, summarization, and named entity recognition (NER). Furthermore, we examined the effectiveness of the current niche industrial application of very large LLM, using in-context learning, in e-commerce specific tasks. Our findings indicate that few-shot inference with very large LLMs often does not outperform fine-tuning smaller pre-trained models, underscoring the importance of task-specific model optimization.Additionally, we investigated different training methodologies such as single-task training, mixed-task training, and LoRA merging both within domain/tasks and between different tasks. Through rigorous experimentation and analysis, this paper offers valuable insights into the potential effectiveness of LLMs to advance natural language processing capabilities within the e-commerce industry.

8/26/2024

A Survey of Large Language Models for European Languages

Wazir Ali, Sampo Pyysalo

Large Language Models (LLMs) have gained significant attention due to their high performance on a wide range of natural language tasks since the release of ChatGPT. The LLMs learn to understand and generate language by training billions of model parameters on vast volumes of text data. Despite being a relatively new field, LLM research is rapidly advancing in various directions. In this paper, we present an overview of LLM families, including LLaMA, PaLM, GPT, and MoE, and the methods developed to create and enhance LLMs for official European Union (EU) languages. We provide a comprehensive summary of common monolingual and multilingual datasets used for pretraining large language models.

8/29/2024

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Xinrun Du, Zhouliang Yu, Songyang Gao, Ding Pan, Yuyang Cheng, Ziyang Ma, Ruibin Yuan, Xingwei Qu, Jiaheng Liu, Tianyu Zheng, Xinchen Luo, Guorui Zhou, Wenhu Chen, Ge Zhang

In this study, we introduce CT-LLM, a 2B large language model (LLM) that illustrates a pivotal shift towards prioritizing the Chinese language in developing LLMs. Uniquely initiated from scratch, CT-LLM diverges from the conventional methodology by primarily incorporating Chinese textual data, utilizing an extensive corpus of 1,200 billion tokens, including 800 billion Chinese tokens, 300 billion English tokens, and 100 billion code tokens. This strategic composition facilitates the model's exceptional proficiency in understanding and processing Chinese, a capability further enhanced through alignment techniques. Demonstrating remarkable performance on the CHC-Bench, CT-LLM excels in Chinese language tasks, and showcases its adeptness in English through SFT. This research challenges the prevailing paradigm of training LLMs predominantly on English corpora and then adapting them to other languages, broadening the horizons for LLM training methodologies. By open-sourcing the full process of training a Chinese LLM, including a detailed data processing procedure with the obtained Massive Appropriate Pretraining Chinese Corpus (MAP-CC), a well-chosen multidisciplinary Chinese Hard Case Benchmark (CHC-Bench), and the 2B-size Chinese Tiny LLM (CT-LLM), we aim to foster further exploration and innovation in both academia and industry, paving the way for more inclusive and versatile language models.

9/16/2024