AutoTimes: Autoregressive Time Series Forecasters via Large Language Models

2402.02370

Published 5/24/2024 by Yong Liu, Guo Qin, Xiangdong Huang, Jianmin Wang, Mingsheng Long

💬

Abstract

Foundation models of time series have not been fully developed due to the limited availability of time series corpora and the underexploration of scalable pre-training. Based on the similar sequential formulation of time series and natural language, increasing research demonstrates the feasibility of leveraging large language models (LLM) for time series. Nevertheless, the inherent autoregressive property and decoder-only architecture of LLMs have not been fully considered, resulting in insufficient utilization of LLM abilities. To further exploit the general-purpose token transition and multi-step generation ability of large language models, we propose AutoTimes to repurpose LLMs as autoregressive time series forecasters, which independently projects time series segments into the embedding space and autoregressively generates future predictions with arbitrary lengths. Compatible with any decoder-only LLMs, the consequent forecaster exhibits the flexibility of the lookback length and scalability of the LLM size. Further, we formulate time series as prompts, extending the context for prediction beyond the lookback window, termed in-context forecasting. By adopting textual timestamps as position embeddings, AutoTimes integrates multimodality for multivariate scenarios. Empirically, AutoTimes achieves state-of-the-art with 0.1% trainable parameters and over 5 times training/inference speedup compared to advanced LLM-based forecasters.

Create account to get full access

Overview

Time series forecasting is an important challenge, but existing approaches have limitations due to the lack of large-scale time series datasets and the underutilization of powerful large language models (LLMs).
The paper proposes a new method called AutoTimes that repurposes LLMs as autoregressive time series forecasters, leveraging their general-purpose token transition and multi-step generation abilities.
AutoTimes exhibits flexible lookback lengths, scalable model sizes, and the ability to incorporate multimodal information like textual timestamps.
Empirically, AutoTimes outperforms advanced LLM-based forecasters while requiring only 0.1% trainable parameters and offering significant speedups.

Plain English Explanation

Time series data, which represents the evolution of a variable over time, is widely used in fields like finance, weather forecasting, and healthcare. However, developing effective time series foundation models has been challenging due to the limited availability of large time series datasets and the lack of exploration into leveraging powerful large language models (LLMs).

The authors of this paper realized that time series data and natural language have a similar sequential structure, so they proposed a new method called AutoTimes that repurposes LLMs to excel at time series forecasting. Unlike previous approaches that did not fully utilize the unique properties of LLMs, AutoTimes takes advantage of their ability to autoregressively generate future predictions and to handle arbitrary-length forecasting windows.

Additionally, AutoTimes allows for in-context forecasting, where the model can consider contextual information beyond just the immediate past values of the time series. This is achieved by formulating the time series as a "prompt" that the LLM can use to generate future predictions. AutoTimes also incorporates multimodal information, such as textual timestamps, to enhance its forecasting capabilities.

The researchers found that AutoTimes outperforms other advanced LLM-based forecasting methods while requiring only a tiny fraction of the trainable parameters (0.1%) and offering significant speedups in both training and inference. This demonstrates the power of repurposing large language models for time series forecasting, a promising direction for the field.

Technical Explanation

The paper proposes a novel method called AutoTimes that leverages the sequential nature of time series data and the powerful capabilities of large language models (LLMs) to tackle the challenge of time series forecasting.

The authors recognize that time series and natural language share a similar sequential structure, which suggests the feasibility of using LLMs for time series tasks. However, they note that the inherent autoregressive property and decoder-only architecture of LLMs have not been fully exploited in previous approaches, leading to suboptimal utilization of their capabilities.

To address this, AutoTimes repurposes LLMs as autoregressive time series forecasters. It independently projects time series segments into the embedding space and then autoregressively generates future predictions with arbitrary lengths. This approach allows for flexible lookback lengths and scalable model sizes, as it is compatible with any decoder-only LLM.

Furthermore, the paper introduces the concept of "in-context forecasting," where the time series is formulated as a prompt that extends the context for prediction beyond the immediate lookback window. This enables the model to consider additional contextual information during the forecasting process.

To handle multivariate scenarios, AutoTimes adopts textual timestamps as position embeddings, effectively integrating multimodal information into the forecasting process.

Empirically, the authors demonstrate that AutoTimes achieves state-of-the-art performance on various benchmarks while requiring only 0.1% trainable parameters and offering over 5 times training and inference speedup compared to advanced LLM-based forecasters. This highlights the efficiency and effectiveness of repurposing large language models for time series forecasting tasks.

Critical Analysis

The paper presents a promising approach to leveraging the power of large language models for time series forecasting, a task that has faced challenges due to the limited availability of large-scale time series datasets and the underexploration of scalable pre-training methods.

One notable strength of the proposed AutoTimes method is its flexibility and scalability. By making use of the autoregressive and multi-step generation capabilities of LLMs, AutoTimes can handle arbitrary lookback lengths and model sizes, enabling it to adapt to a wide range of time series forecasting scenarios.

The incorporation of in-context forecasting and multimodal information, such as textual timestamps, is also a valuable contribution, as it allows the model to leverage contextual cues beyond the immediate past values of the time series. This can be particularly beneficial in complex, real-world forecasting tasks.

However, the paper does not extensively discuss the potential limitations or caveats of the AutoTimes approach. For example, it would be valuable to understand how the method performs on time series with different characteristics (e.g., high-frequency, seasonal, or irregular patterns) or how it compares to specialized time series models in terms of interpretability and explainability.

Additionally, while the empirical results demonstrate impressive performance and efficiency, it would be helpful to see more discussion on the generalization of these findings and whether the benefits of AutoTimes hold true across a broader range of time series datasets and forecasting scenarios.

Overall, the paper presents a compelling and well-executed approach to time series forecasting that leverages the potential of large language models. Further exploration of the method's limitations and robustness, as well as comparisons to state-of-the-art time series forecasting techniques, could provide valuable insights and guide future research in this direction.

Conclusion

The paper introduces a novel method called AutoTimes that repurposes large language models (LLMs) as effective and efficient time series forecasters. By exploiting the autoregressive and multi-step generation capabilities of LLMs, AutoTimes can handle flexible lookback lengths and scale to large model sizes, outperforming advanced LLM-based forecasting approaches.

The key innovations of AutoTimes include its ability to perform in-context forecasting, where the time series is formulated as a prompt to leverage contextual information beyond the immediate past, and its integration of multimodal information, such as textual timestamps, to enhance its forecasting capabilities.

The empirical results demonstrate the power of this approach, with AutoTimes achieving state-of-the-art performance while requiring only a tiny fraction of trainable parameters and offering significant speedups in both training and inference. This suggests that repurposing large language models for time series forecasting is a promising direction that could have widespread implications across various domains that rely on accurate and efficient time series predictions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Large Language Models Are Zero-Shot Time Series Forecasters

Nate Gruver, Marc Finzi, Shikai Qiu, Andrew Gordon Wilson

By encoding time series as a string of numerical digits, we can frame time series forecasting as next-token prediction in text. Developing this approach, we find that large language models (LLMs) such as GPT-3 and LLaMA-2 can surprisingly zero-shot extrapolate time series at a level comparable to or exceeding the performance of purpose-built time series models trained on the downstream tasks. To facilitate this performance, we propose procedures for effectively tokenizing time series data and converting discrete distributions over tokens into highly flexible densities over continuous values. We argue the success of LLMs for time series stems from their ability to naturally represent multimodal distributions, in conjunction with biases for simplicity, and repetition, which align with the salient features in many time series, such as repeated seasonal trends. We also show how LLMs can naturally handle missing data without imputation through non-numerical text, accommodate textual side information, and answer questions to help explain predictions. While we find that increasing model size generally improves performance on time series, we show GPT-4 can perform worse than GPT-3 because of how it tokenizes numbers, and poor uncertainty calibration, which is likely the result of alignment interventions such as RLHF.

6/19/2024

cs.LG

Large Language Models for Time Series: A Survey

Xiyuan Zhang, Ranak Roy Chowdhury, Rajesh K. Gupta, Jingbo Shang

Large Language Models (LLMs) have seen significant use in domains such as natural language processing and computer vision. Going beyond text, image and graphics, LLMs present a significant potential for analysis of time series data, benefiting domains such as climate, IoT, healthcare, traffic, audio and finance. This survey paper provides an in-depth exploration and a detailed taxonomy of the various methodologies employed to harness the power of LLMs for time series analysis. We address the inherent challenge of bridging the gap between LLMs' original text data training and the numerical nature of time series data, and explore strategies for transferring and distilling knowledge from LLMs to numerical time series analysis. We detail various methodologies, including (1) direct prompting of LLMs, (2) time series quantization, (3) aligning techniques, (4) utilization of the vision modality as a bridging mechanism, and (5) the combination of LLMs with tools. Additionally, this survey offers a comprehensive overview of the existing multimodal time series and text datasets and delves into the challenges and future opportunities of this emerging field. We maintain an up-to-date Github repository which includes all the papers and datasets discussed in the survey.

5/8/2024

cs.LG cs.AI cs.CL

TimeCMA: Towards LLM-Empowered Time Series Forecasting via Cross-Modality Alignment

Chenxi Liu, Qianxiong Xu, Hao Miao, Sun Yang, Lingzheng Zhang, Cheng Long, Ziyue Li, Rui Zhao

The widespread adoption of scalable mobile sensing has led to large amounts of time series data for real-world applications. A fundamental application is multivariate time series forecasting (MTSF), which aims to predict future time series values based on historical observations. Existing MTSF methods suffer from limited parameterization and small-scale training data. Recently, Large language models (LLMs) have been introduced in time series, which achieve promising forecasting performance but incur heavy computational costs. To solve these challenges, we propose TimeCMA, an LLM-empowered framework for time series forecasting with cross-modality alignment. We design a dual-modality encoding module with two branches, where the time series encoding branch extracts relatively low-quality yet pure embeddings of time series through an inverted Transformer. In addition, the LLM-empowered encoding branch wraps the same time series as prompts to obtain high-quality yet entangled prompt embeddings via a Pre-trained LLM. Then, we design a cross-modality alignment module to retrieve high-quality and pure time series embeddings from the prompt embeddings. Moreover, we develop a time series forecasting module to decode the aligned embeddings while capturing dependencies among multiple variables for forecasting. Notably, we tailor the prompt to encode sufficient temporal information into a last token and design the last token embedding storage to reduce computational costs. Extensive experiments on real data offer insight into the accuracy and efficiency of the proposed framework.

6/17/2024

cs.LG cs.AI cs.CL

Arrows of Time for Large Language Models

Vassilis Papadopoulos, J'er'emie Wenger, Cl'ement Hongler

We study the probabilistic modeling performed by Autoregressive Large Language Models (LLMs) through the angle of time directionality, addressing a question first raised in (Shannon, 1951). For large enough models, we empirically find a time asymmetry in their ability to learn natural language: a difference in the average log-perplexity when trying to predict the next token versus when trying to predict the previous one. This difference is at the same time subtle and very consistent across various modalities (language, model size, training time, ...). Theoretically, this is surprising: from an information-theoretic point of view, there should be no such difference. We provide a theoretical framework to explain how such an asymmetry can appear from sparsity and computational complexity considerations, and outline a number of perspectives opened by our results.

6/4/2024

cs.LG cs.AI cs.CL