On Sample Selection for Continual Learning: a Video Streaming Case Study

2405.10290

Published 5/17/2024 by Alexander Dietmuller, Romain Jacob, Laurent Vanbever

Abstract

Machine learning (ML) is a powerful tool to model the complexity of communication networks. As networks evolve, we cannot only train once and deploy. Retraining models, known as continual learning, is necessary. Yet, to date, there is no established methodology to answer the key questions: With which samples to retrain? When should we retrain? We address these questions with the sample selection system Memento, which maintains a training set with the most useful samples to maximize sample space coverage. Memento particularly benefits rare patterns -- the notoriously long tail in networking -- and allows assessing rationally when retraining may help, i.e., when the coverage changes. We deployed Memento on Puffer, the live-TV streaming project, and achieved a 14% reduction of stall time, 3.5x the improvement of random sample selection. Finally, Memento does not depend on a specific model architecture; it is likely to yield benefits in other ML-based networking applications.

Overview

Machine learning (ML) is a powerful tool for modeling complex communication networks
As networks evolve, retraining models (continual learning) is necessary, but there are challenges around when and how to retrain
The paper introduces Memento, a sample selection system that maintains a training set to maximize coverage and improve model performance, especially for rare patterns

Plain English Explanation

Machine learning is a great way to understand and model how communication networks work. But as these networks change and evolve over time, we can't just train the models once and be done. We need to keep retraining them, in a process called continual learning.

The big questions are: Which samples should we use to retrain the models? And when is the right time to do the retraining? Memento addresses these challenges. It maintains a training set that covers the most important and useful samples, to keep the models as accurate as possible. This is especially helpful for capturing rare patterns in network data - the "long tail" that's often overlooked.

Memento also helps determine when retraining would be most beneficial, by tracking how the sample coverage changes over time. The researchers deployed Memento on Puffer, a live TV streaming project, and saw a 14% reduction in video stall time, which is 3.5 times the improvement achieved by randomly selecting samples for retraining.

The key is that Memento isn't tied to any specific machine learning model architecture. It should provide benefits for a wide range of ML-based networking applications.

Technical Explanation

The paper introduces Memento, a sample selection system designed to address the challenges of continual learning in machine learning models for communication networks. As networks evolve, retraining the models is necessary, but the authors identify two key questions that need to be answered: which samples should be used for retraining, and when should retraining be performed.

Memento maintains a training set that maximizes coverage of the sample space, ensuring the model is exposed to the most useful and representative data. This is particularly beneficial for capturing rare patterns - the "long tail" - that are often overlooked but important in networking applications.
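To make the idea of coverage-oriented selection concrete, here is a minimal sketch in Python. It is not the paper's algorithm: the one-dimensional samples, the fixed histogram bins, and the names `bin_of` and `select_samples` are all assumptions made for illustration. The sketch keeps a bounded memory and, whenever it overflows, evicts a sample from the most crowded region, so rare samples tend to survive.

```python
import random
from collections import defaultdict

def bin_of(value, lo=0.0, hi=1.0, n_bins=10):
    """Map a scalar sample to a histogram bin index, clamped to the valid range."""
    idx = int((value - lo) / (hi - lo) * n_bins)
    return min(max(idx, 0), n_bins - 1)

def select_samples(stream, capacity, seed=0):
    """Keep at most `capacity` samples, always evicting from the most crowded bin.

    Hypothetical stand-in for a density-aware selection policy: crowded regions
    lose samples first, which spreads the memory across the sample space.
    """
    rng = random.Random(seed)
    bins = defaultdict(list)
    size = 0
    for value in stream:
        bins[bin_of(value)].append(value)
        size += 1
        if size > capacity:
            crowded = max(bins, key=lambda b: len(bins[b]))
            victims = bins[crowded]
            victims.pop(rng.randrange(len(victims)))  # drop one sample at random
            size -= 1
    return {b: vals for b, vals in bins.items() if vals}

# Skewed toy stream: 990 common samples and 10 rare ones (the "long tail").
stream = [0.15] * 990 + [0.95] * 10
random.Random(1).shuffle(stream)
memory = select_samples(stream, capacity=100)
print({b: len(vals) for b, vals in sorted(memory.items())})
```

On this toy stream the memory ends up holding all 10 rare samples alongside 90 common ones, whereas uniform random selection of 100 samples would keep only about one rare sample on average.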

The researchers deployed Memento on the Puffer live-TV streaming project and saw a 14% reduction in stall time, which is 3.5 times the improvement obtained with random sample selection. Crucially, Memento is not tied to any specific model architecture, so it is likely to provide benefits across a range of ML-based networking applications.
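The two headline numbers can be related with simple arithmetic. The snippet below assumes that "3.5x the improvement of random sample selection" means the stall-time reduction with Memento is 3.5 times the reduction with random selection; the implied baseline figure is derived from that reading, not quoted from the paper.

```python
memento_reduction_pct = 14.0   # stall-time reduction reported for Memento
improvement_ratio = 3.5        # Memento's improvement relative to random selection

# Implied stall-time reduction for random sample selection under this reading.
random_reduction_pct = memento_reduction_pct / improvement_ratio
print(f"Implied reduction with random selection: {random_reduction_pct:.1f}%")
```

Under this reading, random sample selection would correspond to a stall-time reduction of roughly 4%.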

Critical Analysis

The paper presents a well-designed and thoughtful approach to the challenges of continual learning in communication network modeling. The Memento system offers a rational way to determine when retraining would be most beneficial, by tracking changes in sample coverage over time.
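One way to picture a coverage-based retraining trigger is sketched below. This is an illustrative assumption, not the paper's criterion: it represents coverage as a histogram over fixed bins, measures change with the total variation distance, and uses a hypothetical `threshold` of 0.1.

```python
def normalize(counts):
    """Turn raw bin counts into a probability distribution."""
    total = sum(counts)
    if total == 0:
        raise ValueError("empty histogram")
    return [c / total for c in counts]

def coverage_shift(old_counts, new_counts):
    """Total variation distance between two histograms over the same bins.

    Returns 0.0 for identical coverage and 1.0 for completely disjoint coverage.
    """
    p, q = normalize(old_counts), normalize(new_counts)
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))

def should_retrain(old_counts, new_counts, threshold=0.1):
    """Suggest retraining only when coverage moved by more than `threshold`."""
    return coverage_shift(old_counts, new_counts) > threshold

at_last_training = [50, 30, 20, 0]   # bin counts when the model was last trained
almost_unchanged = [49, 31, 20, 0]   # small fluctuation, same regions covered
new_pattern = [40, 25, 15, 20]       # a previously empty region is now populated

print(should_retrain(at_last_training, almost_unchanged))
print(should_retrain(at_last_training, new_pattern))
```

The design choice here is that retraining is tied to a measurable change in what the training set covers rather than to a fixed schedule; a suitable distance measure and threshold would depend on the application.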

One limitation noted in the paper is that Memento does not explicitly handle concept drift, where the underlying data distribution changes. Adaptive Memory Replay and Liquid Ensemble Selection are two continual learning techniques that may be able to address this issue.

Additionally, the paper does not explore the computational costs and efficiency of Memento compared to other continual learning approaches like Grasp Rehearsal or Continual Learning in the Presence of Repetition. These tradeoffs would be important to understand for real-world deployment.

Overall, the Memento system appears to be a valuable contribution to the continual learning literature, with potential applications beyond just communication network modeling. Further research on its scalability and handling of concept drift would help solidify its position as a go-to approach for ML-powered networking applications.

Conclusion

The paper introduces Memento, a sample selection system for continual learning in communication network modeling. By maintaining a training set that maximizes coverage of the sample space, Memento helps ensure models stay accurate as networks evolve, especially for capturing rare patterns.

Memento also provides a rational way to determine when retraining would be most beneficial, by tracking changes in sample coverage over time. The researchers demonstrated its effectiveness in a live TV streaming project, with a 14% reduction in stall time, 3.5 times the improvement obtained with random sample selection.

Crucially, Memento is not tied to any specific machine learning model architecture, making it a versatile tool for a wide range of ML-based networking applications. While there are some areas for further research, Memento represents an important step forward in addressing the challenges of continual learning in communication network modeling.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Watch Your Step: Optimal Retrieval for Continual Learning at Scale

Truman Hickok, Dhireesha Kudithipudi

In continual learning, a model learns incrementally over time while minimizing interference between old and new tasks. One of the most widely used approaches in continual learning is referred to as replay. Replay methods support interleaved learning by storing past experiences in a replay buffer. Although there are methods for selectively constructing the buffer and reprocessing its contents, there is limited exploration of the problem of selectively retrieving samples from the buffer. Current solutions have been tested in limited settings and, more importantly, in isolation. Existing work has also not explored the impact of duplicate replays on performance. In this work, we propose a framework for evaluating selective retrieval strategies, categorized by simple, independent class- and sample-selective primitives. We evaluated several combinations of existing strategies for selective retrieval and present their performances. Furthermore, we propose a set of strategies to prevent duplicate replays and explore whether new samples with low loss values can be learned without replay. In an effort to match our problem setting to a realistic continual learning pipeline, we restrict our experiments to a setting involving a large, pre-trained, open vocabulary object detection model, which is fully fine-tuned on a sequence of 15 datasets.

5/13/2024

cs.CV

Liquid Ensemble Selection for Continual Learning

Carter Blair, Ben Armstrong, Kate Larson

Continual learning aims to enable machine learning models to continually learn from a shifting data distribution without forgetting what has already been learned. Such shifting distributions can be broken into disjoint subsets of related examples; by training each member of an ensemble on a different subset it is possible for the ensemble as a whole to achieve much higher accuracy with less forgetting than a naive model. We address the problem of selecting which models within an ensemble should learn on any given data, and which should predict. By drawing on work from delegative voting we develop an algorithm for using delegation to dynamically select which models in an ensemble are active. We explore a variety of delegation methods and performance metrics, ultimately finding that delegation is able to provide a significant performance boost over naive learning in the face of distribution shifts.

5/14/2024

cs.LG cs.AI

Adaptive Memory Replay for Continual Learning

James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogerio Feris, Zsolt Kira, Leonid Karlinsky

Foundation Models (FMs) have become the hallmark of modern AI, however, these models are trained on massive data, leading to financially expensive training. Updating FMs as new data becomes available is important, however, can lead to 'catastrophic forgetting', where models underperform on tasks related to data sub-populations observed too long ago. This continual learning (CL) phenomenon has been extensively studied, but primarily in a setting where only a small amount of past data can be stored. We advocate for the paradigm where memory is abundant, allowing us to keep all previous data, but computational resources are limited. In this setting, traditional replay-based CL approaches are outperformed by a simple baseline which replays past data selected uniformly at random, indicating that this setting necessitates a new approach. We address this by introducing a framework of adaptive memory replay for continual learning, where sampling of past data is phrased as a multi-armed bandit problem. We utilize Boltzmann sampling to derive a method which dynamically selects past data for training conditioned on the current task, assuming full data access and emphasizing training efficiency. Through extensive evaluations on both vision and language pre-training tasks, we demonstrate the effectiveness of our approach, which maintains high performance while reducing forgetting by up to 10% at no training efficiency cost.

4/22/2024

cs.LG cs.CL cs.CV

Online Continual Learning of Video Diffusion Models From a Single Video Stream

Jason Yoo, Dylan Green, Geoff Pleiss, Frank Wood

Diffusion models have shown exceptional capabilities in generating realistic videos. Yet, their training has been predominantly confined to offline environments where models can repeatedly train on i.i.d. data to convergence. This work explores the feasibility of training diffusion models from a semantically continuous video stream, where correlated video frames sequentially arrive one at a time. To investigate this, we introduce two novel continual video generative modeling benchmarks, Lifelong Bouncing Balls and Windows 95 Maze Screensaver, each containing over a million video frames generated from navigating stationary environments. Surprisingly, our experiments show that diffusion models can be effectively trained online using experience replay, achieving performance comparable to models trained with i.i.d. samples given the same number of gradient steps.

6/10/2024

cs.CV cs.LG