On Sample Selection for Continual Learning: a Video Streaming Case Study
Abstract
Machine learning (ML) is a powerful tool to model the complexity of communication networks. As networks evolve, we cannot only train once and deploy. Retraining models, known as continual learning, is necessary. Yet, to date, there is no established methodology to answer the key questions: With which samples to retrain? When should we retrain?
We address these questions with the sample selection system Memento, which maintains a training set with the “most useful” samples to maximize sample space coverage. Memento particularly benefits rare patterns—the notoriously long “tail” in networking—and allows assessing rationally when retraining may help, i.e., when the coverage changes.
We deployed Memento on Puffer, the live-TV streaming project, and achieved a 14% reduction of stall time, 3.5× the improvement of random sample selection. Memento is model-agnostic and can be applied beyond video streaming.
People
BibTex
@ARTICLE{dietmüller2024sample,
copyright = {Creative Commons Attribution-ShareAlike 4.0 International},
doi = {10.3929/ethz-b-000677081},
year = {2024-04},
volume = {54},
type = {Journal Article},
journal = {ACM SIGCOMM Computer Communication Review},
author = {Dietmüller, Alexander and Jacob, Romain and Vanbever, Laurent},
abstract = {Machine learning (ML) is a powerful tool to model the complexity of communication networks. As networks evolve, we cannot only train once and deploy. Retraining models, known as continual learning, is necessary. Yet, to date, there is no established methodology to answer the key questions: With which samples to retrain? When should we retrain?We address these questions with the sample selection system Memento, which maintains a training set with the "most useful" samples to maximize sample space coverage. Memento particularly benefits rare patterns—the notoriously long "tail" in networking—and allows assessing rationally when retraining may help, i.e., when the coverage changes.We deployed Memento on Puffer, the live-TV streaming project, and achieved a 14% reduction of stall time, 3.5× the improvement of random sample selection. Memento is model-agnostic and can be applied beyond video streaming.},
issn = {0146-4833},
keywords = {video streaming; Machine Learning; Continual learning},
language = {en},
publisher = {Association for Computing Machinery},
number = {2},
title = {On Sample Selection for Continual Learning: a Video Streaming Case Study},
PAGES = {10 - 25}
}
Research Collection: 20.500.11850/677081