Mandivarapu, Jaya Krishna and Camp, Blake and Estrada, Rolando (2020) Self-Net: Lifelong Learning via Continual Self-Modeling. Frontiers in Artificial Intelligence, 3. ISSN 2624-8212
pubmed-zip/versions/1/package-entries/frai-03-00019/frai-03-00019.pdf - Published Version
Abstract
Learning a set of tasks over time, also known as continual learning (CL), is one of the most challenging problems in artificial intelligence. While recent approaches achieve some degree of CL in deep neural networks, they either (1) store a new network (or an equivalent number of parameters) for each new task, (2) store training data from previous tasks, or (3) restrict the network's ability to learn new tasks. To address these issues, we propose a novel framework, Self-Net, that uses an autoencoder to learn a set of low-dimensional representations of the weights learned for different tasks. We demonstrate that these low-dimensional vectors can then be used to generate high-fidelity recollections of the original weights. Self-Net can incorporate new tasks over time with little retraining, minimal loss in performance for older tasks, and without storing prior training data. We show that our technique achieves over 10X storage compression in a continual fashion, and that it outperforms state-of-the-art approaches on numerous datasets, including continual versions of MNIST, CIFAR10, CIFAR100, Atari, and task-incremental CORe50. To the best of our knowledge, we are the first to use autoencoders to sequentially encode sets of network weights to enable continual learning.
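The core idea in the abstract (compress each task's weights into a short code, then regenerate them on demand) can be illustrated with a minimal sketch. This is not the authors' implementation: a truncated SVD is used here as a linear stand-in for the trained autoencoder, and the "task weights" are synthetic vectors drawn from a shared low-rank structure, which is an assumption made so that a linear codec is sufficient. All names (`fit_codec`, `make_task_weights`, `latent_dim`, and so on) are hypothetical.

```python
import numpy as np

def fit_codec(weight_rows, latent_dim):
    # Truncated SVD: the top right-singular vectors serve as a tied linear
    # encoder/decoder. Stand-in for a trained autoencoder (assumption).
    _, _, vt = np.linalg.svd(weight_rows, full_matrices=False)
    return vt[:latent_dim]                      # shape (latent_dim, weight_dim)

def encode(basis, weights):
    return weights @ basis.T                    # one short code per task

def decode(basis, codes):
    return codes @ basis                        # approximate "recollection"

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

rng = np.random.default_rng(0)
weight_dim, latent_dim = 256, 4

# Synthetic, hypothetical task weights: shared low-rank structure plus noise.
mixing = rng.normal(size=(latent_dim, weight_dim))

def make_task_weights(n_tasks):
    factors = rng.normal(size=(n_tasks, latent_dim))
    noise = 0.01 * rng.normal(size=(n_tasks, weight_dim))
    return factors @ mixing + noise

# Step 1: encode the weights of 19 earlier tasks; keep only codes and codec.
old_weights = make_task_weights(19)
basis = fit_codec(old_weights, latent_dim)
codes = encode(basis, old_weights)

# Step 2: a new task arrives. Refit on recollections of the old weights plus
# the new weights, so the original old weights are not needed at this point.
new_weights = make_task_weights(1)
training_rows = np.vstack([decode(basis, codes), new_weights])
basis = fit_codec(training_rows, latent_dim)
codes = encode(basis, training_rows)

# Evaluation only: compare recollections against the true weights.
restored = decode(basis, codes)
truth = np.vstack([old_weights, new_weights])
similarities = [cosine(r, w) for r, w in zip(restored, truth)]

# Storage: shared codec plus per-task codes versus all raw weight vectors.
storage_ratio = truth.size / (basis.size + codes.size)

print("min cosine similarity:", min(similarities))
print("storage ratio:", storage_ratio)
```

In this toy setting the storage ratio counts the shared codec as well as the per-task codes, so the saving grows as more tasks share the same codec. Real network weights are not guaranteed to lie near a linear subspace, which is one reason to use a learned autoencoder, as the paper proposes, rather than the linear stand-in used here.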
| Item Type: | Article |
|---|---|
| Subjects: | EP Archives > Multidisciplinary |
| Depositing User: | Managing Editor |
| Date Deposited: | 30 Jan 2023 05:23 |
| Last Modified: | 16 Jul 2024 06:49 |
| URI: | http://research.send4journal.com/id/eprint/1194 |