Multi-scale tensor network architecture for machine learning

Reyes, J A and Stoudenmire, E M (2021) Multi-scale tensor network architecture for machine learning. Machine Learning: Science and Technology, 2 (3). 035036. ISSN 2632-2153

[thumbnail of Reyes_2021_Mach._Learn.__Sci._Technol._2_035036.pdf] Text
Reyes_2021_Mach._Learn.__Sci._Technol._2_035036.pdf - Published Version

Download (527kB)

Abstract

We present an algorithm for supervised learning using tensor networks, employing a step of data pre-processing by coarse-graining through a sequence of wavelet transformations. These transformations are represented as a set of tensor network layers identical to those in a multi-scale entanglement renormalization ansatz tensor network. We perform supervised learning and regression tasks through a model based on a matrix product states (MPSs) acting on the coarse-grained data. Because the entire model consists of tensor contractions (apart from the initial non-linear feature map), we can adaptively fine-grain the optimized MPS model 'backwards' through the layers with essentially no loss in performance. The MPS itself is trained using an adaptive algorithm based on the density matrix renormalization group algorithm. We test our methods by performing a classification task on audio data and a regression task on temperature time-series data, studying the dependence of training accuracy on the number of coarse-graining layers and showing how fine-graining through the network may be used to initialize models which access finer-scale features.

Item Type: Article
Subjects: Open STM Article > Multidisciplinary
Depositing User: Unnamed user with email support@openstmarticle.com
Date Deposited: 06 Jul 2023 04:26
Last Modified: 08 Jun 2024 08:46
URI: http://asian.openbookpublished.com/id/eprint/1250

Actions (login required)

View Item
View Item