1 of 8

Out-of-distribution generalization for learning quantum dynamics

By Caro, Huang, Ezzell, Gibbs, Sornborger, Cincio, Coles, and Holmes

2 of 8

Outline

Overview of main result
Locally scrambling ensembles
Framework
Numerical results
Open Questions

Applications of Quantum Dynamics Learning.

3 of 8

Main result

Out-of-distribution generalization for unitary learning is possible for a wide variety of training and testing distributions, if both distributions are locally scrambled

A QNN capturing the action of a target unitary on only a polynomial number of random product states generalizes to test data composed of fully random states
Possible to learn a unitary on a broad spread of highly entangled states having only studied its action on a limited number of product states
Product states can be prepared using only single qubit gates, bringing the possibility of using QML to learn unitary processes nearer term
A low-entangling unitary can be compiled using results from only low-entangled training states, potentially indicating a new quantum-inspired classical approach to unitary compilation.

Generalization for learning a target unitary U

4 of 8

Locally scrambling ensembles

Ensembles of states that are at least locally random
Invariant under pre-processing by tensor products of arbitrary local unitaries, i.e. of the form

for some locally scrambled unitary ensemble

Out-of-distribution learning results hold for a slightly broader class of ensembles with the requirement that the ensemble agrees with a locally scrambled one up to and including its complex second moments, denoted by
divides naturally into sub-classes of ensembles that give rise to training sets and testing sets

5 of 8

Examples of ensembles in :

- Products of Haar-random single-qubit states.
- Products of random single-qubit stabilizer states.
- Products of Haar-random k-qubit states.
- Haar-random n-qubit states.
- A 2-design on n-qubit states.
- The output states of random quantum circuits,

where denotes the k-local n-qubit quantum

circuit architecture from which the random circuit is

constructed

Locally Scrambling Ensembles

6 of 8

Framework

For learning an unknown n-qubit unitary , optimize classical parameters α of V(α), an n-qubit unitary QNN, so for α_opt, V(α_opt) predicts the action of U on previously unseen test states, i.e. minimize

where testing distribution P is a probability distribution over (pure) n-qubit states and the factor of 1/4 ensures 0 ≤ R_P(α) ≤ 1.

A learner so cannot evaluate the above cost function (generally does not have access to the full testing ensemble P!)
Evaluate the training cost on a training data set consisting of input-output pairs of pure n-qubit states, where N input states are drawn independently from a training distribution Q.

Rewrite in terms of the average fidelity as

which can be efficiently computed using a Loschmidt echo or swap test circuit

Goal is to achieve small risk R_P(α) by training the parameters α of the QNN to minimize the training cost C_DQ(N)(α)

7 of 8

Numerical results

Compare the exact evolution of product states under an unknown target Hamiltonian to an L layered Trotterized ansatz.
In-distribution risk is average prediction error over random product states
Out-of-distribution testing risk

is over the global Haar distribution

Linear correlation between the cost function and both and demonstrates that both in-distribution and out-of-distribution generalization have been successfully achieved
Increasing the number of layers in the ansatz corresponds to a decreased obtainable cost function value

Out-of-distribution Generalization for Hamiltonian Learning

8 of 8

Open Questions

Is out-of-distribution generalization is possible for other QML tasks such as learning quantum channels, or for performing classification tasks?
Could out-of distribution generalization be viable for other classes of distributions?
Is it worth investigating possible connections to Mitiq?

Paper mentions learning pulse sequences for noise resilient gate implementations