Mesoscopic integrated circuits aim for precise control over elementary quantum systems. However, as fidelities improve, the increasingly rare errors and component crosstalk pose a challenge for validating error models and quantifying accuracy of circuit performance. Here we propose and implement a circuit-level benchmark that models fidelity as a random walk of an error syndrome, detected by an accumulating probe. Additionally, contributions of correlated noise, induced environmentally or by memory, are revealed as limits of achievable fidelity by statistical consistency analysis of the full distribution of error counts. Applying this methodology to a high-fidelity implementation of on-demand transfer of electrons in quantum dots we are able to utilize the high precision of charge counting to robustly estimate the error rate of the full circuit and its variability due to noise in the environment. As the clock frequency of the circuit is increased, the random walk reveals a memory effect. This benchmark contributes towards a rigorous metrology of quantum circuits.
Fidelity control is important to quantum metrology and fault-tolerant quantum computation. Here, authors realize clock-controlled transfer of electrons through quantum dots and describe the statistics of accumulated charge by a random-walk model, achieving a benchmark for single-electron circuits.
Precise manipulation of individual quantum particles in complex single-electron circuits for sensors, quantum metrology, and quantum information transfer1,2 requires tools to certify fidelity and establish a scalable error model. A similar challenge arises in the gate-based approach to universal quantum computation3–8 where benchmarking gate sequences9–13 are employed to validate independent-error models14 which are crucial for scaling towards fault-tolerance15,16. Here, we introduce the idea of benchmarking by error accumulation to integrated single-electron circuits. We experimentally realize the clock-controlled transfer of electrons through a chain of quantum dots, and describe the statistics of accumulated charge by a random-walk model. High-fidelity components and unprecedented accuracy of charge counting enable the detection of excess noise beyond the sampling error, the identification of the timescale for consecutive step interaction, and an accurate estimate for the failure probabilities of the elementary charge transfer. Abstracting errors from component to circuit level opens a path to leverage charge counting for microscopic certification of electrical quantities challenging the precision of metrological measurements17, and to introduce fidelity control in building blocks of quantum circuits18–21.
In quantum metrology, stability and reproducibility of the environment for elementary quantum entities (photons, qubits, electrons) and their uncontrolled interactions set the practical limits on the precision of quantum circuits22, which approach the fundamental quantum limits, i.e., counting shot noise for independent identical particles, or the Heisenberg limit for entanglement-enhanced measurements23. In particular, accurate benchmarking of fidelity in the presence of long-term drifts and memory is difficult but essential for the validation of the precision of quantum standards. Identifying and quantifying the residual error, i.e., any deviation from the perfect performance of a circuit, define the challenge to be answered by the random-walk benchmarking for high-precision single-electron current sources. Validating consistency of the error model by statistical testing ensures the robustness of the fidelity estimates, which is an actively studied problem in the related context of assessing quantum computation platforms14,24–26.
The random-walk benchmarking addresses the question of uniformity in time of repeated identical operations by error accumulation. The error signal (syndrome) considered here is the discrete charge stored in the circuit after executing a sequence of t operations. The measured deviation x in the number of trapped electrons is modeled by the probability for a random walker to reach integer coordinate x from initial position of x = 0 in t steps (Fig. 1). In the desired high-fidelity limit of near-deterministic on-demand transfer of a fixed number of electrons any residual randomly occurring errors that alter x will be very rare and the walker will remain stationary most of the time, with occasional steps of length one. Here we study to what extent two single-step, x → x ± 1, probabilities P± describe the statistics of x collected by repeated operation of the circuit, and how deviations from independent-error accumulation can be detected and quantified, revealing otherwise hidden physics. The baseline random-walk model with t- and x-independent P± predicts the following distribution:



Schematic of the random-walk benchmark for single-electron transfer utilizing charge counting.
a Sample micrograph and measurement scheme. After the initial charge measurement t clock cycles are applied. The paths taken by 30 simulated walkers (using error rates extracted from the counting statistics) are represented by blue lines, transitioning every clock cycle in x by a step of −1, 0, +1. The frequency with which each branch is visited is indicated by the linewidth. A final charge measurement yields the end-point of the random walk as the difference between initial and final charge. The orange line exemplifies a single random walk with self-intersections. b Signal to noise ratio: a (typical) histogram of the differential charge detection signal with the identified difference in electron number indicated by color. The peak separation is shown in units of the Gaussian noise amplitude σ (black dashed lines indicate the corresponding Gaussian fits). c Measured statistics of finding the walker at position x after t steps.
Experimentally, the high-fidelity circuit for electron transfer is realized by a chain of quantum dots in which the first and the last dot are operated as single-electron pumps27 and the central dot provides the error signal as shown in Fig. 1. A clock of frequency (f = 30–300 MHz) drives the pumps to transfer one electron per cycle through the chain (from top to bottom in Fig. 1).
Within one clock cycle, the entrance barrier to the dynamic quantum dot is lowered and raised by the pump stimulus, isolating one electron from the source reservoir and then ejecting it over the high exit barrier; barrier height asymmetry between entrance and exit defines the transfer direction28. The operating points of the pumps are chosen to minimize and approximately balance the error probabilities of transferring either zero or two electrons instead of one (with a slight bias towards zero-electron transfers, as this error rate only increases exponentially and not double-exponentially with deviations from the optimal operating point29). The working points of the pumps are not retuned when operating the full circuit. Reproducible formation of quantum dots30 allows demonstrating the high-fidelity operation of the circuit event at zero magnetic field, at which readout precision is enhanced by cryogenic reflectometry.
The excess charge x from accumulating errors is inferred from a differential measurement by a charge detector capacitively coupled to the central dot, reading out the detector state before and after each sequence transferring t electrons. As tunneling events are only enabled by the clocked stimulus applied to the pumps, a long detector integration-time up to 1 ms can be chosen for unambiguous identification of x with a signal to noise ratio of 17 (Fig. 1b). A full histogram of detector states before and after the transfer sequence allows to reconstruct the shape of the Coulomb blockade peak resonance utilized by the charge detector and provides rigorous classification thresholds for the identification of x. The sequence of electron transfer and charge detection is repeated with the repetition rate limited by the detector integration-time (up to 4 kHz), until a set number of counts (N = 1 × 105 to 2 × 106) is accumulated. Any deviations not aligned with the measurement timing, such as instabilities in the charge detector, are readily recognized and discarded, while unintended charge transitions during the operation of the pump are counted and correctly identified as errors.
Although the individual accuracy of the active components can exceed metrological precision31, their simultaneous operation in a mesoscopic circuit32 precludes the prediction of transfer fidelity from component-wise characterization due to interactions and crosstalk between the elements in the chain, exemplifying the need for circuit-level benchmarking. Experimental evidence for strong discord between component-wise and circuit-level characterization is given in Supplementary Note 3.
Here we report the measurement results on two devices: device A introduces the methodology to resolve effects beyond statistical noise of independent-error accumulation in a high-fidelity circuit, while device B demonstrates the effects of memory with increased repetition frequency. Both devices share very similar device geometries and parameters.
Figure 2a shows the counting statistics measured for device A at f = 30 MHz for t up to 104 compared to predictions of the baseline model. General trends expected from the random walk are evident: for short sequences,


Consistency of random-walk models with respect to the experimental statistics of accumulated errors.
a Measured
The key question for random-walk benchmarking is whether the uncorrelated residual randomness defined by two probabilities P+ and P− predicts the entire probability distribution. This question is answered in three steps: (i) significance testing of deviations from the baseline model as a statistical null-hypothesis to delineate the inevitable sampling error from model error; (ii) extending the model to accommodate correlated excess noise33 detected in the first step; (iii) perform parameter estimation of the noise model that yields average values of P± with an estimate of the variability.
For consistency testing, we have increased the number N of samples per sequence by a factor of ~10, and limited t to 100. Fisherian significance tests34 are used to define consistency regions of p-value > 0.05 in the parameter space (P+, P−) where the baseline model cannot be rejected at this significance level (see “Methods” section). Figure 2b shows quasielliptic consistency regions computed for each sequence length t separately, randomly clustering in a tight area with the sizes shrinking roughly as
To quantify the excess noise, the model is now extended (part (ii) of the outline above) by drawing the step probabilities P± randomly from a Dirichlet distribution38,39 (Supplementary Note 8) over the standard 2-simplex; the corresponding parameters
In order to gain insight into a possible physics mechanism for excess noise and illustrate the robustness of statistical methods, we have simulated the experimental timeline using a random-walk model with P± parameters subjected to 1/f noise from an ensemble of independent two-level fluctuators (Supplementary Note 13). The results follow the general pattern outlined above: (i) for a fixed size of the statistical sample, there is a threshold in the excess noise amplitude above which the data contradict both the baseline and the fast-fluctuator models but remain consistent with the slow-drift model. This threshold corresponds to excess noise sufficiently affecting probabilities of multiple errors per burst to reveal inconsistency with Eq. (1) in the tails (∣x∣ > 1) of the error syndrome distribution
The methodology to quantify independent-error accumulation described above makes it possible to probe the effect of increased clock frequency on the circuit and thereby investigate response times of the electron shuttle and interactions between subsequent steps. In device B, the error rates are P− = (6.31 ± 0.23) × 10−3 and P+ = (2.71 ± 0.043) × 10−2 at the same frequency of 30 MHz as device A investigated above. A ten-fold increase of the clock frequency to 300 MHz is introduced by uniform time compression of signals controlling the transfer operations; the resulting counting statistics is presented in Fig. 3a (circles). The random-walk model with constant P±, described by Eq. (1), no longer applies even qualitatively, which raises the question whether the fidelity of the circuit has decreased to a point where errors can no longer be considered rare as outlined in the beginning. This question is answered in the negative with the help of the following theorem defining a spread condition, which sets a precise bound on the applicability of the random-walk approach with possibly non-stationary error rates: If distributions



Memory effect probed at the increased clock frequency.
a Measured
We find that the distributions measured on device B do satisfy the spread condition (2) as long as all x are fully resolved in counting (t ≤ 6). We estimate the non-stationary but x-homogeneous single-step error probabilities of the corresponding Markov chains,
To probe this memory effect, we introduce a delay time τDelay between otherwise unaltered signals driving the transfer operations thus extending the physical time f−1 corresponding to a single step of the random walk from τop to τop + τDelay as sketched in Fig. 3b. With increasing delay, a gradual reduction of the t-dependence in
In conclusion, the view of single-electron components as elements of a digital circuit has enabled an abstract and universal description of fidelity in terms of the random walk of an error syndrome. Accumulation of errors over long sequences allows probing fast and accurate operations beyond the bandwidth of a slow single-charge detector. The accompanying statistical methodology quantifies the stability of the error process and uncovers short memory times, both of which are elusive to direct observation. In quantum metrology, an accurate estimate of the circuit error has an immediate application: the variance of the current I = (Is + Id)/2 flowing into (Is) and out of (Id) the circuit is given by the variance of the differential charge x, which corresponds to the displacement current Is − Id = efx/t. Hence, the variance of x,
Devices A and B were fabricated from GaAs/AlGaAs heterostructures with two dimensional electron gas (2DEG) nominally 90 nm below the surface. Quantum dots are formed by CrAu top gates depleting a shallow-etched mesa30. The charge detector is formed against the edge of a separate mesa and capacitively coupled to the central quantum dot via a floating gate45.
All measurements were performed in a dilution refrigerator at a base temperature of 20 mK and 0 T external field. The charge detector signal is readout by rf reflectometry46. Sinusoidal pulses generated by arbitrary waveform generators modulate the entrance barriers of the single-electron pumps and drive the clock-controlled electron transfer27. The drift-stability due to control voltages is estimated to be better than 10−8. Charge transfer and detector readout are triggered in a sequence: (i) readout of the initial detector state, (ii) application of t sinusoidal pulses to both pumps simultaneously, (iii) readout of the final detector state, (iv) reset by connecting the intermediate dot to source. The difference between initial and final detector state yields the charge x deposited on the central quantum dot by the burst transfer, providing raw data for subsequent statistical analysis.
Fisher’s p-value for each experimentally measured x-resolved set of N counts is defined as the probability of an equally or more extreme outcome under the null-hypothesis being tested (either the baseline random walk or one of the two excess noise models with Dirichlet-distributed P±); it is evaluated by Monte Carlo sampling as described in the Supplementary Notes 4 and 8.
Supplementary information is available for this paper at 10.1038/s41467-020-20554-w.
We acknowledge T. Gerster, L. Freise, H. Marx, K. Pierz, and T. Weimann for support in device fabrication, J. Valeinis for discussions. D.R. additionally acknowledges funding by the Deutsche Forschungsgemeinschaft (DFG) under Germany’s Excellence Strategy—EXC-2123 —90837967, as well as the support of the Braunschweig International Graduate School of Metrology B-IGSM. M.K., A.A., and V.K are supported by Latvian Council of Science (grant no. lzp-2018/1-0173). A.A. also acknowledges support by ‘Quantum algorithms: from complexity theory to experiment’ funded under ERDF program 1.1.1.5.
D.R. and N.U. designed and performed the experiment. M.K., A.A., and V.K. developed random-walk modeling and statistical methodology. DR., N.U., M.K., and V.K. performed the data analysis. M.K. wrote the supplementary information with contributions by D.R., V.K., and A.A. All authors contributed to the discussion of results and the writing of the manuscript.
Open Access funding enabled and organized by Projekt DEAL.
The data that support the graphs of this work are available in the Zenodo repository 10.5281/zenodo.4287363.
The code producing the figures is available from the corresponding author upon reasonable request.
The authors declare no competing interests.
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
44.
45.
46.