Incorporate Lia's corrections to fault tolerance

This commit is contained in:
2026-05-04 14:59:49 +02:00
parent 17191382cf
commit 9e5eaaf985
2 changed files with 58 additions and 49 deletions

View File

@@ -25,7 +25,7 @@ introduces two new challenges \cite[Sec.~4]{gottesman_introduction_2009}:
hardware themselves.
\end{itemize}
In the literature, both of these points are viewed under the umbrella
of \emph{fault tolerance}.
of \emph{fault-tolerant} quantum computing.
We focus only on the second aspect in this work.
It was recognized early on as a challenge of \ac{qec} that the correction
@@ -188,9 +188,11 @@ We visualize the different types of noise models in
The simplest type of noise model is \emph{bit-flip} noise.
This corresponds to the classical \ac{bsc}, i.e., only $X$ errors on the
data qubits are possible \cite[Appendix~A]{gidney_new_2023}.
The occurrence of bit-flip errors is modeled as a Bernoulli process
$\text{Bern}(p)$.
This type of noise model is shown in \Cref{subfig:bit_flip}.
Note that we cannot use bit-flip noise to develop fault-tolerant
Note that bit-flip noise is not suitable for developing fault-tolerant
systems, as it does not account for errors during the syndrome extraction.
%%%%%%%%%%%%%%%%
@@ -243,12 +245,12 @@ Here we not only consider noise between syndrome extraction rounds
and at the measurements, but at each gate.
Specifically, we allow arbitrary $n$-qubit Pauli errors after each
$n$-qubit gate \cite[Def.~2.5]{derks_designing_2025}.
An $n$-qubit Pauli error is simply a series of correlated Pauli
An $n$-qubit Pauli error can be written as a series of correlated Pauli
errors on each related individual qubit.
This type of noise model is shown in \Cref{subfig:circuit_level}.
While phenomenological noise is useful for some design aspects of
fault tolerant circuitry, for simulations, circuit-level noise should
fault-tolerant circuitry, for simulations, circuit-level noise should
always be used \cite[Sec.~4.2]{derks_designing_2025}.
Note that this introduces new challenges during the decoding process,
as the decoding complexity is increased considerably due to the many
@@ -286,7 +288,7 @@ fault-tolerant \ac{qec} schemes.
E.g., they can be used to easily determine whether a measurement
schedule is fault-tolerant \cite[Example~12]{derks_designing_2025}.
Other approaches of implementing fault tolerance exist, such as
Other approaches of implementing fault-tolerance circuits exist, such as
flag error correction, which uses additional ancilla qubits to detect
potentially damaging high-weight errors \cite[Sec.~1]{chamberland_flag_2018}.
However, \acp{dem} offer some unique advantages
@@ -300,8 +302,7 @@ However, \acp{dem} offer some unique advantages
treated in a unified manner. This leads to a more powerful
description of the overall circuit.
\end{itemize}
In this work, we only consider the process of decoding under the
\ac{dem} framework.
In this work, we consider the process of decoding under the \ac{dem} framework.
% Core idea
@@ -459,15 +460,22 @@ circuit, tracking which measurements they affect
We turn to our example of the three-qubit repetition code to
illustrate the construction of the syndrome measurement matrix.
We begin by extending our check matrix in \Cref{eq:rep_code_H}
to represent three rounds of syndrome extraction.
We begin by extending our check matrix $\bm{H}_Z$ in
\Cref{eq:rep_code_H} to represent three rounds of syndrome extraction.
Each round yields an additional set of syndrome bits,
and we combine them by stacking them in a new vector
$\bm{s} \in \mathbb{F}_2^{R(n-k)}$.
We thus have to replicate the rows of $\bm{\Omega}$, once for each
$\bm{s} \in \mathbb{F}_2^{R(n-k)}$, where $R \in \mathbb{N}$ is the
number of syndrome measurement rounds.
We thus have to replicate the rows of $\bm{H}_Z$, once for each
additional syndrome measurement, to obtain
\begin{align*}
\bm{\Omega} =
\bm{\Omega}_0 =
\begin{pmatrix}
\bm{H}_Z \\
\bm{H}_Z \\
\bm{H}_Z
\end{pmatrix}
=
\begin{pmatrix}
1 & 1 & 0 \\
0 & 1 & 1 \\
@@ -482,7 +490,7 @@ additional syndrome measurement, to obtain
depicts the corresponding circuit.
Note that we have not yet introduced error locations in the syndrome
extraction circuitry, so we still consider only bit flip noise at this stage.
Recall that $\bm{\Omega}$ describes which \ac{vn} is connected to
Recall that $\bm{\Omega}_0$ describes which \ac{vn} is connected to
which parity check and the syndrome indicates which parity checks
are violated.
This means that if an error exists at only a single \ac{vn}, we can
@@ -491,7 +499,7 @@ If errors occur at multiple locations, the resulting syndrome will be
the linear combination of the respective columns.
We thus have
\begin{align*}
\bm{s} \in \text{span} \{\bm{\Omega}\}
\bm{s} \in \text{span} \{\bm{\Omega}_0\}
.%
\end{align*}
@@ -502,11 +510,11 @@ only considering $X$ errors in this case.
We introduce new error locations at the appropriate positions,
arriving at the circuit depicted in
\Cref{fig:rep_code_multiple_rounds_phenomenological}.
For each additional error location, we extend $\bm{\Omega}$ by
For each additional error location, we extend $\bm{\Omega}_0$ by
appending the corresponding syndrome vector as a column.
\begin{gather}
\label{eq:syndrome_matrix_ex}
\bm{\Omega} =
\bm{\Omega}_1 =
\left(
\begin{array}{ccccccccccccccc}
1 & 1 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0
@@ -523,24 +531,25 @@ appending the corresponding syndrome vector as a column.
& 0 & 1 & 1 & 0 & 1
\end{array}
\right) . \\[-6mm]
\hspace*{-58.7mm}
\hspace*{-56.7mm}
\underbrace{
\phantom{
\begin{array}{ccc}
0 & 0 & 0
\end{array}
}
}_\text{Original matrix}
}_{\bm{\Omega}_0} \nonumber
\end{gather}
Notice that the first three columns correspond to the original
measurement syndrome matrix, as these columns correspond to the error
locations on the data qubits.
measurement syndrome matrix $\bm{\Omega}_0$, as these columns
correspond to the error locations on the data qubits.
In this example, all measurements we considered were syndrome measurements.
Assuming no errors, the results of those measurements were
deterministic, irrespective of the actual logical state
$\ket{\psi}_\text{L}$, as they only depend on whether
$\ket{\psi}_\text{L} \in \mathcal{C}$, not on the concrete state.
Assuming no errors, the results of those measurements are
deterministic: They are not subject to any probabilistic behavior
despite the quantum mechanical nature of the underlying system.
They only depend on whether $\ket{\psi}_\text{L} \in \mathcal{C}$,
not on the concrete state.
It is, in general, possible to also consider non-deterministic measurements.
As an example, it is usual to consider a round of noiseless
measurements of the actual data qubit states after the last syndrome
@@ -557,7 +566,7 @@ extraction round.
\centering
\begin{tikzpicture}
\node{$%
\bm{\Omega} =
\bm{\Omega}_0 =
\begin{pmatrix}
1 & 1 & 0 \\
0 & 1 & 1 \\
@@ -667,7 +676,7 @@ extraction round.
\end{gather*}
\vspace*{-8mm}
\begin{gather*}
\bm{\Omega} =
\bm{\Omega}_1 =
\left(
\begin{array}{
cccccc%
@@ -761,10 +770,10 @@ Instead of using stabilizer measurement results directly, we
generalize the notion of what constitutes a parity check slightly.
We formally define a \emph{detector} as a deterministic parity constraint on
a set of measurement outcomes \cite[Def.~2.1]{derks_designing_2025}.
It can be seen that we will have as many linearly
independent detectors as there are separate deterministic measurements.
In the most straightforward case, we may simply use the stabilizer
measurements as detectors.
We immediately recognize that we will have as many linearly
independent detectors as there are separate deterministic measurements.
We generally aim to utilize the maximum number of linearly
independent detectors \cite[Sec.~2.2]{derks_designing_2025}.
@@ -775,8 +784,8 @@ the \emph{detector matrix} $\bm{D} \in \mathbb{F}_2^{D\times M}$
\cite[Def.~2.2]{derks_designing_2025}, with $~D\in \mathbb{N}$
denoting the number of detectors.
Similar to the way a \ac{pcm} associates bits with parity checks, the
detector matrix links measurements and detectors.
Each column corresponds to a measurement, while each rows corresponds
detector matrix links measurement outcomes and detectors.
Each column corresponds to a measurement, while each row corresponds
to a detector.
We should note at this point that the combination of measurements
into detectors has no bearing on the actual construction of the
@@ -786,12 +795,12 @@ affects the decoder.
Note that we can use the detector matrix $\bm{D}$ to describe the set
of possible measurement outcomes under the absence of noise.
The same way we use a \ac{pcm} to describe the code space as
\begin{align*}
Similar to the we use a \ac{pcm} to describe the code space as
\begin{equation*}
\mathcal{C}
= \{ \bm{x} \in \mathbb{F}_2^{n} : \bm{H}\bm{x}^\text{T} = \bm{0} \}
,%
\end{align*}
\end{equation*}
the set of possible measurement outcomes is simply $\text{kern}\{\bm{D}\}$
\cite[Sec.~2.2]{derks_designing_2025}.
@@ -915,7 +924,8 @@ with $\bm{m}^{(0)} = \bm{0}$.
We again turn our attention to the three-qubit repetition code.
In \Cref{fig:rep_code_multiple_rounds_phenomenological} we can see
that $E_6$ has occurred and has subsequently tripped the last four measurements.
that $E_6$ has occurred and has subsequently triggered the last four
measurements.
We now take those measurements and combine them according to
\Cref{eq:measurement_combination}.
We can see this process graphically in
@@ -923,13 +933,13 @@ We can see this process graphically in
To understand why this way of defining the detectors is useful, we
note that the error $E_6$ in
\Cref{fig:rep_code_multiple_rounds_phenomenological} has not only
tripped the measurements in the syndrome extraction round immediately
triggered the measurements in the syndrome extraction round immediately
afterwards, but all subsequent ones as well.
To only see errors in the rounds immediately following them, we
consider our newly defined detectors instead of the measurements,
that effectively compute the difference between the measurements.
Each error can only trip syndrome bits that follow it.
Each error can only trigger syndrome bits that follow it.
This is reflected in the triangular structure of $\bm{\Omega}$ in
\Cref{eq:syndrome_matrix_ex}.
Combining the measurements into detectors according to
@@ -949,7 +959,7 @@ The detector error matrix
\end{array}
\right)
\end{align*}
we obtain this way has a block-diagonal structure.
obtained this way has a block-diagonal structure.
Note that we exploit the fact that each syndrome measurement round is
identical to obtain this structure.
@@ -1030,11 +1040,11 @@ measurements) models \cite[Sec.~2.1]{gidney_fault-tolerant_2021}.
These differ in the way they compute individual error probabilities
from the physical error rate.
In this work we only consider \emph{standard circuit-based depolarizing
noise}, as this is the standard approach in the literature.
We thus set the error probabilities of all error locations in the
circuit-level noise model to the same value, the physical error rate
$p_\text{phys}$.
In this work we consider the \emph{standard circuit-based depolarizing
noise} variant of circuit-level noise, as this is the standard
approach in the literature:
We set the error probabilities of all error locations to the same
value, the physical error rate $p_\text{phys}$.
%%%%%%%%%%%%%%%%
\subsection{Per-Round Logical Error Rate}
@@ -1065,13 +1075,12 @@ The overall probability of error is then
\end{align}
We approximate $p_\text{e,total}$ using a Monte Carlo simulation and
compute the per-round-\ac{ler} using \Cref{eq:per_round_ler}.
This is a common approach taken in the literature
\cite{gong_toward_2024}\cite{wang_fully_2025}.
This is the approach taken in \cite{gong_toward_2024}\cite{wang_fully_2025}.
Another common approach \cite{chen_exponential_2021}%
Another approach \cite{chen_exponential_2021}%
\cite{bausch_learning_2024}\cite{beni_tesseract_2025} is to assume an
exponential decay for the decoder's \emph{logical fidelity}
\cite[Eq.~2]{bausch_learning_2024}
\cite[Eq.~(2)]{bausch_learning_2024}
\begin{align*}
F_\text{total} = (F_\text{round})^{R}
.%
@@ -1079,7 +1088,7 @@ exponential decay for the decoder's \emph{logical fidelity}
The logical fidelity is a measure of the quality of a logical state
\cite[Appendix~E]{postler_demonstration_2024}.
As it is related to the error rate through $F = 1 - 2p$, we obtain
\cite[Eq.~4]{bausch_learning_2024}
\cite[Eq.~(4)]{bausch_learning_2024}
\begin{align}
(1 - 2p_\text{e,total}) &= (1 - 2p_\text{e,round})^{R} \nonumber\\
\implies \hspace{15mm} p_\text{e,round} &= \frac{1}{2}

View File

@@ -42,7 +42,7 @@
\Crefname{equation}{}{}
\Crefname{section}{Section}{Sections}
\Crefname{subsection}{Subsection}{Subsections}
\Crefname{subsection}{Section}{Sections}
\Crefname{figure}{Figure}{Figures}
%
@@ -89,7 +89,7 @@
% \thesisHeadOfInstitute{Prof. Dr.-Ing. Peter Rost}
%\thesisHeadOfInstitute{Prof. Dr.-Ing. Peter Rost\\Prof. Dr.-Ing.
% Laurent Schmalen}
\thesisSupervisor{Jonathan Mandelbaum}
\thesisSupervisor{M.Sc. Jonathan Mandelbaum}
\thesisStartDate{01.11.2025}
\thesisEndDate{04.05.2026}
\thesisSignatureDate{04.05.2026}