Write conclusion

Finish intro
2026-05-04 01:21:26 +02:00 · 2026-05-03 20:51:32 +02:00
3 changed files with 183 additions and 171 deletions
--- a/src/thesis/chapters/1_introduction.tex
+++ b/src/thesis/chapters/1_introduction.tex
@@ -3,14 +3,13 @@
 % Intro to quantum computing
 % TODO: Rephrase
 In 1982, Richard Feynman, motivated by the difficulty of simulating
 quantum-mechanical systems on classical hardware, put forward the
 idea of building computers from quantum hardware themselves
 \cite{feynman_simulating_1982}.
 The use of such quantum computers has since been shown to offer promising
 prospects not only with regard to simulating quantum systems but also
-for solving certain kinds of problems that are classicaly intractable.
+for solving certain kinds of problems that are classically intractable.
 The most prominent example is Shor's algorithm for integer
 factorization \cite{shor_algorithms_1994}.
@@ -73,12 +72,12 @@ subsequent decoding process on the measured syndrome.
 Another difference between \ac{qec} and classical channel coding is
 the resource constraints.
-For QEC, low latency matters more than low overall computational
+For \ac{qec}, low latency matters more than low overall computational
 complexity, due to the backlog problem
-\cite[Sec.~II.G.3.]{terhal_quantum_2015}: Some gates may turn
+\cite[Sec.~II.G.3.]{terhal_quantum_2015}: Certain gates turn
 single-qubit errors into multi-qubit ones, so errors must be
 corrected beforehand.
-A QEC system that is too slow accumulates a backlog at these points,
+A \ac{qec} system that is too slow accumulates a backlog at these points,
 causing exponential slowdown.
 Several code constructions have been proposed for \ac{qec} codes over the years.
@@ -87,43 +86,75 @@ standard for experimental applications for a long time
 \cite[Sec.~I]{koutsioumpas_colour_2025}, due to their
 reliance on only local connections between qubits
 \cite[Sec.~5]{roffe_decoding_2020}.
-Recently, \ac{qldpc} codes have been getting increasingly more
+Recently, \ac{qldpc} codes have been getting increasing
 attention as they have been shown to offer comparable thresholds with
 substantially improved encoding rates \cite[Sec.~1]{bravyi_high-threshold_2024}.
 \ac{qldpc} codes are generally decoded using a syndrome-based variant
 of the \ac{bp} algorithm \cite[Sec.~1]{roffe_decoding_2020}.
 We focus on \ac{qldpc} codes in our work and specifically \ac{bb} codes,
 as they are promising candidates for practical QEC due to their high
 encoding rates, large minimum distances, and short-depth syndrome
 extraction circuits \cite[Sec.~1]{bravyi_high-threshold_2024}.
 % DEMs and fault tolerance
-\content{Syndrome extraction can also be faulty -> Need for fault tolerance}
+The syndrome extraction itself is implemented on quantum hardware and
-\content{Have to repeat syndrome measurements}
+is therefore subject to the same noise as the data qubits.
-\content{DEMs one way of implementing fault tolerance: Model more
+As a consequence, the \ac{qec} procedure, meant to protect the quantum
-error locations -> Larger resulting codes}
+state, itself introduces new \emph{internal errors}.
-\content{Literature deals with latency problem for fault tolerance by
+A procedure is called \emph{fault-tolerant} if it remains effective
-sliding-window decoding}
+even in the presence of these internal errors
 \cite[Sec.~4]{gottesman_introduction_2009}.
 To deal with internal errors that flip syndrome bits, multiple rounds
 of syndrome measurements are performed.
 One approach of implementing fault tolerance is using \acp{dem}.
 A \ac{dem} abstracts away the underlying circuit,
 focusing only on the relationship between possible errors
 and their effects on the syndrome \cite[Sec.~1.4.3]{higgott_practical_2024}.
 A \emph{detector error matrix} is generated from the circuit, which is
 used for decoding instead of the original check matrix.
 Decoding under a \ac{dem} poses a challenge with respect to the
 latency constraint.
 This is because the detector error matrix is much larger than the
 check matrix of the underlying code, since it needs to represent many
 more error locations.
 For example, in our experiments using the $\llbracket 144,12,12
 \rrbracket$ \ac{bb} code with $12$ syndrome measurement rounds, the
 number of \acp{vn} grew from $144$ to $9504$ and the number of
 \acp{cn} grew from $72$ to $1008$.
 To keep the latency of \ac{dem} decoding manageable, one approach is
 \emph{sliding-window decoding}.
 Instead of decoding on the entire detector error matrix at once,
 it is partitioned into several overlapping windows.
 Once decoding of one window is complete, error estimates on the initial part
 that is no longer needed are committed, and the next window is processed.
 This way, decoding can start as soon as the syndrome bits required
 for the first window have been extracted.
 The idea originates with the \emph{overlapping recovery} scheme
 proposed for the surface code in
 \cite[Sec.~IV.B]{dennis_topological_2002} and has since been studied
 for surface and toric codes \cite{kuo_fault-tolerant_2024} as well as
 for \ac{qldpc} codes under both phenomenological and circuit-level
 noise \cite{huang_increasing_2024,gong_toward_2024,kang_quits_2025}.
 % Reseach gap + our work
-\content{Use BP for decoding, but has convergence issues -> Modify BP}
+We observe a structural similarity between sliding-window decoding for
-
+\acp{dem} and window decoding for \ac{sc}-\acs{ldpc} codes.
-\content{We note a striking similarity between sliding-window
+In contrast to the latter, however, where \ac{bp} messages are
-decoding for DEMs and the way SC-LDPC codes are decoded}
+carried between windows \cite[Sec.~III.~C.]{hassan_fully_2016},
-\content{Extend QEC sliding-window decoding by warm start, inspired
+the existing realizations of sliding-window decoding for \ac{qec}
 by SC-LDPC decoders}
 The existing realizations of sliding-window decoding for \ac{qec}
 discard the soft information produced inside one window before moving
-on to the next, in contrast to the analogous \ac{sc}-\ac{ldpc}
+to the next.
 decoders, which carry messages between windows
 \cite[Sec.~III.~C.]{hassan_fully_2016}.
 This thesis investigates whether the same idea can be carried over to
 the \ac{qec} setting.
 We propose \emph{warm-start sliding-window decoding}, in which the
 \ac{bp} messages from the overlap region of the previous window are
 reused to initialize \ac{bp} in the current window in place of the
 standard cold-start initialization.
 We formulate the warm start first for plain \ac{bp} and then for
-\ac{bpgd}, where some care is needed in deciding which information to
+\ac{bpgd}, a variant of \ac{bp} with better convergence properties
-carry over.
+for \ac{qec} codes.
 The decoders are evaluated by Monte Carlo simulation on the
 $\llbracket 144,12,12 \rrbracket$ \ac{bb} code under standard
 circuit-based depolarizing noise over $12$ syndrome extraction rounds.
@@ -131,140 +162,6 @@ The main finding is that warm-starting yields a consistent
 improvement at low iteration budgets, which is the regime relevant for
 low-latency operation.
 % The need for fault tolerance
 % A naive picture of \ac{qec} treats the syndrome extraction circuit as
 % ideal and only considers errors on the data qubits.
 % In reality, every gate, every ancilla, and every measurement involved
 % in extracting the syndrome can itself fail, introducing new faults
 % into the procedure that is supposed to correct them
 % \cite[Sec.~III]{shor_scheme_1995}.
 % A \ac{qec} procedure is called \emph{fault-tolerant} if it remains
 % effective in the presence of these internal faults
 % \cite[Sec.~4]{gottesman_introduction_2009}.
 % Fault tolerance
 % The standard formal definition requires the number of output errors
 % to remain bounded as long as the combined number of input and
 % internal errors does not exceed the correction capability of the code
 % \cite[Def.~4.2]{derks_designing_2025}.
 % To deal with internal errors that flip syndrome bits, multiple rounds
 % of syndrome measurements are performed, and the resulting space-time
 % history of detector outcomes is decoded jointly.
 % The probabilities of errors at each location in the circuit are
 % collected in a \emph{noise model}.
 % The most general such model, in which an arbitrary Pauli error is
 % allowed after each gate, is referred to as \emph{circuit-level noise}
 % \cite[Def.~2.5]{derks_designing_2025} and is the noise model that
 % should be used for fault-tolerance simulations
 % \cite[Sec.~4.2]{derks_designing_2025}.
 % DEMs
 % The combination of circuit-level noise and multiple syndrome
 % measurement rounds yields a complicated, code- and circuit-specific
 % decoding problem.
 % A recent line of work argues that this problem is most cleanly
 % expressed through a \acf{dem} \cite[Sec.~6]{derks_designing_2025}.
 % A \ac{dem} abstracts away the underlying circuit and lists the
 % independent error mechanisms together with the detectors they flip
 % and the logical observables they affect.
 % From the decoder's perspective, decoding under a \ac{dem} is again a
 % classical decoding problem on a parity-check matrix, with the
 % detectors playing the role of \acfp{cn} and the error mechanisms
 % playing the role of \acfp{vn}.
 % The standard tool for generating \acp{dem} from arbitrary stabilizer
 % circuits is Stim \cite{gidney_stim_2021}, in which the \ac{dem}
 % formalism was originally introduced.
 % The issues with deocoding under DEMs
 % For \ac{qec}, the binding constraint on the decoder is latency, not
 % raw computational complexity.
 % This is the \emph{backlog problem}: certain gates can transform
 % existing single-qubit errors into multi-qubit errors, and any
 % correction must be applied before such gates are reached.
 % A decoder that fails to keep up with the rate at which the hardware
 % produces syndromes leads to an exponential slowdown of the computation
 % \cite[Sec.~II.G.3.]{terhal_quantum_2015}.
 % Decoding under a \ac{dem} aggravates this constraint, because the
 % matrix that results from unrolling several rounds of syndrome
 % extraction is much larger than the parity-check matrix of the
 % underlying code.
 % Each error mechanism in the circuit becomes a separate \ac{vn} and
 % each detector becomes a separate \ac{cn}.
 % For the $\llbracket 144,12,12 \rrbracket$ \acf{bb} code
 % \cite[Sec.~3]{bravyi_high-threshold_2024} with $12$ syndrome
 % measurement rounds, the number of \acp{vn} grows from $144$ to $9504$
 % and the number of \acp{cn} grows from $72$ to $1008$.
 % Exiting solutions to these issues (sliding-window decoding + BP modifications)
 % The dominant strategy for keeping the latency of \ac{dem} decoding
 % manageable is \emph{sliding-window decoding}.
 % Instead of decoding the entire space-time history at once, the
 % decoder operates on a window that spans only a few syndrome
 % measurement rounds.
 % After each round, the window slides forward, and the corrections in
 % the part of the previous window that is no longer needed are committed.
 % The idea originates with the \emph{overlapping recovery} scheme
 % proposed for the surface code in \cite[Sec.~IV.B]{dennis_topological_2002}
 % and has since been studied for surface and toric codes
 % \cite{kuo_fault-tolerant_2024} as well as for \ac{qldpc} codes under
 % both phenomenological and circuit-level noise
 % \cite{huang_increasing_2024,gong_toward_2024,kang_quits_2025}.
 % The structure of the decoding problem inside each window is
 % reminiscent of \acf{sc}-\acf{ldpc} decoding from classical
 % communications \cite[Intro.]{costello_spatially_2014}, where similar
 % windowing techniques are used and where soft information is passed
 % between consecutive windows
 % \cite[Sec.~III.~C.]{hassan_fully_2016}.
 % We focus on QLDPC codes
 % In this work we focus on \acf{qldpc} codes, of which the \ac{bb} code
 % mentioned above is one example.
 % \ac{qldpc} codes have emerged as leading candidates for practical
 % \ac{qec} due to their high encoding rates and large minimum distances
 % at short syndrome-extraction-circuit depths
 % \cite[Sec.~1]{bravyi_high-threshold_2024}.
 % The natural decoder for them is \acf{bp}, which is well suited to
 % sparse parity-check matrices and admits an efficient and parallel
 % implementation, but is known to converge poorly on quantum codes due
 % to quantum degeneracy and the unavoidable short cycles in the Tanner
 % graph \cite[Sec.~II.C.]{babar_fifteen_2015}\cite[Sec.~V]{roffe_decoding_2020}.
 % Several modifications of \ac{bp} have been proposed to address this:
 % combining \ac{bp} with \acf{osd} \cite{roffe_decoding_2020}, decoding
 % multiple variations of the code in parallel as in \acf{aed}
 % \cite{koutsioumpas_automorphism_2025}, or extending \ac{bp} with
 % guided decimation as in \acf{bpgd} \cite{yao_belief_2024}.
 % Contributions of this Thesis
 % The existing realizations of sliding-window decoding for \ac{qec}
 % discard the soft information produced inside one window before moving
 % on to the next, in contrast to the analogous \ac{sc}-\ac{ldpc}
 % decoders, which carry messages between windows
 % \cite[Sec.~III.~C.]{hassan_fully_2016}.
 % This thesis investigates whether the same idea can be carried over to
 % the \ac{qec} setting.
 %
 % We propose \emph{warm-start sliding-window decoding}, in which the
 % \ac{bp} messages from the overlap region of the previous window are
 % reused to initialize \ac{bp} in the current window in place of the
 % standard cold-start initialization.
 % We formulate the warm start first for plain \ac{bp} and then for
 % \ac{bpgd}, where some care is needed in deciding which information to
 % carry over.
 % The decoders are evaluated by Monte Carlo simulation on the
 % $\llbracket 144,12,12 \rrbracket$ \ac{bb} code under standard
 % circuit-based depolarizing noise over $12$ syndrome extraction rounds.
 % The main finding is that warm-starting yields a consistent
 % improvement at low iteration budgets, which is the regime relevant for
 % fault-tolerant operation.
 % Outline of the Thesis
 \Cref{ch:Fundamentals} reviews the fundamentals of classical and
@@ -292,6 +189,7 @@ introduces the proposed warm-start sliding-window decoder for
 plain \ac{bp} and for \ac{bpgd}, and reports numerical results on the
 $\llbracket 144,12,12 \rrbracket$ \ac{bb} code.
 % TODO: Possibly extend to mention specific proposed research directions
 \Cref{ch:Conclusion} concludes the thesis and outlines directions for
 further research.
--- a/src/thesis/chapters/4_decoding_under_dems.tex
+++ b/src/thesis/chapters/4_decoding_under_dems.tex
@@ -2274,7 +2274,7 @@ In both panels, every curve again exhibits the expected monotonic
 increase of the per-round \ac{ler} with the physical error rate.
 Across both panels and across all parameter choices, the warm-start
 curves lie above the corresponding cold-start curves, i.e.,
-the warm-start variant performsworse than its cold-start counterpart.
+the warm-start variant performs worse than its cold-start counterpart.
 This is the opposite of what we observed for plain \ac{bp}, where
 warm-start improved upon cold-start at every parameter setting.
 The gap between the warm- and cold-start curves additionally widens
--- a/src/thesis/chapters/5_conclusion_and_outlook.tex
+++ b/src/thesis/chapters/5_conclusion_and_outlook.tex
@@ -1,15 +1,129 @@
 \chapter{Conclusion and Outlook}
 \label{ch:Conclusion}
-\content{Takeaway: Warm-start more effective for lower numbers of max
+% Recap of motivation
    iterations (plays into our hands because lower number of iterations
 means lower latency)}
 \content{Warm-start initialization limited to decoding algorithms
 providing relevant soft information}
-\content{\textbf{Ideas for further research}}
+This thesis investigated decoding under \acp{dem} for fault-tolerant
-\content{Softer way of decimating VNs}
+\ac{qec}, with a focus on low-latency decoding methods for \ac{qldpc} codes.
-\content{Systematic study on using different inner decoders (AED,
+The repetition of the syndrome measurements, especially under
-SED, BPGD, ...)}
+consideration of circuit-level noise, leads to a significant increase
-\content{Investigate SC-LDPC window decoding wave-like effects}
+in decoding complexity: in our experiments on the $\llbracket
 144,12,12 \rrbracket$ \ac{bb} code with $12$ syndrome extraction
 rounds, the check matrix grew from 144 \acp{vn} and 72
 \acp{cn} to 9504 \acp{vn} and 1008 \acp{cn}.
 % Recap of research gap and own work
 Sliding-window decoding addresses the latency constraint by
 exploiting the time-like locality of the syndrome extraction circuit,
 which manifests as a block-diagonal structure in the detector error
 matrix when detectors are defined as the difference of consecutive
 syndrome measurement rounds.
 We drew a comparison to windowed decoding for \ac{sc}-\ac{ldpc}
 codes, but noted that the existing realizations of sliding-window
 decoding discard the soft information produced inside one window
 before moving to the next.
 Building on this observation, we proposed warm-start sliding-window
 decoding, in which the \ac{bp} messages on the edges crossing into
 the overlap region of the previous window are reused to initialise
 the corresponding messages of the next window in place of the
 standard cold-start initialisation.
 We formulated the warm start first for plain \ac{bp} and then for
 \ac{bpgd}, the latter being attractive as an inner decoder because it
 addresses the convergence problems caused by short cycles and
 degeneracy in \ac{qldpc} Tanner graphs.
 The decoders were evaluated by Monte Carlo simulation on the
 $\llbracket 144,12,12 \rrbracket$ \ac{bb} code over $12$ syndrome
 extraction rounds under standard circuit-based depolarizing noise.
 We focused on a qualitative analysis, refraining from further
 optimizations such as introducing a normalization parameter for the
 min-sum algorithm.
 % Recap of experimental conclusions
 For plain min-sum \ac{bp}, the warm start was consistently beneficial
 across the parameter ranges we examined. The size of the gain depended
 on the overlap between consecutive windows: enlarging $W$ or
 shrinking $F$, both of which enlarge the overlap, raised the
 warm-start performance increase.
 We argued that the underlying mechanism is an effective increase in
 the number of \ac{bp} iterations spent on the \acp{vn} in the overlap
 region: each such \ac{vn} is processed by multiple consecutive window
 invocations, and the warm start lets these invocations accumulate
 iterations on the same \acp{vn} rather than restarting from scratch.
 The gain was most pronounced at low numbers of maximum iterations, where
 every additional iteration carries proportionally more information.
 For \ac{bpgd}, we noted that more information is available in the
 overlap region of a window: in addition to the \ac{bp} messages,
 there is information about which \acp{vn} were decimated and to what value.
 Passing this decimation information to the next window in addition to
 the messages turned out to worsen the performance considerably, which
 we attributed to a premature hard decision of the \acp{vn} in the
 overlap region.
 Restricting the warm start to the \ac{bp} messages alone, removed this effect.
 The resulting message-only warm start recovered a consistent
 improvement over cold-start that followed the same qualitative
 behaviour as for plain \ac{bp}: larger overlap, achieved by larger
 $W$ or smaller $F$, yielded a larger gain, and the
 performance difference was most pronounced at low numbers of maximum iterations.
 % Implications from experimental results
 These observations imply that the warm-start modification to
 sliding-window decoding provides a universal improvement, as long as
 some care is taken with specifying the information to be passed to
 the subsequent window.
 Not that this comes at no additional cost to the decoding complexity,
 since the only difference between warm- and cold-start sliding-window
 decoding is the initialization of the \ac{bp} messages.
 We expect similar behavior with other inner decoders that support
 soft information initialization in the overlap region.
 % Further research
 Several directions for further research emerge from this work.
 The most immediate is an extension of the evaluation to other
 \ac{qldpc} code families, to other circuit-level noise models such as
 SI1000 or EM3, and to a range of code sizes.
 This would clarify the generality of the gain due to the warm-start
 initialization.
 We expect the qualitative findings to carry over, since the
 underlying mechanism is structural rather than code-specific, but
 quantifying the gain across code families and noise models is left to
 future work.
 A second direction is a systematic study of inner decoders under the
 warm-start framework.
 We considered plain min-sum \ac{bp} and \ac{bpgd}, but other
 algorithms used for \ac{qldpc} decoding, such as automorphism
 ensemble decoding \cite{koutsioumpas_automorphism_2025} or neural
 \ac{bp} \cite{miao_quaternary_2025} may admit warm-start variants of their own.
 A third direction is a softer treatment of the decimation state in \ac{bpgd}.
 Rather than discarding the decimation information of the previous
 window entirely, as in the message-only warm start used here, one
 could encode the decimation decisions as strong but finite biases on
 the channel \acp{llr} of the next window, allowing the new window's parity
 checks to override them if the syndrome calls for it.
 This would interpolate between the two warm-start variants studied here and
 might combine the benefits of both.
 A related question is whether the decimation schedule itself should
 be aware of the window structure, for instance by deferring
 decimation of \acp{vn} in the overlap region until they have been
 visited by the next window.
 A final direction is suggested by the structural similarity between
 sliding-window decoding for \acp{dem} and windowed decoding for
 \ac{sc}-\ac{ldpc} codes.
 The current approach to generating the syndrome extraction circuitry
 necessarily leads to a coupling width of one between adjacent
 syndrome measurement rounds.
 A natural question is whether the coupling width could be
 increased, e.g., by interleaving two separate realizations of the
 syndrome measurement circuitry instead of always repeating the same one.
 Work in this direction would also be a step toward bringing
 sliding-window decoding under DEMs within the scope of the analytical
 machinery developed for SC-LDPC codes.
Author	SHA1	Message	Date
Andreas Tsouchlos	e9d996155d	Write conclusion	2026-05-04 01:21:26 +02:00
Andreas Tsouchlos	5e26179154	Finish intro	2026-05-03 20:51:32 +02:00