# What are Scientific Publications?

We can distinguish different categories of scientific publications. *Journal Articles* are either articles that have been peer-reviewed by a journal, or preprints that are currently under submission at a journal. *Proceedings and Conference Contributions* are commonly shorter articles, that have been peer-reviewed and accepted for publication with the conference. *Technical Reports* usually contain mathematical details for which there was too little space in a submission. *Books* are often longer collections of academic work, or sometimes written with the goal to educate a specific topic.

## List of my Publications

Here is a list containing my publications. You can click on a title to read an article you are interested in.

### Journal Articles

- Albert Senen-Cerda, Jaron Sanders (2020).
. Preprint. ArXiv 2002.02247. [Abstract]**Almost Sure Convergence of Dropout Algorithms for Neural Networks**We investigate the convergence and convergence rate of stochastic training algorithms for Neural Networks (NNs) that, over the years, have spawned from Dropout (Hinton et al., 2012). Modeling that neurons in the brain may not fire, dropout algorithms consist in practice of multiplying the weight matrices of a NN component-wise by independently drawn random matrices with {0,1}-valued entries during each iteration of the Feedforward-Backpropagation algorithm. This paper presents a probability theoretical proof that for any NN topology and differentiable polynomially bounded activation functions, if we project the NN’s weights into a compact set and use a dropout algorithm, then the weights converge to a unique stationary set of a projected system of Ordinary Differential Equations (ODEs). We also establish an upper bound on the rate of convergence of Gradient Descent (GD) on the limiting ODEs of dropout algorithms for arborescences (a class of trees) of arbitrary depth and with linear activation functions.

- Long Ma, Jaron Sanders (2019).
. Preprint. ArXiv 1909.04432. [Abstract]**Markov chains for error accumulation in quantum circuits**We study a model for the accumulation of errors in multi-qubit quantum computations, as well as a model describing continuous errors accumulating in a single qubit. By modeling the error process in a quantum computation using two coupled Markov chains, we are able to capture a weak form of time-dependency between errors in the past and future. By subsequently using techniques from the field of discrete probability theory, we calculate the probability that error measures such as the fidelity and trace distance exceed a threshold analytically. The formulae cover fairly generic error distributions, cover multi-qubit scenarios, and are applicable to e.g. the randomized benchmarking protocol. To combat the numerical challenge that may occur when evaluating our expressions, we additionally provide an analytical bound on the error probabilities that is of lower numerical complexity, and we also discuss a state space reduction that occurs for stabilizer circuits. Finally, taking inspiration from the field of operations research, we illustrate how our expressions can be used to e.g. decide how many gates one can apply before too many errors accumulate with high probability, and how one can lower the rate of error accumulation in existing circuits through simulated annealing.

- Jaron Sanders, Alexandre Proutiere, Se-Young Yun (2019).
. Accepted for publication in Annals of Statistics. ArXiv 1712.09232v3. [Download] [Abstract]**Clustering in Block Markov Chains**This paper considers cluster detection in Block Markov Chains (BMCs). These Markov chains are characterized by a block structure in their transition matrix. More precisely, the n possible states are divided into a finite number of K groups or clusters, such that states in the same cluster exhibit the same transition rates to other states. One observes a trajectory of the Markov chain, and the objective is to recover, from this observation only, the (initially unknown) clusters. In this paper we devise a clustering procedure that accurately, efficiently, and provably detects the clusters. We first derive a fundamental information-theoretical lower bound on the detection error rate satisfied under any clustering algorithm. This bound identifies the parameters of the BMC, and trajectory lengths, for which it is possible to accurately detect the clusters. We next develop two clustering algorithms that can together accurately recover the cluster structure from the shortest possible trajectories, whenever the parameters allow detection. These algorithms thus reach the fundamental detectability limit, and are optimal in that sense.*Revision of:*Jaron Sanders, Alexandre Proutiere, Se-Young Yun (2018).. Preprint. ArXiv 1712.09232v2. [Download] [Abstract]**Clustering in Block Markov Chains**This paper considers cluster detection in Block Markov Chains (BMCs). These Markov chains are characterized by a block structure in their transition matrix. More precisely, the n possible states are divided into a finite number of K groups or clusters, such that states in the same cluster exhibit the same transition rates to other states. One observes a trajectory of the Markov chain, and the objective is to recover, from this observation only, the (initially unknown) clusters. In this paper we devise a clustering procedure that accurately, efficiently, and provably detects the clusters. We first derive a fundamental information-theoretical lower bound on the detection error rate satisfied under any clustering algorithm. This bound identifies the parameters of the BMC, and trajectory lengths, for which it is possible to accurately detect the clusters. We next develop two clustering algorithms that can together accurately recover the cluster structure from the shortest possible trajectories, whenever the parameters allow detection. These algorithms thus reach the fundamental detectability limit, and are optimal in that sense.*Revision of:*Jaron Sanders, Alexandre Proutiere (2017).. Preprint. ArXiv 1712.09232v1. [Download] [Abstract]**Optimal clustering algorithms in Block Markov Chains**This paper considers cluster detection in Block Markov Chains (BMCs). These Markov chains are characterized by a block structure in their transition matrix. More precisely, the n possible states are divided into a finite number of K groups or clusters, such that states in the same cluster exhibit the same transition rates to other states. One observes a trajectory of the Markov chain, and the objective is to recover, from this observation only, the (initially unknown) clusters. In this paper we devise a clustering procedure that accurately, efficiently, and provably detects the clusters. We first derive a fundamental information-theoretical lower bound on the detection error rate satisfied under any clustering algorithm. This bound identifies the parameters of the BMC, and trajectory lengths, for which it is possible to accurately detect the clusters. We next develop two clustering algorithms that can together accurately recover the cluster structure from the shortest possible trajectories, whenever the parameters allow detection. These algorithms thus reach the fundamental detectability limit, and are optimal in that sense.

- P. Bermolen, M. Jonckheere, Jaron Sanders (2017).
. Journal Article. Journal of Statistical Physics 169(5), p. 989-1018. [Abstract]**Scaling limits and generic bounds for exploration processes**We consider exploration algorithms of the random sequential adsorption type both for homogeneous random graphs and random geometric graphs based on spatial Poisson processes. At each step, a vertex of the graph becomes active and its neighboring nodes become explored. Given an initial number of vertices N growing to infinity, we study statistical properties of the proportion of explored nodes in time using scaling limits. We obtain exact limits for homogeneous graphs and prove an explicit central limit theorem for the final proportion of active nodes, known as the jamming constant, through a diffusion approximation for the exploration process. We then focus on bounding the trajectories of such exploration processes on random geometric graphs, i.e. random sequential adsorption. As opposed to homogeneous random graphs, these do not allow for a reduction in dimensionality. Instead we build on a fundamental relationship between the number of explored nodes and the discovered volume in the spatial process, and obtain generic bounds: bounds that are independent of the dimension of space and the detailed shape of the volume associated to the discovered node. Lastly, we give two trajectorial interpretations of our bounds by constructing two coupled processes that have the same fluid limits.

- Jaron Sanders, S.C. Borst, J.S.H. van Leeuwaarden, A.J.E.M. Janssen (2014).
. Journal Article. Stochastic Systems 7(2). [Abstract]**Optimal admission control for many-server systems with QED-driven revenues**We consider Markovian many-server systems with admission control operating in a QED regime, where the relative utilization approaches unity while the number of servers grows large, providing natural Economies-of-Scale. In order to determine the optimal admission control policy, we adopt a revenue maximization framework, and suppose that the revenue rate attains a maximum when no customers are waiting and no servers are idling. When the revenue function scales properly with the system size, we show that a nondegenerate optimization problem arises in the limit. Detailed analysis demonstrates that the revenue is maximized by nontrivial policies that bar customers from entering when the queue length exceeds a certain threshold of the order of the typical square-root level variation in the system occupancy. We identify a fundamental equation characterizing the optimal threshold, which we extensively leverage to provide broadly applicable upper/lower bounds for the optimal threshold, establish its monotonicity, and examine its asymptotic behavior, all for general revenue structures. For linear and exponential revenue structures, we present explicit expressions for the optimal threshold.

- Jaron Sanders, S.C. Borst, J.S.H. van Leeuwaarden, A.J.E.M. Janssen (2016).
. Journal Article. Operations Research Letters 44(3), p. 359-365. [Abstract]**Optimality gaps in asymptotic dimensioning of many-server systems**The Quality-and-Efficiency-Driven (QED) regime provides a basis for solving asymptotic dimensioning problems that trade off revenue, costs and service quality. We derive bounds for the optimality gaps that capture the differences between the true optimum and the asymptotic optimum based on the QED approximations. Our bounds generalize earlier results for classical many-server systems. We also apply our bounds to a many-server system with threshold control.

- Jaron Sanders, S.C. Borst, J.S.H. van Leeuwaarden (2015).
. Journal Article. IEEE Transactions on Automatic Control 61(7), p. 1838-1853. [Abstract]**Online network optimization using product-form Markov processes**We develop a gradient algorithm for optimizing the performance of product-form networks through online adjustment of control parameters. The use of standard algorithms for finding optimal parameter settings is hampered by the prohibitive computational burden of calculating the gradient in terms of the stationary probabilities. The proposed approach instead relies on measuring empirical frequencies of the various states through simulation or online operation so as to obtain estimates for the gradient. Besides the reduction in computational effort, a further benefit of the online operation lies in the natural adaptation to slow variations in ambient parameters as commonly occurring in dynamic environments. On the downside, the measurements result in inherently noisy and biased estimates. We exploit mixing time results in order to overcome the impact of the bias and establish sufficient conditions for convergence to a globally optimal solution.

We discuss our algorithm in the context of different systems, including queueing networks, loss networks, and wireless networks. We also illustrate how the algorithm can be used in such systems to optimize a service/cost trade-off, to map parameter regions that lead to systems meeting specified constraints, and to achieve target performance measures. For the latter application, we first identify which performance measures can be controlled depending on the set of configurable parameters. We then characterize the achievable region of performance measures in product-form networks, and finally we describe how our algorithm can be used to achieve the target performance in an online, distributed fashion, depending on the application context.

- Jaron Sanders, Matthieu Jonckheere, Servaas Kokkelmans (2015).
. Journal Article. Physical Review Letters 115(4):043002. [Abstract]**Sub-Poissonian statistics of jamming limits in ultracold Rydberg gases**Several recent experiments have established by measuring the Mandel Q parameter that the number of Rydberg excitations in ultracold gases exhibits sub-Poissonian statistics. This effect is attributed to the Rydberg blockade that occurs due to the strong interatomic interactions between highly-excited atoms. Because of this blockade effect, the system can end up in a state in which all particles are either excited or blocked: a jamming limit. We analyze appropriately constructed random-graph models that capture the blockade effect, and derive formulae for the mean and variance of the number of Rydberg excitations in jamming limits. This yields an explicit relationship between the Mandel Q parameter and the blockade effect, and comparison to measurement data shows strong agreement between theory and experiment.

- Jaron Sanders, Rick van Bijnen, Edgar Vredenbregt, Servaas Kokkelmans (2014).
. Journal Article. Physical Review Letters, 112(16):163001. [Abstract]**Wireless network control of interacting Rydberg atoms**We identify a relation between the dynamics of ultra-cold Rydberg gases in which atoms experience a strong dipole blockade and spontaneous emission, and a stochastic process that models certain wireless random-access networks. We then transfer insights and techniques initially developed for these wireless networks to the realm of Rydberg gases, and explain how the Rydberg gas can be driven into crystal formations using our understanding of wireless networks. Notably, we also propose a method to determine Rabi frequencies (laser intensities) such that particles in the Rydberg gas are excited with specified target excitation probabilities.

- A.J.E.M. Janssen, J.S.H. van Leeuwaarden, Jaron Sanders (2013).
. Journal Article. Performance Evaluation, 70(10), p. 750-769. [Abstract]**Scaled control in the QED regime**We develop many-server asymptotics in the QED regime for models with admission control. The admission control, designed to reduce the incoming traffic in periods of congestion, scales with the size of the system. For a class of Markovian models with this scaled control, we identify the QED limits for two stationary performance measures. We also derive corrected QED approximations, generalizing earlier results for the Erlang B, C and A models. These results are useful for the dimensioning of large systems equipped with an active control policy. In particular, the corrected approximations can be leveraged to establish the optimality gaps related to square-root staffing and asymptotic dimensioning with admission control.

## Proceedings and Conference Contributions

- Jaron Sanders, S.C. Borst, J.S.H. van Leeuwaarden (2012).
. Conference Proceeding. Proceedings of the 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton House, Illinois, USA, October 1-5), p. 928-935. [Abstract]**Achievable performance in product-form networks**We characterize the achievable range of performance measures in product-form networks where one or more system parameters can be freely set by a network operator. Given a product-form network and a set of configurable parameters, we identify which performance measures can be controlled and which target values can be attained. We also discuss an online optimization algorithm, which allows a network operator to set the system parameters so as to achieve target performance metrics. In some cases, the algorithm can be implemented in a distributed fashion, of which we give several examples. Finally, we give conditions that guarantee convergence of the algorithm, under the assumption that the target performance metrics are within the achievable range.

- Jaron Sanders, S.C. Borst, J.S.H. van Leeuwaarden (2012).
. Conference Proceeding. Proceedings of the 6th International Conference on Performance Evaluation Methodologies and Tools (Valuetools, Cargèse, France, October 9-12), p. 21-30. [Abstract]**Online optimization of product-form networks**We develop an online gradient algorithm for optimizing the performance of product-form networks through online adjustment of control parameters. The use of standard algorithms for finding optimal parameter settings is hampered by the prohibitive computational burden of calculating the gradient in terms of the stationary probabilities. The proposed approach instead relies on measuring empirical frequencies of the various states through simulation or online operation so as to obtain estimates for the gradient. Besides the reduction in computational effort, a further benefit of the online operation lies in the natural adaptation to slow variations in ambient parameters as commonly occurring in dynamic environments. On the downside, the measurements result in inherently noisy and biased estimates. We exploit mixing time results in order to overcome the impact of the bias and establish sufficient conditions for convergence to a globally optimal solution.

### Technical Reports

- Paola Bermolen, Matthieu Jonckheere, Jaron Sanders (2015).
. Technical report. ArXiv 1504.02438. [Abstract]**Scaling limits for exploration algorithms on random graphs**We consider exploration algorithms of the random sequential adsorption type both for homogeneous random graphs and random geometric graphs based on spatial Poisson processes. At each step, a vertex of the graph becomes active and its neighboring nodes become explored. Given an initial number of vertices N growing to infinity, we study statistical properties of the proportion of explored nodes in time using scaling limits. We obtain exact limits for homogeneous graphs and prove an explicit central limit theorem for the final proportion of active nodes, known as the jamming constant, through a diffusion approximation for the exploration process. We then focus on bounding the trajectories of such exploration processes on random geometric graphs, i.e. random sequential adsorption. As opposed to homogeneous random graphs, these do not allow for a reduction in dimensionality. Instead we build on a fundamental relationship between the number of explored nodes and the discovered volume in the spatial process, and obtain generic bounds: bounds that are independent of the dimension of space and the detailed shape of the volume associated to the discovered node. Lastly, we give two trajectorial interpretations of our bounds by constructing two coupled processes that have the same fluid limits.

### Books

- Jaron Sanders (2016).
. PhD thesis. Eindhoven University of Technology. [Abstract]**Stochastic Optimization of Large-Scale Complex Systems**This thesis develops analysis techniques and optimization procedures that are broadly applicable to large-scale complex systems. The focus is on probabilistic models of interacting particle systems, stochastic networks, and service systems, wwhich are all large systems and display fascinatingly complex behavior. In practice, these systems obey simple local rules leading to Markov processes that are amenable to analyses that shed light on the interplay between the local rules and the global system behavior. Chapter 1 provides an overview of this thesis’s content, and illustrates our approaches of analysis and optimization.

Chapter 2 deals with the topic Control and optimization of large-scale stochastic networks. There, we develop optimization algorithms that are applicable to the whole class of product-form Markov processes. These algorithms can be implemented in such a way that individual components of stochastic networks make autonomous decisions that ultimately lead to globally optimal network behavior, and can for instance be used to balance a network of queues, i.e. to achieve equal average queue lengths. The algorithm does so by solving an inversion problem in an online fashion: every queue individually adapts its service rate based on online observations of its own average queue length. The individual queues do not need global network information (like the network structure), and even though all queues influence each other, the algorithm guarantees that the whole network achieves their common goal of a balanced operation.

Throughout Chapters 3, 4, and 5, we discuss the topic Ultracold Rydberg gases and quantum engineering. Rydberg gases consist of atoms that exhibit strong mutual blockade effects, and this gives rise to an interacting particle system with intriguing complex interactions. These particle systems are investigated in laboratory environments because of their application in quantum computing and condensed matter physics. Our research finds that in certain regimes, the complex interactions of these particles can be described using the stochastic processes that also model the behavior of transmitters in wireless networks. This allows us to identify interesting connections between the fields of physics and mathematics, and to transfer techniques and insights from applied probability to the realm of Rydberg gases. For example, the optimization algorithms for large scale stochastic networks (described above) can be used to actively engineer the atomic system, and by constructing special random graphs we can give theoretical descriptions of statistical properties of the Rydberg gas.

Chapters 6, 7, and 8 cover the final topic Performance analysis and revenue maximization of critically loaded service systems. There, we consider large scale Markovian many-server systems that operate in the so-called Quality-andEfficiency-Driven (QED) regime and dwarf the usual trade-off between high system utilization and short waiting times. In order to achieve these dual goals, these systems are scaled so as to approach full utilization, while the number of servers grows simultaneously large, rendering crucial Economies-of-Scale. Our research extends the applicability of the QED regime by incorporating scalable admission control schemes and general revenue functions. This also allows us to identify for a broad class of revenue functions exactly which nontrivial threshold control policies are optimal in the QED regime, yielding insight into the relation between the optimal control and revenue structure. By studying the system’s precise asymptotic behavior when nearing the QED regime, we are able to also analyze the effectiveness of the QED regime as a framework for system dimensioning.

### Other

- Jaron Sanders (2014).
. Journalistic article. STAtOR.**Atomaire gassen en draadloze netwerken**

- Jaron Sanders (2011).
. MSc thesis. Eindhoven University of Technology.**Interacting particles in wireless networks and Rydberg gasses**