News for August 2025

Our press release this month features seven papers. Let’s take a look: a survey on efficiently testable graph properties; two papers on new directions in graph property testing; one that, after some meditations on distribution testing, introduces a novel direction; another on testing properties of strings; and two papers exploring property testing with a quantum twist. Without further delay, let us analyze our spread.

Polynomial Property Testing by Lior Gishboliner and Asaf Shapira (arXiv) This survey traces the development of property-testers for dense graph properties through the ages. The motivating question echoes throughout the survey: what is it that makes a dense graph property efficiently testable? Noting that any tester for your favorite property tester can be simulated using a canonical tester with at most a quadratic overhead in the number of queries, the survey focuses on properties which admit canonical testers that make at most $poly(1/\varepsilon)$ queries. This part of the story is narrated in Section 3 of the paper. This section begins by reviewing the seminal result of Alon from 2002 which kicked off research in this area which asserts that a property of $H$-freeness can be tested efficiently $\textit{iff}$ $H$ is bipartite. Earlier this year, Gishboliner and Shapira extended this result to testing the property of $H$-freeness in $k$-uniform-hypergraphs. This result asserts that such a property is efficiently testable $\textit{iff}$ $H$ is $k$-partite. The survey then presents a result which extends this result to the setting of induced subgraphs. In particular, this result characterizes for which graphs $H$, is the task of testing $H$-induced-subgraph freeness efficiently testable. The section concludes with a discussion of testing graph properties of bounded $\textit{VC}$-dimension.

Quality control in sublinear time: a case study via random graphs by Cassandra Marcussen, Ronitt Rubinfeld, and Madhu Sudan (arXiv) This paper introduces a general framework in which you can situate the following kinds of questions: So suppose you are given a probability parameter $p$ and a value $k \in \mathbb{N}$. Additionally, suppose I give you a graph $G = (V,E)$ and I ask — is it true that the quantity

$\rho_K(G) \colon = C_k(G)/\textbf{E}_{H \sim \mathcal{G}’_{n,p}}[C_k(G’)] \approx 1$

or is it the case that

$|\rho_K(G) – 1| > \varepsilon$

(where the function $C_k$ counts the number of $k$-sized cliques in $G$). The paper presents algorithms for this problem which issue at most $\Big(\frac{1}{p}\Big)^{O(k)}$ queries.
Now, let me explain the broader framework this paper introduces. You want to think of $\rho_k(G)$ as measuring the quality of your graph $G$. You want to understand whether $G$ is distributed as a genuine sample (that is, whether $G$ is a “high-quality graph”) according to $\mathcal{G}_{n,p}$ as measured by the function $C_k$. In general, as the authors explain via an analogy, this is supposed to capture the following sentiment: you go to a bookstore and you want to buy a book with quality close to $1$. The goal thus, is to choose a high-quality book. Note that this goal is weaker than both of the following goals: testing whether

$\text{quality(book)} \approx 1$ for an adversarial book,

OR whether

$\text{book} \sim \texttt{Some Distribution of your choice over books in store}$.

The authors situate quite a few other problems in this overarching framework and present algorithms for these.

Sublinear-Time Approximation for Graph Frequency Vectors in Hyperfinite Graphs by Gregory Moroie (arXiv) A useful first step for several algorithms on planar graphs of bounded degree is to exploit the fact that such graphs admit a hyperfinite decomposition. Using a primitive called the partition oracle, one can estimate in sublinear time various important graph parameters — like max-cut value, independent set size and so on to within a multiplicative $$(1 \pm \varepsilon)$. Often, a helpful step in these algorithms proceeds via a graph simplification step where one approximates the input graph with a crude sketch which tracks frequency counts of various subgraphs on the pieces produced by the partition oracle each of which have size at most $k = O(1/\varepsilon^2)$. The featured paper gives algorithms which find a graph $H$ in time $poly(1/\varepsilon, d^k)$ which can be used to approximate the corresponding frequency vector in $G$.

Instance-Optimal Uniformity Testing and Tracking by Guy Blanc, Clément L. Canonne, and Erik Waingarten (arXiv) After some meditations on the task of testing uniformity of distributions with support $[n]$, the featured paper suggests that despite the impressive algorithmic success on this task, there is a lot that one still desires. In particular, one of the fundamental reasons for dissatisfaction with the sample-optimal results for the above task, is that often in real-life situations, the only thing we are interested is any evidence of non-uniformity — without any distance notion in the picture. So, the paper formalizes the following question: Suppose we have a setup where at each time step $t \in \mathbb{N}$, we receive as input a sample $x_t \sim \boldsymbol{p}$ where $\boldsymbol{p}$ is an unknown probability distribution supported on $[n]$. We want our algorithm to run continuously and return reject if at any point in time, the algorithm discovers $\boldsymbol{p} \neq \texttt{UNIF}[n]$. The goal is to do this in as less time as possible. To rule out trivial algorithms which straight-up reject $\boldsymbol{p}$, we require that the probability that the algorihtm EVER rejects $\texttt{UNIF}[n]$ itself is at most some sufficiently small $\delta > 0$. Denoting by $OPT(\boldsymbol{p})$ the sample complexity of the best possible algorithm for this task which knows the entire distribution profile $\boldsymbol{p}$ in hindsight. The featured paper devises an algorithm for this problem which is $poly(\log (OPT))$-competitive against this best algorithm.

Counting Distinct Square Substrings in Sublinear Time by Panagiotis Charalampopoulos, Manal Mohamed, Jakub Radoszewski, Wojciech Rytter, Tomasz Waleń, and Wiktor Zuba (arXiv) A square in a string is a substring of the form $XX$, where $X$ is a nonempty sequence of characters. The featured paper studies how to count the number of distinct square substrings in a string of length $n$ over an alphabet of size $\sigma$, given in packed form. The authors design a sublinear-time algorithm running in $O(n/\log_\sigma n)$ in the word-RAM model by exploiting the structure of runs (maximal periodic fragments). To handle these runs systematically, the paper uses sparse Lyndon roots — where a Lyndon root is essentially the lexicographically smallest rotation of a repeated block — so that each repetition has a unique representative. The paper further groups certain overlapping runs into pyramidal structures to make counting efficient. This avoids scanning every position of the string and shows that distinct squares can be counted strictly faster than reading the entire string symbol by symbol, giving the first sublinear-time algorithm for this problem.

Resource-efficient algorithm for estimating the trace of quantum state powers by Myeongjin Shin, Junseo Lee, Seungwoo Lee, and Kabgyun Jeong (arXiv) The problem of estimating the trace of quantum state powers refers to computing $\text{Tr}(\rho^k)$, where $\rho$ is a quantum state (density matrix) and $k$ is an integer — this quantity is key for tasks such as evaluating nonlinear functions of quantum states, Rényi entropy, and detecting entanglement. The featured paper presents a resource-efficient algorithm that reduces the quantum hardware requirements by using only $O(\widetilde r)$ qubits and $O(\widetilde r)$ multi-qubit gates, where $\widetilde{r} = \min\Big({\text{rank}(\rho), \lceil \ln(2k / \varepsilon)\rceil}\Big)$. The featured paper builds on tools from mathematical foundations of entaglement spectroscopy (something called the Newton-Girard method): it estimates the traces of low powers $\text{Tr}(\rho^i)$ for $i = 1, \dots, \widetilde r$ using shallow quantum circuits and then uses classical recursion to reconstruct or approximate $\text{Tr}(\rho^k)$ even when $k$ is large — achieving rank-dependent complexity that works well for low-rank states.

Adversarially robust quantum state learning and testing by Maryam Aliakbarpour, Vladimir Braverman, Nai-Hui Chia, and Yuhan Liu (arXiv) The featured paper studies quantum state learning under a strong corruption model called $\gamma$-adversarial corruption, where an adversary can arbitrarily modify a $\gamma$-fraction of measurement outcomes, going beyond the usual SPAM noise assumptions. The authors give a non-adaptive measurement algorithm that learns a rank-$r$ state up to trace-distance error $\widetilde O(\gamma \sqrt{r})$, and also proves a matching lower bound $\Omega(\gamma \sqrt{r})$, showing the guarantee is optimal. As the abstract notes (and piques your curiosity from the get-go), the results are at once optimistic and pessimistic: for constant-rank states, the error bound is dimension-independent, so adversarial corruption can be tolerated without dependence on the Hilbert space dimension; but for general full-rank states, the error necessarily scales as $\gamma \sqrt{d}$, which means even a vanishingly small corruption rate, around $1/\sqrt{d}$, is enough to destroy any non-adaptive learning algorithm.

The paper underscores that dimension-independent error is inevitable in this adversarial setting because single-copy measurements carry very limited information, so any algorithm must amplify small statistical differences to extract the state. This sensitivity leaves the estimation process inherently vulnerable to adversarially introduced errors unless the state has low rank, in which case the amount of information required to recover the state is substantially smaller.

Property Testing Review

The latest in property testing and sublinear time algorithms

Leave a Reply Cancel reply