Using the enormous photometric catalogues of the Vera C. Rubin Observatory to turn cosmic voids into a competitive probe of dark energy — once we understand how fuzzy distances distort their statistics.
The Universe is not a uniform soup of galaxies. Matter is woven into an intricate web of filaments, sheets and clusters — and the largest pieces of that mosaic are the cosmic voids: vast, underdense bubbles that fill most of the volume of the observable Universe.
A typical cosmic void spans tens of mega-parsecs across and contains only a handful of galaxies. Because they are nearly empty, voids are the cleanest cosmological laboratories we have: their interior dynamics are dominated by dark energy rather than non-linear gravity, and the imprint of modified-gravity theories on their growth is amplified rather than washed out.
Theoretically, the VSF was derived from the same excursion-set machinery as the halo mass function (Sheth & van de Weygaert 2004; Jennings et al. 2013). It predicts a peaked distribution: small voids merge into bigger ones as structure grows, leaving a characteristic falloff at large radii whose position is set by the linear growth of structure — and therefore by cosmology.
To exploit the VSF observationally we need (a) accurate galaxy positions in three dimensions and (b) a faithful void-finding algorithm. Both pieces are challenged when galaxy distances come from photometric, not spectroscopic, surveys — which is exactly where Rubin/LSST will live.
Perched at 2 680 m on Cerro Pachón in the Chilean Andes, the NSF–DOE Vera C. Rubin Observatory is about to carry out the most ambitious optical sky survey ever attempted: a ten-year, six-band scan of half the sky called the Legacy Survey of Space and Time (LSST).
Rubin combines an 8.4-m primary mirror with the largest digital camera ever built (3.2 gigapixels). Every clear night it images the southern sky in two filters, returning to the same patch every few days. After a decade, LSST will have catalogued ~20 billion galaxies and as many stars in six bands (u, g, r, i, z, y).
Rubin's depth means that voids will be detected out to redshifts where the universe was less than half its current age, when dark energy was just beginning to dominate. The statistical power is staggering: tens of thousands of voids, an order of magnitude beyond anything we have today. But statistical power is wasted if we cannot control systematics. Photometric distance errors leak directly into the radii and positions of the voids we recover — and the imprint of that leakage on the VSF is exactly what my thesis aims to model.
Each visit to a patch adds a layer of galaxies; after three full passes the survey footprint is dense enough to detect tens of thousands of cosmic voids. The Rubin/LSST cadence is, in reality, ${\sim}\,3\,000$ visits per night over ten years — this animation is a stylised three-pass sketch of the build-up.
A galaxy's redshift tells us its distance. Spectroscopy measures it to ~0.001 by finding a sharp emission or absorption line, but it is slow and expensive. Photometric redshifts infer the same quantity from broadband colours — fast, scalable, and noisy.
A photo-z algorithm — whether a template-fitting code like BPZ or a machine-learning regressor — guesses redshift from a handful of magnitudes by matching observed colours against spectral templates. For LSST, the precision target is σz/(1+z) ≲ 0.03 for the gold sample, with a small catastrophic-outlier rate.
Catastrophic outliers — galaxies whose photo-z is grossly wrong, often because the 4 000 Å break is misidentified — are particularly insidious: they look like real galaxies in the middle of a void, masking real voids.
The advantage of photometric surveys is sheer numbers: Rubin will deliver photo-z for hundreds of times more galaxies than DESI will ever measure spectroscopically. The challenge is to forward-model the photo-z error distribution into the void statistics, propagate it through the void-finder, and recover an unbiased VSF.
My PhD splits into four intertwined sub-projects, each tackling one piece of the photo-z → void puzzle. Detailed write-ups live behind a passphrase until publication — drop me an email if you'd like access.
Extending the universal multiplicity function — the analytic backbone of the void abundance — to include the smearing introduced by photometric redshift errors.
B Private draftForecasting cosmological constraints from the VSF measured on LSST-like SkySim mocks across three redshift bins and three cosmologies.
C Private draftBridging voids detected on the projected sky to their three-dimensional counterparts — a key step for photometric surveys.
D Private draftA deep graph-attention model that learns to classify cosmic voids directly from the Voronoi tessellation of a galaxy catalogue.
I'm always happy to chat about cosmic structure, simulations, or anything that lives in the underdense parts of the Universe.