PhD Research · 2025 — 2028

Systematic effects of photometric redshifts
on the Void Size Function

Using the enormous photometric catalogues of the Vera C. Rubin Observatory to turn cosmic voids into a competitive probe of dark energy — once we understand how fuzzy distances distort their statistics.

Supervisor: Alice Pisani CPPM · CNRS / IN2P3 Aix-Marseille Université LSST collaboration
Scroll
01 / Foundations

Cosmic voids and the Void Size Function

The Universe is not a uniform soup of galaxies. Matter is woven into an intricate web of filaments, sheets and clusters — and the largest pieces of that mosaic are the cosmic voids: vast, underdense bubbles that fill most of the volume of the observable Universe.

A typical cosmic void spans tens of mega-parsecs across and contains only a handful of galaxies. Because they are nearly empty, voids are the cleanest cosmological laboratories we have: their interior dynamics are dominated by dark energy rather than non-linear gravity, and the imprint of modified-gravity theories on their growth is amplified rather than washed out.

The Void Size Function (VSF) — the number density of voids as a function of their radius — encodes how the cosmic web grew under the competing pull of gravity and the push of dark energy. Its shape is sensitive to Ωm, the dark-energy equation of state w, and to modifications of general relativity.
  • Voids cover ≳60 % of the Universe by volume yet host a small fraction of galaxies.
  • Their abundance follows a near-universal excursion-set prediction.
  • Combined with the void–galaxy cross-correlation, they constrain w to a few percent.
Schematic of a cosmic void embedded in the cosmic web
An underdense void carved out of the cosmic web of galaxies and filaments.

Theoretically, the VSF was derived from the same excursion-set machinery as the halo mass function (Sheth & van de Weygaert 2004; Jennings et al. 2013). It predicts a peaked distribution: small voids merge into bigger ones as structure grows, leaving a characteristic falloff at large radii whose position is set by the linear growth of structure — and therefore by cosmology.

To exploit the VSF observationally we need (a) accurate galaxy positions in three dimensions and (b) a faithful void-finding algorithm. Both pieces are challenged when galaxy distances come from photometric, not spectroscopic, surveys — which is exactly where Rubin/LSST will live.

The void size function with photometric distortions
The theoretical VSF (solid cyan) versus a photometrically-distorted measurement (dashed pink) — the gap is what my PhD aims to model and remove.
02 / The Survey

The Vera C. Rubin Observatory & LSST

Perched at 2 680 m on Cerro Pachón in the Chilean Andes, the NSF–DOE Vera C. Rubin Observatory is about to carry out the most ambitious optical sky survey ever attempted: a ten-year, six-band scan of half the sky called the Legacy Survey of Space and Time (LSST).

Star trails over the Vera C. Rubin Observatory at night
The Rubin Observatory dome on Cerro Pachón, with star trails recording the rotation of the southern sky. Credit: NSF / NOIRLab / AURA / Rubin Observatory — CC BY 4.0.

Rubin combines an 8.4-m primary mirror with the largest digital camera ever built (3.2 gigapixels). Every clear night it images the southern sky in two filters, returning to the same patch every few days. After a decade, LSST will have catalogued ~20 billion galaxies and as many stars in six bands (u, g, r, i, z, y).

For cosmology, LSST is a photometric survey: its enormous depth and area come at the cost of low-resolution galaxy spectra. Distances must be inferred from colours rather than from absorption lines.
  • ~37 billion stars and galaxies across 18 000 deg² of sky.
  • 6 bands · 10 years · ~3 000 images per night.
  • Dark Energy Science Collaboration (DESC) — the gateway through which I contribute.

Rubin's depth means that voids will be detected out to redshifts where the universe was less than half its current age, when dark energy was just beginning to dominate. The statistical power is staggering: tens of thousands of voids, an order of magnitude beyond anything we have today. But statistical power is wasted if we cannot control systematics. Photometric distance errors leak directly into the radii and positions of the voids we recover — and the imprint of that leakage on the VSF is exactly what my thesis aims to model.

SOUTHERN SKY * 6 PATCHES * 5 PASSES CERRO PACHON * 2680 m
scroll ↓ to drive the survey

Each visit to a patch adds a layer of galaxies; after three full passes the survey footprint is dense enough to detect tens of thousands of cosmic voids. The Rubin/LSST cadence is, in reality, ${\sim}\,3\,000$ visits per night over ten years — this animation is a stylised three-pass sketch of the build-up.

03 / The Challenge

Photometric redshifts — power and pitfalls

A galaxy's redshift tells us its distance. Spectroscopy measures it to ~0.001 by finding a sharp emission or absorption line, but it is slow and expensive. Photometric redshifts infer the same quantity from broadband colours — fast, scalable, and noisy.

A photo-z algorithm — whether a template-fitting code like BPZ or a machine-learning regressor — guesses redshift from a handful of magnitudes by matching observed colours against spectral templates. For LSST, the precision target is σz/(1+z) ≲ 0.03 for the gold sample, with a small catastrophic-outlier rate.

Why this matters for voids: a void radius is literally a distance. Photo-z noise smears galaxies along the line of sight, blurring void boundaries and erasing small voids. Photo-z bias shifts the radial scale, stretching or compressing the entire VSF.

Catastrophic outliers — galaxies whose photo-z is grossly wrong, often because the 4 000 Å break is misidentified — are particularly insidious: they look like real galaxies in the middle of a void, masking real voids.

Scatter of photo-z vs spec-z with catastrophic outliers
The photo-z error budget: a tight scatter around the y = x line, a small bias, and a population of catastrophic outliers (pink) that mimic foreground or background galaxies.

The advantage of photometric surveys is sheer numbers: Rubin will deliver photo-z for hundreds of times more galaxies than DESI will ever measure spectroscopically. The challenge is to forward-model the photo-z error distribution into the void statistics, propagate it through the void-finder, and recover an unbiased VSF.

  • ~99 % of LSST galaxies will have only photometric redshifts.
  • Photo-z scatter σ ~ 0.03 (1+z) translates to a radial smearing of ~100 Mpc/h at z = 1.
  • Need joint calibration of the n(z) using a small spec-z subset (DESI, 4MOST).
Cosmic web from a large cosmological simulation
A slice of the cosmic web from a large cosmological simulation — the kind of synthetic data I use to test how photo-z errors propagate into the void catalogue. Credit: Illustris Collaboration / ESO — public domain.
04 / Ongoing work

Sub-projects

My PhD splits into four intertwined sub-projects, each tackling one piece of the photo-z → void puzzle. Detailed write-ups live behind a passphrase until publication — drop me an email if you'd like access.

Want to discuss voids, photo-z or LSST?

I'm always happy to chat about cosmic structure, simulations, or anything that lives in the underdense parts of the Universe.