The Splashback Radius of Halos from Particle Dynamics. III. Halo Catalogs, Merger Trees, and Host–Subhalo Relations

Benedikt Diemer

doi:10.3847/1538-4365/abbf51

1. Introduction

In the widely accepted ΛCDM framework, dark matter halos are the building blocks of structure, with galaxies growing at their centers (Rees & Ostriker 1977; Silk 1977; White & Rees 1978). As soon as we develop this basic picture into a more quantitative understanding, we face the need to assign halos some measure of size and mass. For example, the mass function of halos is predicted to follow a relatively simple form (Press & Schechter 1974; Bond et al. 1991), but this statement is naturally predicated on there being a meaningful way to measure a halo's mass. Similarly, we wish to connect the properties of halos to their galaxies via techniques such as abundance matching, occupation distributions, or semianalytical modeling, all of which rely on a definition of the halo boundary (see Wechsler & Tinker 2018 for a review).

The most common solution is the spherical overdensity (SO) definition of halo properties, where the radius ${R}_{{\rm{\Delta }}}$ is defined to enclose a fixed overdensity Δ with respect to either the critical or mean density of the universe (e.g., Lacey & Cole 1994). This definition leads to variants such as ${R}_{500{\rm{c}}}$ , ${R}_{200{\rm{c}}}$ , ${R}_{\mathrm{vir}}$ , and ${R}_{200{\rm{m}}}$ and the corresponding masses ${M}_{{\rm{\Delta }}}$ (in descending order of overdensity; see Table 1 for the detailed definitions and Figure 1 for a visualization). The main issue with SO radii is how to choose the density threshold. One popular definition is the so-called virial overdensity, ${{\rm{\Delta }}}_{\mathrm{vir}}=18{\pi }^{2}\approx 178$ , derived from the collapse of an isolated top-hat overdensity in an Einstein–de Sitter universe (Gunn & Gott 1972; Peebles 1980; Lacey & Cole 1993). In a ΛCDM universe, the overdensity evolves with time (Lahav et al. 1991; Bryan & Norman 1998). However, the assumptions of the top-hat collapse model are not well justified, chiefly because halos do not form in isolation and because the initial peaks are not top-hat in shape (e.g., Dalal et al. 2010). Nevertheless, SO definitions are simple to compute and easy to understand, and various density thresholds are adopted for mostly practical reasons. For example, ${R}_{200{\rm{c}}}$ is often used for historical reasons, whereas the X-ray cluster community prefers small radii such as ${R}_{500{\rm{c}}}$ or even R_2500c because the X-ray signal is measurable only within a small aperture.

**Figure 1.** Visualization of radius definitions for two representative halos with low and high accretion rates (left and right panels, respectively). Both halos have masses of about $1.5\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ and were selected from the L0125-WMAP7 simulation at z = 0. The logarithmic color map shows projected density through slabs of thickness $0.3{R}_{200{\rm{m}}}$ . Each line shows one of the radius definitions discussed in this paper. The red circles indicate, from the center outward, ${R}_{500{\rm{c}}}$ , ${R}_{200{\rm{c}}}$ , ${R}_{\mathrm{vir}}$ , and ${R}_{200{\rm{m}}}$ . The yellow circles show different estimates of the splashback radius, namely ${R}_{\mathrm{sp},\mathrm{mn}}$ , ${R}_{\mathrm{sp},75 \% }$ , and ${R}_{\mathrm{sp},90 \% }$ . In the left panel, all splashback radii are larger than ${R}_{200{\rm{m}}}$ owing to the relatively low accretion rate of the halo, whereas ${R}_{200{\rm{m}}}\approx {R}_{\mathrm{sp},75 \% }$ for the fast-accreting halo in the right panel. The dotted–dashed yellow lines show the More et al. (2015) prediction for the radius where the slope of the density profile is steepest, which corresponds to a visible drop in the projected density. The relationship of this radius to the dynamically calculated radii from Sparta also depends on the accretion rate: in the left panel, this radius is greater than ${R}_{\mathrm{sp},75 \% }$ ; in the right panel, it is smaller. The white dashed lines show slices through the three-dimensional splashback shells computed by Shellfish (Mansfield et al. 2017). For the slowly accreting halo on the left, the shell is almost spherical and coincides with ${R}_{\mathrm{sp},90 \% }$ . The fast-accreting halo on the right is less spherical, and the Shellfish shell visibly traces the density drop toward the lower left of the halo. The background visualizations were created using the gotetra code by P. Mansfield (https://github.com/phil-mansfield/gotetra), which uses a tetrahedron-based estimate of the density field (Abel et al. 2012; Kaehler et al. 2012; Hahn et al. 2013). The same halos are shown in Figure 1 of More et al. (2015), although with a thinner projection depth and a different color scheme.
Download figure:
Standard image High-resolution image

**Figure 1.** Visualization of radius definitions for two representative halos with low and high accretion rates (left and right panels, respectively). Both halos have masses of about $1.5\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ and were selected from the L0125-WMAP7 simulation at z = 0. The logarithmic color map shows projected density through slabs of thickness $0.3{R}_{200{\rm{m}}}$ . Each line shows one of the radius definitions discussed in this paper. The red circles indicate, from the center outward, ${R}_{500{\rm{c}}}$ , ${R}_{200{\rm{c}}}$ , ${R}_{\mathrm{vir}}$ , and ${R}_{200{\rm{m}}}$ . The yellow circles show different estimates of the splashback radius, namely ${R}_{\mathrm{sp},\mathrm{mn}}$ , ${R}_{\mathrm{sp},75 \% }$ , and ${R}_{\mathrm{sp},90 \% }$ . In the left panel, all splashback radii are larger than ${R}_{200{\rm{m}}}$ owing to the relatively low accretion rate of the halo, whereas ${R}_{200{\rm{m}}}\approx {R}_{\mathrm{sp},75 \% }$ for the fast-accreting halo in the right panel. The dotted–dashed yellow lines show the More et al. (2015) prediction for the radius where the slope of the density profile is steepest, which corresponds to a visible drop in the projected density. The relationship of this radius to the dynamically calculated radii from Sparta also depends on the accretion rate: in the left panel, this radius is greater than ${R}_{\mathrm{sp},75 \% }$ ; in the right panel, it is smaller. The white dashed lines show slices through the three-dimensional splashback shells computed by Shellfish (Mansfield et al. 2017). For the slowly accreting halo on the left, the shell is almost spherical and coincides with ${R}_{\mathrm{sp},90 \% }$ . The fast-accreting halo on the right is less spherical, and the Shellfish shell visibly traces the density drop toward the lower left of the halo. The background visualizations were created using the gotetra code by P. Mansfield (https://github.com/phil-mansfield/gotetra), which uses a tetrahedron-based estimate of the density field (Abel et al. 2012; Kaehler et al. 2012; Hahn et al. 2013). The same halos are shown in Figure 1 of More et al. (2015), although with a thinner projection depth and a different color scheme.
Download figure:
Standard image High-resolution image

Table 1. Definitions of the Symbols Used in This Paper

Symbol	Meaning
${\rho }_{{\rm{m}}}$	Mean matter density of the universe
${\rho }_{{\rm{c}}}$	Critical density of the universe
Δ	An overdensity with respect to either ${\rho }_{{\rm{m}}}$ or ${\rho }_{{\rm{c}}}$
${R}_{{\rm{X}}}$	A particular definition X of the halo boundary
${M}_{{\rm{X}}}$	Mass inside ${R}_{{\rm{X}}}$
${N}_{{\rm{X}}}$	Number of particles inside ${R}_{{\rm{X}}}$
${R}_{200{\rm{m}}}$	Radius enclosing an overdensity of $200\times {\rho }_{{\rm{m}}}$
${R}_{200{\rm{c}}}$	The radius enclosing an overdensity of $200\times {\rho }_{{\rm{c}}}$
${R}_{\mathrm{vir}}$	${R}_{{\rm{\Delta }}}$ with varying overdensity (Bryan & Norman 1998)
${M}_{{\rm{X}},\mathrm{peak}}$	Peak mass attained during halo history
${M}_{{\rm{X}},\mathrm{acc}}$	Mass of subhalo at accretion onto host
${M}_{{\rm{\Delta }},\mathrm{all}}$	SO mass computed from all particles
${M}_{{\rm{\Delta }},\mathrm{bnd}}$	SO mass computed from bound particles only
${M}_{{\rm{\Delta }},\mathrm{tcr}}$	SO mass computed from only tracer particles in subhalos
${V}_{{\rm{\Delta }}}$	Circular velocity, ${V}_{{\rm{\Delta }}}\equiv \sqrt{{{GM}}_{{\rm{\Delta }}}/{R}_{{\rm{\Delta }}}}$
${V}_{\max }$	Maximum of circular velocity at any radius
${V}_{\mathrm{peak}}$	Maximum ${V}_{\max }$ attained during halo history
ν	Peak height, $\nu \equiv {\nu }_{200{\rm{m}}}={\delta }_{{\rm{c}}}/\sigma ({M}_{200{\rm{m}}},z)$
${R}_{\mathrm{sp}}$	Splashback radius of a halo
${M}_{\mathrm{sp}}$	Splashback mass of a halo (estimated separately from ${R}_{\mathrm{sp}}$ )
${{\rm{\Delta }}}_{\mathrm{sp}}$	Splashback overdensity with respect to ${\rho }_{{\rm{m}}}$ , ${{\rm{\Delta }}}_{\mathrm{sp}}\equiv 3{M}_{\mathrm{sp}}/(4\pi {R}_{\mathrm{sp}}^{3})/{\rho }_{{\rm{m}}}$
${R}_{\mathrm{sp},\mathrm{mn}}$	${R}_{\mathrm{sp}}$ defined as the mean of the particle apocenters
${R}_{\mathrm{sp},50 \% }$	${R}_{\mathrm{sp}}$ defined as the median of the particle apocenters
${R}_{\mathrm{sp},75 \% }$	${R}_{\mathrm{sp}}$ defined as the 75th percentile of the particle apocenters
${t}_{\mathrm{dyn}}$	Dynamical time or crossing time, ${t}_{\mathrm{dyn}}\equiv 2{R}_{200{\rm{m}}}/{V}_{200{\rm{m}}}$
${{\rm{\Gamma }}}_{\mathrm{dyn}}$	Logarithmic mass accretion rate over ${t}_{\mathrm{dyn}}$ , ${\rm{\Delta }}\mathrm{log}(M)/{\rm{\Delta }}\mathrm{log}(a)$

Download table as: ASCII Typeset image

Given the variety of SO definitions, one should always compare halo-related simulation results using multiple definitions. In practice, however, such checks are often unrealistic because halo catalogs are available in only one definition. In principle, it is possible to convert between definitions with a fitting function or by assuming a particular form of the density profile (White 2001, 2002; Hu & Kravtsov 2003; Lukić et al. 2009; Diemer 2018), but these conversions can be fairly inaccurate owing to deviations from the assumed profile or scatter in the concentration–mass relation (e.g., Appendix C of Diemer & Kravtsov 2015). Thus, it is often best to choose one definition and understand the consequences of that choice.

The situation is even more complicated for subhalos, whose center lies within another, larger halo. There, we cannot always define SO masses because the density profile includes a large, possibly dominant contribution from the host. A common solution is to remove gravitationally unbound particles (e.g., Springel et al. 2001; Han et al. 2012; Behroozi et al. 2013a), but the results depend on the exact algorithm. Moreover, subhalos tend to undergo unphysical stripping due to numerical effects—the classic overmerging problem that still plagues modern simulations (e.g., van Kampen 1995; Moore et al. 1996; Klypin et al. 1999; van den Bosch 2017). Both issues can be avoided by defining subhalo masses at their peak or at infall, neglecting the subsequent mass evolution. The complex density structure around subhalos can also be circumvented by using ${V}_{\max }$ , the maximum of the circular velocity curve. However, this definition measures the mass within a rather small radius and is still prone to spikes during mergers (Behroozi et al. 2014). The chosen definition has a sizable impact on the galaxy–halo connection (Reddick et al. 2013), highlighting that there is no one correct choice.

In addition to these practical considerations, we may question whether SO definitions are the physically correct choice for the halo boundary. Besides their somewhat indecisive theoretical foundation, SO definitions lead to a sizable number of "backsplash" or "ejected" halos that reside outside ${R}_{\mathrm{vir}}$ but eventually merge (e.g., Balogh et al. 2000; Mamon et al. 2004; Gill et al. 2005). The sphere of influence of halos also extends beyond ${R}_{\mathrm{vir}}$ in that infalling subhalos begin to lose mass at around two "virial" radii (Behroozi et al. 2014). Finally, SO radii and masses suffer from pseudo-evolution, arguably unphysical changes due to the evolution of the threshold density with the critical or mean density of the universe (Diemand et al. 2005; Cuesta et al. 2008; Diemer et al. 2013b; Zemp 2014; More et al. 2015). All of these problems indicate that SO radii do not typically encompass the entire halo.

To remedy these issues, the splashback radius, ${R}_{\mathrm{sp}}$ , has been suggested as a more physically motivated alternative (Diemer & Kravtsov 2014; Adhikari et al. 2014; More et al. 2015). This radius corresponds to the apocenter of the first orbits of particles and subhalos and is inspired by the spherical collapse model, where it represents the boundary between infalling and orbiting material (Fillmore & Goldreich 1984; Bertschinger 1985; Lithwick & Dalal 2011; Vogelsberger et al. 2011; Adhikari et al. 2014; Shi 2016). Even though the boundary is not as uniquely defined in realistic ΛCDM halos (e.g., Aung et al. 2020), the sharp drop due to particles piling up at their apocenter has been detected in simulations (Diemer & Kravtsov 2014; More et al. 2015; Fong et al. 2018; Okumura et al. 2018; Xhakaj et al. 2019a; Banerjee et al. 2020; Mansfield & Kravtsov 2020) and observations (More et al. 2016; Baxter et al. 2017; Chang et al. 2018; Nishizawa et al. 2018; Shin et al. 2019; Zürcher & More 2019; Murata et al. 2020, see also Tully 2015; Patej & Loeb 2016; Zu et al. 2016; Busch & White 2017; Umetsu & Diemer 2017; Contigiani et al. 2019; Tomooka et al. 2020).

These detections have provided powerful motivation to further explore the splashback radius in simulations. While the initial investigations and observational detections have relied on the steepening feature in stacked density profiles, the profiles of individual halos are often too noisy and influenced by substructure to detect it. Two other techniques to measure ${R}_{\mathrm{sp}}$ have been suggested thus far. First, the Shellfish code of Mansfield et al. (2017) considers the full 3D density field around halos and finds the density drop in infinitesimally thin lines of sight, combining those measurements into a nonspherical splashback shell. This method relies only on density data at the current time and resolves the full shape of the splashback boundary, but it works reliably only for very well resolved halos. Second, the Sparta code introduced in Diemer (2017, hereafter Paper I) tracks the trajectories of dark matter particles to find their first apocenter and defines the halo's splashback radius as an average of those apocenters. First results from this technique were shown in Diemer et al. (2017, hereafter Paper II). While the initial results from Shellfish and Sparta have given us a sense of the relation between ${R}_{\mathrm{sp}}$ , SO radii, and mass accretion rates, they did not allow for a systematic exploration of the impact of mass definitions on structure formation in general. For such investigations, we need halo catalogs, that is, lists of halos at each snapshot including information about the relation of host and subhalos. The latter is critical: without knowing the subhalo status, we cannot separate host halos (e.g., to compute the mass function or assembly bias) or assign galaxies to (sub)halos.

In the present work, we fill this gap by providing the first halo catalogs and merger trees that contain a large range of SO and splashback definitions, as well as the corresponding host–subhalo relations. We have slightly improved the splashback calculations of Paper I and have added a number of SO-related calculations to Sparta. We present a new tool called Moria that combines results from Sparta with the original halo catalogs to create enhanced catalogs and merger trees (see Figure 2 for a schematic of the workflow). Our catalogs are publicly available in multiple formats and will enable the community to easily switch between mass definitions. In this paper, we present the new algorithms and some numerical tests. In a number of follow-up papers, we will analyze the impact of the halo boundary definition on subhalo occupation and mass functions.

**Figure 2.** Flow chart for the Sparta and Moria codes. Red fields represent codes; blue fields represent data products. From the codes' perspective, blue arrows signify inputs, whereas red arrows signify outputs. Particle data, the original output from the simulation, are fed into a halo finder to create catalogs. Sparta needs the halos to be connected between snapshots, that is, it needs progenitor/descendant information. Based on the information in the snapshot and catalog files, Sparta creates an hdf5 output file. Moria combines the information in that file with the original halo catalogs to create enhanced catalogs and merger trees. The simulation data (the left-hand side of the schematic) are discussed in Section 2, the Sparta and Moria codes in Sections 3 and 4, respectively.
Download figure:
Standard image High-resolution image

The paper is structured as follows. We begin by describing the N-body simulations and halo catalogs on which we base our enhanced catalogs in Section 2. We describe the new algorithms in the Sparta and Moria codes in Sections 3 and 4, respectively. We test our data products and present some basic results in Section 5. We discuss the strengths, weaknesses, and potential applications of our catalogs in Section 6, before concluding in Section 7.

Throughout the paper, we follow the notation established in Papers I and II with minor changes (Table 1). The variance of the power spectrum, σ, is computed based on the transfer function approximation of Eisenstein & Hu (1998). The dynamical time is calculated as the time to cross ${R}_{200{\rm{m}}}$ at velocity V_200m, which is independent of halo mass and approaches 1/5 of the Hubble time at high redshift (Paper I). The accretion rate, ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ , is then defined as the logarithmic mass growth over this time interval. All cosmological calculations are performed using the Colossus code (Diemer 2018). The Sparta code and its documentation are publicly available at benediktdiemer.com/code, as are our catalogs and merger trees at benediktdiemer.com/data.

2. Simulation Data

Our catalogs are based on the Erebos suite of N-body simulations, essentially the same as in Papers I and II (Table 2). This collection contains two ΛCDM cosmologies, scale-free universes, and a lower-resolution simulation for testing. The first ΛCDM cosmology is that of the Bolshoi simulation (Klypin et al. 2011), which is consistent with WMAP7 (Komatsu et al. 2011, Ω_m = 0.27, Ω_b = 0.0469, h = 0.7, σ₈ = 0.82, and n_s = 0.95). For this cosmology, we have run seven boxes with side lengths decreasing by factors of two from 2000 down to $31.25\,{h}^{-1}\,\mathrm{Mpc}$ and corresponding particle masses that span more than five orders of magnitude (Table 2). The second cosmology is similar to the Planck Collaboration et al. (2014) cosmology (Ω_m = 0.32, Ω_b = 0.0491, h = 0.67, σ₈ = 0.834, and n_s = 0.9624). For this cosmology, we use three boxes of 500, 250, and $125\,{h}^{-1}\,\mathrm{Mpc}$ . We will denote these cosmologies as WMAP7 and Planck wherever we refer to the respective simulations collectively. The self-similar simulations represent Einstein–de Sitter universes with power-law initial power spectra of slopes −1, −1.5, −2, and −2.5. The parameters of such simulations can be expressed without explicit reference to length or timescales (e.g., Knollmann et al. 2008), but we adjusted them to ΛCDM-like parameters of $L=100\,{h}^{-1}\,\mathrm{Mpc}$ , h = 0.7, and σ₈ = 0.82 for historical reasons. Finally, we use three versions of the same, smaller test simulation, where we have removed none, half, and three-quarters of the snapshots in order to test the convergence of our algorithms with snapshot spacing.

Table 2. N-body Simulations

Name	$L\,({h}^{-1}\,{\rm{Mpc}})$	N³	${m}_{{\rm{p}}}\,({h}^{-1}\,{M}_{\odot })$		$\epsilon /(L/N)$	${z}_{\mathrm{initial}}$	${z}_{\mathrm{final}}$	${N}_{\mathrm{snaps}}$	${z}_{{\rm{f}}-\mathrm{snap}}$	${z}_{{\rm{f}}-\mathrm{cat}}$	Cosmology	Reference
L2000-WMAP7	2000	1024³	$5.6\times {10}^{11}$	65	1/30	49	0	100	20	4.2	WMAP7	DK15
L1000-WMAP7	1000	1024³	$7.0\times {10}^{10}$	33	1/30	49	0	100	20	6.2	WMAP7	DKM13
L0500-WMAP7	500	1024³	$8.7\times {10}^{9}$	14	1/35	49	0	100	20	8.8	WMAP7	DK14
L0250-WMAP7	250	1024³	$1.1\times {10}^{9}$	5.8	1/42	49	0	100	20	11.5	WMAP7	DK14
L0125-WMAP7	125	1024³	$1.4\times {10}^{8}$	2.4	1/51	49	0	100	20	14.5	WMAP7	DK14
L0063-WMAP7	62.5	1024³	$1.7\times {10}^{7}$	1.0	1/60	49	0	100	20	17.6	WMAP7	DK14
L0031-WMAP7	31.25	1024³	$2.1\times {10}^{6}$	0.25	1/122	49	2	64	20	20	WMAP7	DK15
L0500-Planck	500	1024³	$1.0\times {10}^{10}$	14	1/35	49	0	100	20	9.1	Planck	DK15
L0250-Planck	250	1024³	$1.3\times {10}^{9}$	5.8	1/42	49	0	100	20	12.3	Planck	DK15
L0125-Planck	125	1024³	$1.6\times {10}^{8}$	2.4	1/51	49	0	100	20	15.5	Planck	DK15
L0100-PL-1.0	100	1024³	$2.6\times {10}^{8}$	0.5	1/195	119	2	64	20	20	PL, n = −1.0	DK15
L0100-PL-1.5	100	1024³	$2.6\times {10}^{8}$	0.5	1/195	99	1	78	20	20	PL, n = −1.5	DK15
L0100-PL-2.0	100	1024³	$2.6\times {10}^{8}$	1.0	1/98	49	0.5	100	20	15.5	PL, n = −2.0	DK15
L0100-PL-2.5	100	1024³	$2.6\times {10}^{8}$	1.0	1/98	49	0	100	20	5.4	PL, n = −2.5	DK15
TestSim200	62.5	256³	$1.1\times {10}^{9}$	5.8	1/42	49	−0.1	193	9	9	WMAP7	Paper I
TestSim100	62.5	256³	$1.1\times {10}^{9}$	5.8	1/42	49	−0.1	96	9	9	WMAP7	Paper I
TestSim50	62.5	256³	$1.1\times {10}^{9}$	5.8	1/42	49	−0.1	48	9	9	WMAP7	Paper I

Note. The Erebos suite of N-body simulations. L denotes the box size in comoving units, N³ the number of particles, ${m}_{{\rm{p}}}$ the particle mass, and the force softening length in physical ${h}^{-1}\,{\rm{kpc}}$ . The redshift range of each simulation is determined by the first and last redshifts ${z}_{\mathrm{initial}}$ and ${z}_{\mathrm{final}}$ , but snapshots were output only between ${z}_{{\rm{f}}-\mathrm{snap}}$ and ${z}_{\mathrm{final}}$ . The earliest snapshots of some simulations do not yet contain any halos, and the first catalog with halos is output at ${z}_{{\rm{f}}-\mathrm{cat}};$ the Sparta and Moria data also begin at that redshift. The cosmological parameters are given in Section 2; "PL" indicates self-similar cosmologies with a power-law initial spectrum with slope n. The references correspond to Diemer et al. (2013a, DKM13), Diemer & Kravtsov (2014, DK14), and Diemer & Kravtsov (2015, DK15). Our system for choosing force resolutions is discussed in DK14.

Download table as: ASCII Typeset image

The initial power spectra for the ΛCDM simulations were generated using Camb (Lewis et al. 2000), and the initial conditions for all simulations were created using the 2LPTic code (Crocce et al. 2006). The simulations were run with Gadget2 (Springel 2005). We use Rockstar (Behroozi et al. 2013a, version 0.99.9-RC3+) and Consistent-Trees (Behroozi et al. 2013b, version 1.01) to construct halo catalogs and merger trees. Rockstar finds the particles in friends-of-friends groups in six-dimensional phase space. As our new Moria catalogs are directly built on top of the Rockstar catalogs, we will discuss details of the Rockstar algorithm throughout the paper. Our catalogs use ${R}_{200{\rm{m}}}$ as the main mass definition, where we use only gravitationally bound particles (although we will discuss other definitions at length).

Our ΛCDM cosmologies bracket the currently favored range of those cosmological parameters that have a large impact on dark matter structure formation, most notably Ω_m. Power-law cosmologies are less commonly studied, but their self-similarity offers a unique opportunity to study halo structure in a simplified manner with only one free parameter (the power spectrum slope). Any meaningful result must be independent of redshift, allowing us to establish resolution limits by comparing different redshifts (Joyce et al. 2020). We further discuss the strengths and weaknesses of the Erebos simulation suite in Section 6.

3. New Algorithms in SPARTA

In this section, we briefly review the splashback algorithm presented in Papers I and II, referring the reader to those works for details. We describe a number of improvements and additions to Sparta, namely, an updated splashback algorithm (Section 3.1), SO calculations with and without gravitational unbinding (Sections 3.2 and 3.3), and the concept of tracer masses (Section 3.4).

3.1. Computing the Splashback Radius

The original purpose of the Sparta code was to calculate the splashback radius from the dynamics of individual particles. Sparta tracks particles as they approach a halo and records the time and location of their infall and their first apocenter (or "splashback event"). We tag particles at infall if they are deemed to belong to a subhalo. Subhalos whose mass is greater than 1/100 of the host mass may suffer from significant dynamical friction and are thus biased in their splashback radii. After removing particles from such subhalos from consideration, we smooth the distribution of the remaining particle splashbacks in time and define the halo's splashback radius as their mean, median, or higher percentiles (Figure 1). The same procedure is repeated for the mass enclosed within the radii of the particle splashbacks (meaning that the splashback mass can slightly deviate from the mass within the splashback radius).

Since the publication of Paper II, we have made a number of improvements to the Sparta algorithm. First, we have fixed a bug due to which the Hubble drag term was underestimated by a factor of h (Croton 2013), leading to earlier splashback times and splashback radii that were a few percent too large (because halos grow over time). Furthermore, the Paper I version of Sparta could not compute ${R}_{\mathrm{sp}}$ for up to 5% of halos depending on the mass range. We have increased the completeness (the fraction of halos with valid splashback measurement) of the ${R}_{\mathrm{sp}}$ catalogs in a number of ways. First, the previous version of Sparta stopped tracking all particles as soon as a halo became a subhalo. The new version keeps tracking particles if the subhalo epoch lasts for only one snapshot because such fast flyby events do not necessarily mean that the particle trajectories are interrupted. Similarly, we do not categorically prohibit computing ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ for subhalos any more, as long as there are sufficient particle splashback events near the time in question (Paper I). We do, however, still abort all particle trajectories when a halo becomes a subhalo for more than one snapshot. Subsequently, the subhalo will quickly run out of past splashback events, and a determination of ${R}_{\mathrm{sp}}$ will not be possible any longer. This behavior is physically sensible because the splashback radius is ill-defined for subhalos. We have also updated our algorithm to determine which particles belong to a subhalo at infall (Section 3.4). Altogether, the algorithmic changes shift the average ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ by a few percent. To incorporate these differences, we recalibrate the model of Paper II in Section 5.6.

3.2. SO Masses (All Particles)

SO radii and masses are routinely calculated by halo finders such as Rockstar, but a number of subtleties can make it difficult to obtain exactly the desired definition. First, many halo finders offer only a limited number of definitions or particularly common ones such as ${R}_{200{\rm{c}}}$ or ${R}_{\mathrm{vir}}$ . Second, we can include all particles (strict SO) or only gravitationally bound particles. The resulting masses tend to be close for host halos, but there are some exceptions. For subhalos, strict SO masses are often ill-defined, as they can include the entire host halo in some cases. To facilitate a thorough exploration of mass definitions, Sparta allows the user to compute any number of SO radii and masses with arbitrary density thresholds.

Sparta computes SO masses by considering a sorted list of particle radii within a user-defined factor times ${R}_{200{\rm{m}}}$ . This factor generally needs to be somewhat larger than unity because the common ${{\rm{\Delta }}}_{\mathrm{vir}}$ definition tends to overdensities of about 178 at high redshift, rendering ${R}_{\mathrm{vir}}$ larger than ${R}_{200{\rm{m}}}$ . For subhalos, we start at $4{R}_{200{\rm{m}},\mathrm{tcr}}$ , where the halo radius is computed from tracked particles only (Section 3.4 and B. Diemer & P. S. Behroozi 2020, in preparation). Given a list of particles inside the search radius, sorted by radius, we start from the outermost particle and move inward until a density threshold is achieved for the first time. In some rare cases, the threshold can be crossed multiple times, and we wish to obtain the outermost occurrence. We set ${M}_{{\rm{\Delta }}}=i\times {m}_{{\rm{p}}}$ , where i is the index of the particle where the threshold was crossed. We note that the internally used ${R}_{200{\rm{m}}}$ definition in Sparta interpolates the density between particles, but this difference is negligible in all but the smallest halos.

The calculation of an SO radius can fail for two reasons. First, the density may not reach the required threshold at any radius because even the very core of the halo is not dense enough. This issue occurs mostly for high-density definitions such as ${R}_{500{\rm{c}}}$ and can arise even in hosts halos. Second, the density may never fall below the threshold within the available particle distribution or within any reasonable radius. This issue cannot occur for hosts in Sparta because halos are defined via their ${R}_{200{\rm{m}}}$ internally. For subhalos, however, the entire host halo may be included if the subhalo is close enough to its center. This issue predominantly affects low-density definitions such as ${R}_{\mathrm{vir}}$ and ${R}_{200{\rm{m}}}$ . If the calculation fails, we record an error code in the catalogs. We investigate the completeness of our SO measurements in Section 5.2.

3.3. SO Masses (Bound Particles)

In addition to strict SO masses, we also wish to include bound-only masses in our catalogs, particularly for subhalos. Our base catalogs from Rockstar already contain bound-only masses that are computed as follows. Rockstar considers the particles in a friends-of-friends (FOF) group (or subgroup for subhalos) and unbinds particles if their kinetic energy exceeds their potential energy. The process is not repeated iteratively. If a halo contains more than some fraction (half, in our case) of unbound particles, it is abandoned as a transient feature and not included in the catalog.

While this unbinding procedure is sensible in theory and gives excellent results in practice, all unbinding techniques turn out to be highly subjective and code dependent because they strongly depend on the initial particle distribution considered. For example, if we wanted to be conservative and be sure to include all bound halo particles, we could start from all particles inside $2{R}_{200{\rm{m}}}$ —but with this much mass in the overall distribution, virtually all particles within ${R}_{200{\rm{m}}}$ would be bound. Conversely, if we considered only particles within ${R}_{500{\rm{c}}}$ , very large fractions would be unbound. Since no SO definition is special, there is no "correct" distribution of initial particles. FOF-based halo finders such as Rockstar or SubFind use FOF groups as the initial particle set, which is sensible but depends on the linking length and the algorithm for discerning subclumps (e.g., Springel et al. 2001; Han et al. 2012; Behroozi et al. 2013a). This issue is exacerbated in subhalos, where the position of the subhalo within the host will influence its overall density and thus how many particles would be considered bound. After much experimentation, we provide a fairly general unbinding algorithm in Sparta. We check the inequality

$\begin{eqnarray}&&-{{\rm{\Phi }}}_{{\rm{i}}}=G\displaystyle \sum _{j\ne i}^{{N}_{\mathrm{ptl}}}\displaystyle \frac{1}{{r}_{\mathrm{ij}}+\epsilon }\geqslant {f}_{\mathrm{bnd}}\displaystyle \frac{{v}_{{\rm{i}}}^{2}}{2},\end{eqnarray} \tag{ 1 }$

where ${N}_{\mathrm{ptl}}$ is the number of particles to consider, f_bnd is a threshold for the binding-to-kinetic energy ratio, and is the force softening scale of the simulation. To compute the sum over many particles efficiently, we have adapted the tree algorithm of Rockstar for Sparta (see B. Diemer & P. S. Behroozi 2020, in preparation, for details). The user can set the initial radius for particles, the boundness factor f_bnd in Equation (1), and whether the procedure is iterated until it converges. We set the initial radius for host halos to ${R}_{200{\rm{m}}}$ and for subhalos to the tracer-only radius (Section 3.4), which eliminates the issue of unphysical all-particle radii in subhalos; however, it does not eliminate the issue of host material contributing to the potential.

We compare our bound-only masses to those from Rockstar in Section 5.4. Broadly speaking, we find that our results agree reasonably well for the majority of subhalos, although with strong outliers where the host contributes a significant amount of material. These tests demonstrate that the potential calculation in Sparta works as expected, but, given the issue of host contamination, it is not clear how useful the bound-only masses computed by Sparta would be. We have thus left them out of the catalogs and instead included the Rockstar bound-only masses for four common overdensity thresholds (Section 5.1).

3.4. Ghosts and Tracer Masses

The algorithms described in the previous sections apply mostly to host halos, where we can meaningfully compute splashback and SO radii and masses without appealing to FOF algorithms or other ways to decide on the membership of particles. We have, however, also introduced entirely new algorithms for dealing with subhalos in Sparta. These methods will be described in detail in B. Diemer & P. S. Behroozi (2020, in preparation), but we briefly review them here because the results are included in our catalogs.

Unlike the Sparta version of Paper I, the code now tracks all particles in subhalos. When the halo first becomes a subhalo (when it crosses ${R}_{200{\rm{m}}}$ ), we identify the particles that truly belong to the subhalo using an updated algorithm. Specifically, we apply user-defined thresholds to exclude particles that first joined the subhalo close to the host (as they were probably swept up during infall) and that are not strongly gravitationally bound. Thereafter, we assume that subhalos only ever lose particles and never accrete new ones. When particles stray more than $2{R}_{200{\rm{m}},\mathrm{tcr}}$ from the subhalo, they are removed. The "tracer masses," including ${R}_{200{\rm{m}},\mathrm{tcr}}$ , are computed like normal SO masses but only including the remaining distribution of tracked particles. This definition gives similar but distinct results to Rockstar's unbinding algorithm (B. Diemer & P. S. Behroozi 2020, in preparation).

We also use the tracking algorithm to extend the lives of subhalos after they are lost by the halo finder. When a subhalo disappears from the input catalogs, we keep following its particles, at which point we term the halo a "ghost." We track the ghost's particles until there are only 10 particles left or until it has sunk to the center of the host halo. We compute the position and velocity of the ghost based on its most bound particles. Conceptually, ghosts are similar to "orphans" and "cores," where subhalos are represented as a single particle or a number of the most bound particles (e.g., Wang et al. 2006; Guo et al. 2010; Heitmann et al. 2019, 2020). Tracking all particles, however, allows us to compute physically meaningful tracer masses for ghosts. In B. Diemer & P. S. Behroozi (2020, in preparation), we show that tracking ghosts significantly increases the completeness of our merger trees.

4. The MORIA Extension

Sparta creates an output file with information about halos throughout a simulation's range of cosmic time, including their splashback radii and masses. This output file, however, does not constitute a halo catalog because it is structured as a series of halo histories, because it does not duplicate the majority of the halo finder data, and because Sparta's splashback algorithm is complete to only about 95% (Paper I). Moreover, a halo catalog should contain information about the relationships between halos, namely, their host–subhalo relationships.

In this section, we introduce a new extension to Sparta called Moria,² which creates halo catalogs and merger trees that contain both the halo finder's original output and Sparta's results. We describe the general code design (Section 4.1), algorithms to increase the completeness of the splashback data (Section 4.2), calculations of the mass accretion rate (Section 4.3), the computation of host–subhalo assignments (Section 4.4), our new merger tree format (Section 4.5), and Moria's system for setting resolution limits (Section 4.6). The procedure for running Sparta and Moria on a simulation is summarized in Figure 2; we refer the reader to the Sparta code documentation for detailed instructions.

4.1. General Code Design

Moria works in postprocessing, meaning that it can be run multiple times on the same Sparta output (Figure 2). The main reason for this choice is that the CPU time consumed by Moria is orders of magnitude smaller than that of Sparta because the former processes only halo data but not particles. For example, it would be highly inconvenient to rerun Sparta only to create a new halo catalog with a few extra fields from the original catalogs. Nevertheless, Moria performs I/O operations on large catalog and hdf5 files and uses tree searches to find host–subhalo relations, making it too computationally demanding for scripting languages. Thus, Moria is, like Sparta, written in pure C, but it runs on a single process.

The inputs to Moria are a Sparta output file and the original catalogs used by Sparta to create that file. The output format is flexible: the user can choose to write catalogs in Moria's native hdf5 format, the original catalog format (e.g., ASCII files for Rockstar), or a merger tree format that is essentially the same hdf5 format as the catalogs. The content of those files is entirely up to the user, who can choose any number of SO and splashback mass definitions that are present in the Sparta output file, as well as any number of fields from the original catalog. Mass definitions that exist in both the halo finder and Sparta outputs will be named accordingly, allowing for comparisons between different algorithms. Finally, the user can choose to compute an arbitrary number of host–subhalo relations based on any of the available radius definitions (Section 4.4). Virtually all parameters mentioned in the following sections can be changed by the user at run-time.

4.2. Increasing the Completeness of Splashback Data

In Paper I, we showed that the Sparta algorithm for determining ${R}_{\mathrm{sp}}$ is at least about 95% complete for host halos with ${N}_{200{\rm{m}}}\geqslant 1000$ (and often better depending on halo mass and redshift). This completeness is somewhat unsatisfactory for a number of reasons. First, we would like to push below 1000 particles to include the large fraction of halos that are smaller than this limit. Second, even 5% of halos without splashback values can lead to inconvenient systematics, e.g., in mass functions. Third, many of the interrupted splashback histories occur because the halo temporarily becomes a subhalo, meaning that it enters within ${R}_{200{\rm{m}},\mathrm{bnd}}$ of a larger halo. When we compute host–subhalo relations for the splashback definitions, some of those subhalos may become hosts but not have ${R}_{\mathrm{sp}}$ measurements. This case occurs relatively rarely because the splashback radius is typically larger than ${R}_{200{\rm{m}}}$ , but, depending on the exact definition, there is a significant fraction of halos for which it is slightly smaller (Paper II). We have partially addressed this issue in Section 3.1, but we would ideally like to achieve a completeness of 100% for host halos.

Moria addresses this issue by interpolating ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ across time. In many cases, splashback histories are interrupted by flyby events with more than one snapshot as a subhalo or a temporary lack of particles at first apocenter. Such gaps can be bridged because the overall splashback radius and mass histories tend to be relatively smooth, particularly when expressed as ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ . As a first guess, Moria looks for an earlier and later epoch within one dynamical time where the halo was a host and where ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ were successfully measured by Sparta. Furthermore, we check that ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ (for all splashback definitions) are not too dissimilar from our expectation for a halo of the given mass and mass accretion rate, as quantified by the model from Paper II (see Section 5.6 and Equation (3)). We require the value to be within a factor of two or a logarithmic factor of $5\sigma$ , where the standard deviation is quantified by Equation (6). These limits may seem overly generous, but the nominal standard deviation from the model can be as small as 0.02 dex or 5%. This deviation does not capture the significant tails in the distribution, which we do want to allow to avoid biasing our results. If at least one valid past or future epoch is found, we assume that ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ vary slowly over a dynamical time and accept their values for the missing epoch. If both a past and future epoch are found, we linearly interpolate ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ in time. Another potential source of error are the values of ${R}_{200{\rm{m}}}$ and ${M}_{200{\rm{m}}}$ used to convert the ratios to ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ . If ${R}_{200{\rm{m}}}$ includes a large amount of material from a nearby halo (Section 3.2), keeping the ratio fixed can lead to extreme values of ${R}_{\mathrm{sp}}$ . Thus, we check the ratio between ${R}_{200{\rm{m}},\mathrm{all}}$ and ${R}_{200{\rm{m}},\mathrm{bnd}}$ and use the latter if the ratio exceeds two. This choice is consistent with the fitting function on Equation (3), where we exclude halos with extreme all-to-bound ratios.

The accuracy of the interpolation depends on a number of factors, including the time spacing of snapshots, how many snapshots lack ${R}_{\mathrm{sp}}$ determinations, and the lowest particle number considered. We have tested the interpolation from past snapshots at z = 0 in TestSim100. For halos with ${N}_{200{\rm{m}}}\gt 200$ , we find that the guess based on previous snapshots is biased low by about 3% in radius and 5% in mass on average, with about 15% scatter. The low estimate is expected as ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ typically increases with time. These results are likely to be a worst-case scenario because ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ increases rapidly at z = 0 as mass accretion rates are falling quickly; this is particularly relevant in a small box such as TestSim100, where the majority of halos are low-mass galaxy halos. We find similar values for guesses based on future snapshots and much better results for interpolated values, depending on the time interval over which we interpolate. For short intervals of one or a few snapshots, the typical error is less than 1%. We have also experimented with extrapolating the evolution of ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ from past snapshots but found that, while it can slightly reduce the bias, it leads to somewhat erratic results and large scatter.

If no epoch with valid ${R}_{\mathrm{sp}}$ determination is found within a dynamical time, we use the mass and mass accretion rate of the halo to compute the fitting model of Section 5.6. This procedure is not circular because the model is calibrated using only halos where ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ were computed by Sparta. The fitting function is problematic in that we are imposing a median relation with significant scatter, but this guess is preferable over having no estimate at all (which could lead to significant differences in the subhalo statistics; see Section 4.4). The catalogs contain a status value that indicates how the splashback data for each halo and epoch were computed, allowing the user to easily exclude interpolated or predicted values. We test the completeness of our catalogs in Section 5.2.

4.3. Mass Accretion Rates

Over the past few years, it has become clear that the mass accretion rate of halos is an important property that determines their density profiles, splashback radius, and certain baryonic properties (Diemer & Kravtsov 2014; Lau et al. 2015; More et al. 2015; Green et al. 2020). However, the optimal definition of the accretion rate is not obvious: while we would ideally like to define an infinitesimal derivative of mass akin to theoretical models (Adhikari et al. 2014; Shi 2016), this quantity has no meaning in simulations where it is entirely dominated by shot noise and mergers.

In Moria, we keep the definition of Paper I, the logarithmic change in all-particle ${M}_{200{\rm{m}}}$ per scale factor over one dynamical time, or ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ . However, Xhakaj et al. (2019b) recently reported that our calculation of ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ differs from Rockstar's owing to three seemingly subtle differences: the mass definition (bound-only ${M}_{\mathrm{vir}}$ in Rockstar, all-particle ${M}_{200{\rm{m}}}$ in Moria), interpolating between two past snapshots (which is done in Rockstar but not in Moria), and the definition of the dynamical time ( ${t}_{\mathrm{dyn},\mathrm{vir}}$ in Rockstar, ${t}_{\mathrm{dyn},200{\rm{m}}}$ in Moria). The latter is by far the most important factor, highlighting how sensitive accretion rates can be to the time interval over which they are computed. For comparison, the Rockstar mass accretion rates are included in our catalogs (Section 5.1). We note that the Behroozi et al. (2013a) definition of dynamical time is a factor of two shorter than ours, meaning that their rate measured over $2{t}_{\mathrm{dyn}}$ corresponds most closely to our definition.

Once again, a complication arises for subhalos and backsplash halos that were subhalos one dynamical time ago. We could replace the unphysical all-particle masses of subhalos with bound-only masses from Rockstar, but this mixture would lead to an apples-to-oranges comparison and ignore the physical reality that there is no clear equivalency between the SO masses of hosts and subhalos. Moreover, the splashback radius is influenced by the total change of mass within the orbits of particles, which includes all matter regardless of whether it "truly" belongs to the halo. Thus, we use ${M}_{200{\rm{m}},\mathrm{all}}$ but adaptively reduce the interval over which we compute ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ to avoid subhalo epochs. We stop if the interval becomes shorter than $0.25{t}_{\mathrm{dyn}}$ , which occurs if the halo was a subhalo until very recently. In this rare case, we use the median value of ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ as predicted by the fitting function of Section 5.6, ignoring the substantial scatter of (0.41–0.07ν) dex. Physically speaking, we have to accept that there are halos for which it is difficult to define a meaningful accretion rate. For example, the evolution of backsplash halos can be dominated by stripping due to the flyby event. For subhalos, there is no sensible way to define an accretion rate at all. For completeness, the catalogs give ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ at the epoch when the subhalo was accreted, which we compute as for host halos. The catalogs contain a flag that indicates how the accretion rate was computed.

At z ≈ 0, ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ is computed by the default procedure for about 99% of host halos, over a reduced interval for 1%, and from the fitting function for a negligible fraction. In the smallest box at z = 0, L0063-WMAP7, those numbers rise to 4% and 0.2%. For subhalos, between 1% and 17% have reduced-interval accretion rates (at their infall redshift), and between 1% and 4% have only the fitting function estimate. These numbers vary only slightly with redshift. Moreover, Mansfield & Avestruz (2020) find that mass accretion rates in the Erebos simulations are well converged even at 100 particles per halo. In conclusion, we are able to measure mass accretion rates for virtually all halos for which this quantity is physically sensible.

4.4. Host–Subhalo Relations

Our process of assigning host–subhalo relations is illustrated in Figure 3. We find all halos whose centers lie within another, larger halo (see García & Rozo 2019, for alternative percolation algorithms). Here we need to define what "larger" means, since we will be dealing with a range of mass definitions. This ordering definition is a user-defined parameter in Moria. Given that none of the mass definitions discussed so far can be computed and are meaningful for all halos, we follow Behroozi et al. (2013a) in using ${V}_{\max }$ (as measured by Rockstar) to sort the halos. Starting with the largest halo, we use a tree search algorithm to look for all halos within its radius in a given mass definition (respecting the periodic boundary conditions). We mark all such halos as subhalos of the current halo unless they are already a subhalo.

**Figure 3.** Schematic illustration of the subhalo assignment in Moria, which essentially follows the Consistent-Trees method. Each circle indicates a host halo (red) or subhalo (orange) radius according to some radius definition. Halos are subhalos if their center lies within the radius of another halo with higher ${V}_{\max }$ . In this drawing, we assume that ${V}_{\max }\propto R$ with the exception of halos 8 and 9, where the halo with the smaller radius becomes the host because it has a larger ${V}_{\max }$ . Halos 3 and 5 illustrate choices we need to make about sub-subhalos. In the case of halo 3, we assign halo 1 as the parent (rather than halo 2, which is also a parent). In the case of halo 5, the user can choose whether to assign the direct parent or parent-parent. In our catalogs, we have chosen to assign parent ID 4.
Download figure:
Standard image High-resolution image

Some subhalos are inside multiple hosts, such as halo 3 in Figure 3. In this case, we assign the largest parent's ID (i.e., 1 in our example; this logic is equivalent to the "upid" field in Consistent-Trees catalogs). Another choice arises for subhalos whose only host is also a subhalo (halo 5 in Figure 3). Moria lets the user decide which parent ID to use; in the catalogs presented in this paper we assign the direct parent's ID (4 in this example). We have tested our assignment against Consistent-Trees and find perfect agreement when using the same radius definition.

One subtle question is how to handle missing radius estimates, for instance, when SO definitions cannot be computed (Section 3.2). We set those radii to zero, meaning that those halos cannot have subhalos. This solution seems sensible regardless of whether the SO could not be computed because the the halo never reaches the required central density at the center or because it is so close to a host that it includes its mass. Both issues apply predominantly to small halos that would not contain many subhalos regardless. Finally, we note that not all parent halos are necessarily part of the catalogs, depending on the chosen resolution cut (Section 5.1).

4.5. Merger Tree Format

The term "merger tree" can refer to a format of outputting time-ordered halo information or to the history of one halo and its progenitors; here, we mean the former. The Moria merger trees contain basically the same information as the collection of catalogs from all snapshots, but the information is ordered differently. A typical ordering is "depth-first," where the subsequent entries represent the most massive progenitor branch of one halo going back in time. Once that branch ends, the next progenitor (starting at some earlier time) is listed, and so on. In this way, one can construct a logical scheme where halos are listed sequentially as in the catalog entries, except that a redshift or snapshot number must be assigned to each entry (e.g., Springel et al. 2001; Behroozi et al. 2013b; Rodriguez-Gomez et al. 2015; Elahi et al. 2019; see Srisawat et al. 2013 for a code comparison).

The Moria merger tree format is designed in an entirely different fashion as illustrated in Figure 4. We store halos in a 2D array spanning all halo histories and the number of snapshots in the simulation (a similar system is used in the HBT+ halo finder of Han et al. 2018). This format has the obvious advantage that it is easy to extract a catalog-like data set (by selecting a time slice) or a history-like data set (by selecting a halo index). At first sight, the format seems wasteful since about one-third of the array is occupied by halos that did not yet or do no longer exist. These empty fields, however, occupy very little additional disk space owing to the hdf5 compression algorithm. The only difference between our catalogs and a time slice of the tree data is that the tree includes the entire histories of all halos that are included in the catalogs at any snapshot, meaning that, at a given redshift, the tree will include some halos that would not pass the threshold in the corresponding catalog (Section 4.6). If so, a flag in the tree data indicates that a halo is present only in the tree file. We provide Python code to load the catalog and tree data, which takes care of this cut automatically if desired.

The halo histories in the trees correspond to progenitor–descendant relations that we take directly from the original Consistent-Trees catalogs. These relations uniquely determine the next epoch in a halo history (horizontal gray arrows in Figure 4) or the tree branch into which a halo merges at the end of its life if it does not survive until the end of the simulation (vertical arrows). Moria does not attempt to improve the progenitor–descendant relations except for ghosts (blue fields in Figure 4).

We are now free to sort the tree file along the halo axis in any convenient fashion. We split the halos into subtrees, where each subtree contains a host halo at the final snapshot, its subhalos, and all histories that merged into them at previous redshifts. The subtrees are sorted by the same quantity that is used to impose a mass cut on the catalogs and trees, the peak ${M}_{200{\rm{m}}}$ along a halo history in our case. Thus, the first "line" in a tree file is the halo with the largest ${M}_{200{\rm{m}},\mathrm{peak}}$ that survived until the final snapshot. This row in the 2D array will be followed by the halos that merged into the top halo at the second-to-last snapshot, again ordered by their ${M}_{200{\rm{m}},\mathrm{peak}}$ . Thus, a halo history is not necessarily followed by its own progenitors, leading to upward-pointing gray arrows that cross multiple lines in Figure 4.

4.6. Limits on Mass and Halo Status

Moria makes it a priority to apply a well-defined lower-mass cutoff to the catalogs, with the twofold purpose of dissuading the user from accidentally using poorly resolved halos and saving disk space. The latter effect is substantial because the steep mass function of halos means that halos near the resolution limit account for a significant fraction of a catalog's size. The cutoff can be given in units of halo mass according to some definition, as a number of particles corresponding to that mass, ${V}_{\max }$ , or ${V}_{\max }$ corresponding to a number of particles. In the latter case, we estimate the limiting value of ${V}_{\max }$ using the empirical formula of Klypin et al. (2011),

$\begin{eqnarray}&&{V}_{\max }\approx 2.8\times {10}^{-2}{M}_{\mathrm{vir}}^{0.316}.\end{eqnarray} \tag{ 2 }$

Generally, it is preferable to set limits in units of particle number since they directly connect to the mass resolution of each simulation. We discuss the mass limits chosen for our catalogs in Section 5.1.

Furthermore, the user can decide how to treat phantoms and ghosts. Phantoms are rare cases where Consistent-Trees infers the existence of a halo from past and future snapshots even though Rockstar's FOF algorithm did not detect it. This can happen to both host and subhalos, although the latter make up for the vast majority of phantoms. Sparta uses the tracked particles in phantoms to compute their positions and velocities, which will not exactly agree with those from the halo finder (Section 3.4). The user can choose to replace the Rockstar values with those from Sparta, but we have not used this feature in our catalogs because it causes the host–subhalo relations to slightly differ from Consistent-Trees even if the same mass definition is used. We do not include ghosts in our catalogs because they do not have any catalog-defined properties, by definition; they are, however, included in the merger trees.

5. Results

The main product of this work are publicly available halo catalogs and merger trees. We summarize their most important properties in Section 5.1 and analyze their completeness in Section 5.2. We compare different SO definitions in Sections 5.3 and 5.4. In Sections 5.5–5.7, we consider the relationship between SO and splashback definitions and present an updated model for the splashback–SO connection.

5.1. Halo Catalogs and Merger Trees

Our catalogs and trees are based on Rockstar and Consistent-Trees catalogs run with ${R}_{200{\rm{m}},\mathrm{bnd}}$ as the main definition, with the "strict SO" setting switched off and the "bound-props" setting on, meaning that all properties, including SO masses, are calculated from bound particles only. While the distance units used in the Rockstar catalogs are comoving $\,{h}^{-1}\,\mathrm{Mpc}$ and $\,{h}^{-1}\,\mathrm{kpc}$ , all halo radii are given in physical $\,{h}^{-1}\,\mathrm{kpc}$ in the Moria catalogs.

We have chosen a mass limit of ${N}_{200{\rm{m}},\mathrm{bnd},\mathrm{peak}}\geqslant 200$ in Moria, meaning that a halo is included in the catalogs if it had at least 200 particles within ${R}_{200{\rm{m}},\mathrm{bnd}}$ at some time in the past. The trees contain the entire history of a halo if it exceeds the limit at any snapshot. There is no one ideal limit for all purposes because different halo properties converge at different particle numbers (Mansfield & Avestruz 2020). SO masses tend to be relatively well converged down to a few hundred particles, even for subhalos (e.g., Leroy et al. 2020), although their mass is subject to significant statistical variance (Trenti et al. 2010; Benson 2017). For splashback radii, we would ideally enforce a higher limit but instead caution the user that splashback masses and radii below 1000 particles become less complete (Paper I and Section 5.2).

We have included a large number of mass definitions, accretion rates, and status fields, some of which we summarize in Table 3. We refer the reader to the code documentation for units and further details. Most importantly, we include SO masses and radii in the ${M}_{500{\rm{c}}}$ , ${M}_{200{\rm{c}}}$ , ${M}_{\mathrm{vir}}$ , and ${M}_{200{\rm{m}}}$ definitions, each calculated with all particles (from Sparta), bound-only (from Rockstar), and from tracers only (for subhalos, from Sparta). We also include splashback radii and masses corresponding to the mean, median, 70%, 75%, 80%, 85%, and 90% of the particle splashback distribution. For each of these radius definitions, we provide host–subhalo relations, that is, an array of host IDs for each halo. We also include the original host–subhalo assignment from Consistent-Trees. We include a number of flags that indicate the status of the halo in Sparta, which definitions were successfully computed, and whether fields were modified or added by Moria. In addition to the fields listed in Table 3, we have added almost all fields available in the Rockstar/Consistent-Trees catalogs, including scale radius, velocity dispersion, angular momentum, spin parameters, axis ratios, position and velocity offsets, half-mass radii and times, and tidal forces (see Behroozi et al. 2013a, 2013b, although some fields have been added since those original publications).

Table 3. Partial List of Fields Included in the Moria Catalogs and Merger Trees

Field	From	Explanation
Organizational and merger information:
`id`	C,S	Halo ID from the original catalogs (except ghosts, where the ID is determined by Sparta)
`descendant`_`id`	C,S	ID of the halo at the next snapshot, or of the halo it merged into if it is ending (tree only)
`descendant`_`index`	M	Tree index of descendant; same as current halo unless merging (tree only)
`mask`_`alive`	M	True if there is an alive halo at this index and snapshot, false if not (tree only)
`mask`_`cut`	M	True if halo is part of catalog as well as tree (tree only)
`num`_`prog`	C	Number of progenitors (not counting ghosts)
Halo status:
`status`_`sparta`	S	Indicates host/sub/ghost status, switches in host, becoming a subhalo, becoming host, etc.
`phantom`	C	Nonzero if halo was interpolated by Consistent-Trees
`status`_`acc`_`rate`	M	Status indicating whether accretion rate was computed or guessed
`status`_`sparta`_`rsp`	S	Status of ${R}_{\mathrm{sp}}$ analysis in Sparta at this snapshot (success, not enough particles, etc.)
`status`_`moria`_`rsp`	M	Status of ${R}_{\mathrm{sp}}$ analysis in Moria (taken from Sparta, interpolated, guessed from fitting function, etc.)
`status`_`moria`_`hps`_`M`<`definition`>	M	Status of each SO mass (success, not dense enough, never below threshold, etc.)
`status`_`moria`_`hps`_`R`<`definition`>	M	Status of each SO radius definition (probably the same as for the SO mass of the same definition)
Important times in halo history and merger information:
`Mpeak`_`Scale`	C	Scale where highest ${M}_{200{\rm{m}},\mathrm{bnd}}$ was reached during halo's history
`Acc`_`Scale`	C	Scale where subhalo last became a subhalo, if it ever did
`First`_`Acc`_`Scale`	C	Scale where this halo first became a subhalo, if ever
`scale`_`of`_`last`_`MM`	C	Scale where last merger with mass ratio greater than 0.3 occurred, if ever
`Time`_`to`_`future`_`merger`	C	Time (in Gyr) until the halo merges into a larger halo, if ever
`Future`_`merger`_`MMP`_`ID`	C	ID of most massive progenitor into which halo merges (−1 if it does not exist at this time)
Mass and radius definitions:
`R200m`_`all`_`spa`_`internal`	S	Internally used ${R}_{200{\rm{m}}}$ from Sparta ( ${R}_{200{\rm{m}},\mathrm{all}}$ for hosts, ${R}_{200{\rm{m}},\mathrm{tcr}}$ for subhalos)
`M200m`_`all`_`spa`_`internal`	S	Internally used ${M}_{200{\rm{m}}}$ from Sparta (corresponding to `R200m`_`all`_`spa`_`internal`)
`nu200m`_`internal`	M	Peak height, ${\nu }_{200{\rm{m}}}$ , calculated from internal ${M}_{200{\rm{m}}}$ or bound mass if ${M}_{200{\rm{m}},\mathrm{all}}\gt 2{M}_{200{\rm{m}},\mathrm{bnd}}$
`R`<`delta`>_`all`_`spa`	S	SO radius computed from all particles (with Δ = 500c, 200c, vir, and 200m)
`R`<`delta`>_`tcr`_`spa`	S	SO radius computed from tracked particles for subhalos, same as all-particle radius for hosts
`R`<`delta`>_`bnd`_`cat`	R	SO radius computed from bound particles only
`M`<`delta`>_`all`_`spa`	S	SO mass computed from all particles
`M`<`delta`>_`tcr`_`spa`	S	SO mass computed from tracked particles for subhalos, same as all-particle mass for hosts
`M`<`delta`>_`bnd`_`cat`	R	SO mass computed from bound particles only
`Rsp-apr-mn`	S	${R}_{\mathrm{sp}}$ computed from mean of the particle splashback distribution
`Rsp-apr-p`<`percentile`>	S	${R}_{\mathrm{sp}}$ computed from percentiles of the particle splashback distribution (50, 70, 75, 80, 85, 90)
`Msp-apr-mn`	S	${M}_{\mathrm{sp}}$ computed from mean of the particle splashback distribution
`Msp-apr-p`<`percentile`>	S	M_sp computed from percentiles of the particle splashback distribution (50, 70, 75, 80, 85, 90)
`Vmax`	R	Maximum circular velocity
`Macc`	C	${M}_{200{\rm{m}},\mathrm{bnd}}$ at accretion (for subhalos)
`Vacc`	C	${V}_{\max }$ at accretion (for subhalos)
`M200m`_`peak`_`cat`	C	Highest ${M}_{200{\rm{m}},\mathrm{bnd}}$ attained during halo's history
`Vpeak`	C	Highest ${V}_{\max }$ attained during halo's history
`Vmax@Mpeak`	C	${V}_{\max }$ at scale `Mpeak`_`Scale`
`First`_`Acc`_`Mvir`	C	${M}_{\mathrm{vir}}$ when first becoming subhalo (if ever)
`First`_`Acc`_`Vmax`	C	${V}_{\max }$ when first becoming subhalo (if ever)
Host–subhalo relations:
`parent`_`id`_`cat`	C	Parent ID according to original catalog; equivalent to `upid` (most massive host)
`parent`_`id`_`R`<`delta`>_`all`_`spa`	M	Parent ID for all-particle SO definitions
`parent`_`id`_`R`<`delta`>_`bnd`_`cat`	M	Parent ID for bound-only SO definitions
`parent`_`id`_`R`<`delta`>_`tcr`_`spa`	M	Parent ID for tracer SO definitions (experimental and very similar to all-particle assignment)
`parent`_`id`_`Rsp-apr-mn`	M	Parent ID for ${R}_{\mathrm{sp}}$ from mean of the particle splashback distribution
`parent`_`id`_`Rsp-apr-p`<`percentile`>	M	Parent ID for ${R}_{\mathrm{sp}}$ from percentiles of the particle splashback distribution (50, 70, 75, 80, 85, 90)
Accretion rates:
`acc`_`rate`_`200m`_`dyn`	M	Fiducial accretion rate ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ as computed by Moria, see `status`_`acc`_`rate`; dimensionless units
`Acc`_`Rate`_`1`*`Tdyn`	C	Acc. rate over $0.5{t}_{\mathrm{dyn}}$ as defined here but using ${t}_{\mathrm{dyn},\mathrm{vir}};$ in absolute units of $\,{M}_{\odot }/h/\mathrm{yr}$
`Acc`_`Rate`_`2`*`Tdyn`	C	Acc. rate in ${M}_{200{\rm{m}},\mathrm{bnd}}$ over ${t}_{\mathrm{dyn}}$ as defined here but using ${t}_{\mathrm{dyn},\mathrm{vir}}$
`Acc`_`Rate`_`Inst`	C	Acc. rate in ${M}_{200{\rm{m}},\mathrm{bnd}}$ over one snapshot (very noisy)
`Acc`_`Rate`_`100Myr`	C	Acc. rate in ${M}_{200{\rm{m}},\mathrm{bnd}}$ over 100 Myr (typically short compared to ${t}_{\mathrm{dyn}}$ )
`Acc`_`Rate`_`Mpeak`	C	Acc. rate in ${M}_{200{\rm{m}},\mathrm{bnd},\mathrm{peak}}$ from current z to $z+0.5$
`Acc`_`Log`_`Vmax`_`1`*`Tdyn`	C	Difference in ${\mathrm{log}}_{10}{V}_{\max }$ over $0.5{t}_{\mathrm{dyn},\mathrm{vir}}$
`Acc`_`Log`_`Vmax`_`Inst`	C	Difference in ${\mathrm{log}}_{10}{V}_{\max }$ over one snapshot
`Log`_`(VmaxVmax`_`max(Tdyn;Tmpeak))`	C	${\mathrm{log}}_{10}{V}_{\max }$ over ${V}_{\max }(t-{t}_{\mathrm{dyn}})$ or ${V}_{\max }({t}_{\mathrm{peak}})$ (the latter if ${M}_{\mathrm{peak}}$ happened more than ${t}_{\mathrm{dyn}}$ ago)
Other halo properties:
`x`	R,S	Halo position from Rockstar, except for ghosts (and optionally phantoms) where computed by Sparta
`v`	R,S	Halo velocity from Rockstar, except for ghosts (and optionally phantoms) where computed by Sparta

Note. The letters in the "From" column indicate the possible sources of a field, namely, R (Rockstar), C (Consistent-Trees), S (Sparta), or M (Moria). This list is incomplete; see the Sparta documentation for additional fields and units. In identifiers, "delta" can be replaced by "200m," "vir," "200c," or "500c." The "percentile" for ${R}_{\mathrm{sp}}$ definitions can be 50, 70, 75, 80, 85, or 90.

Download table as: ASCIITypeset images: 1 2

We have set the compression level of the hdf5 files to the lowest level, 1, which gives a good compromise between file size and speed. The files are readable with any up-to-date hdf5 distribution. In total, our catalogs for the 14 Erebos simulations contain about 7 million halo histories and a total of 342 million halo epochs. On average, 64% of the tree arrays contain halos that are alive; the remaining 36% are filled with zeros. Out of the active halo epochs, about 88% are occupied by hosts, 8.6% by subhalos, and 3.8% by ghosts. These fractions vary from simulation to simulation, with smaller box sizes containing higher fractions of subhalos and ghosts, as well as an overall greater number of halos due to the higher variance of the density field on smaller scales.

5.2. Completeness

As discussed in Sections 3.1–3.3, the calculation of SO and splashback radii can fail in certain situations. Figure 5 shows the corresponding incompleteness of our catalogs for a variety of simulations and redshifts. The top four panels address SO definitions, where the completeness varies strongly between all-particle and bound-only definitions and between hosts and subhalos (different line styles in Figure 5).

**Figure 5.** Incompleteness of the catalogs as a function of the peak particle number. The colors refer to the different simulations in the WMAP7 sample, except in the bottom right panel, where they indicate the self-similar simulations. Top row: each panel shows an SO definition at z = 0.1. The lines show the fraction of halos for which an SO mass could not be computed, either because the threshold density was not reached at the center (common for high-density definitions such as ${R}_{200{\rm{c}}}$ ) or because the density of the halo did not fall below the threshold even at large radii (common for low-density definitions such as ${R}_{200{\rm{m}}}$ ). Ending lines mean that the corresponding fraction falls to zero. Dashed lines refer to all-particle masses for hosts, solid lines to all-particle masses for subhalos, and dotted–dashed lines to bound-only masses for subhalos. The all-particle definitions are frequently ill-defined for subhalos, leading to large incompleteness. The fractions depend on the box size because the small boxes contain more resolved subhalos whose strict SO masses would include the entire host halo. All-particle masses for hosts and bound-only masses for subhalos can almost always be computed, with the exception of a small fraction of halos in high-contrast definitions such as ${R}_{500{\rm{c}}}$ . The lines for bound-only host halos look very similar to the all-particle versions and are thus omitted. Bottom row: the panels show the completeness of the splashback definitions for host halos; all splashback definitions share the same status. Missing values are recovered by Moria by either interpolating using adjacent snapshots (solid lines) or predicting a value from a fitting function (dashed lines). Both fractions are low, especially at $N\gt 1000$ , which is the suggested threshold for using the splashback data (gray area). At high redshift, the number of interpolated and modeled radii drops. The bottom right panel demonstrates that the completeness is somewhat worse in the self-similar simulations with shallow slopes, especially n = −1. See Section 5.2 for details.
Download figure:
Standard image High-resolution image

The calculation of bound-only radii can fail if the halo never reaches the threshold density at its center. By construction, there are no halos for which the halo finder fails to calculate the main definition, ${R}_{200{\rm{m}},\mathrm{bnd}}$ . As the threshold increases to 500c, an increasing number of subhalos fail to reach the threshold, but the number remains well below a percent (dotted–dashed lines). The same issue can occur even for host halos, although with a frequency of less than 10⁻³ (dashed lines). The results for bound-only definitions in host halos are similar to those for all particles, which is not surprising since virtually all particles near the halo center are bound.

We also attempt to determine all-particle SO masses for subhalos even though they are often intrinsically ill-defined because the density never falls below the threshold (unless the entire host halo is included). The resulting incompleteness can reach up to 50% for ${R}_{200{\rm{m}}}$ but decreases significantly toward ${R}_{500{\rm{c}}}$ , which can be computed for the vast majority of subhalos (solid lines in Figure 5). The incompleteness in all-particle subhalo masses is a function of mass rather than particle number, leading to the large differences between simulations. This is not the case for bound-only definitions, host halos, or splashback definitions though. Either way, the large incompleteness of all-particle subhalo masses is not important because they are given for comparison rather than as a physically meaningful mass definition.

In some very rare cases, a host halo is close enough to another halo that Sparta cannot find a radius where the density profile decreases below $200{\rho }_{{\rm{m}}}$ . We observe this case in the 10⁻⁴ of host halos in L0063-WMAP7 (dashed line in top left panel of Figure 5). We emphasize that this incompleteness captures only cases where ${R}_{200{\rm{m}},\mathrm{all}}$ could not be computed; there are many more cases where it is computed but includes significant material from neighboring halos (Section 5.3).

We now turn to the splashback radius (bottom row of Figure 5). The issue of completeness was discussed at length in Paper I, with the conclusion that Sparta produces splashback data for about 95% of halos with ${N}_{200{\rm{m}}}\geqslant 1000$ . Here, the situation has changed because we have included smaller halos down to 200 particles, selected them by peak mass rather than current mass, and computed a value for ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ regardless of whether they could be computed by Sparta. As discussed in Section 4.2, we either interpolate the splashback radius from past and/or future values (solid lines) or estimate it using the fitting function (dashed lines). For most applications, the model estimates should not be used because the scatter in the true ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ ratio is large (Paper II). At z ≈ 0, we find that model estimates account for just under 10% of halos with N_peak = 1000 in the smallest boxes and much less than a percent in the larger boxes. This trend is driven by the larger number of mergers and close flybys in the smaller boxes, which lead to interrupted splashback histories (Diemer 2020a). The second and third panels show redshifts 2 and 4 (including a pink line for L0031, which runs only up to z = 2). By z = 4, the fraction of estimated values falls below a percent in all simulations. Finally, we consider the self-similar simulations in the bottom right panel. At N = 1000, the modeled fraction varies between 10% for n = −1 and around 1% for the other simulations.

In summary, Figure 5 confirms that pushing far below the canonical value of 1000 particles per halo leads to possibly problematic fractions of interpolated or estimated splashback values. We find very similar results for the Planck cosmology at the same box sizes. For SO definitions, we do not find a large evolution in the fractions with redshift, meaning that the general trends shown in Figure 5 persist. Given that we will largely use bound-only and tracer masses for subhalos, we can think of the SO definitions as complete. The splashback definitions are nearly complete for hosts as long as we use reasonably well resolved halos. One exception are the last few snapshots of a simulation: due to the missing future particle splashback events, the completeness and reliability of Sparta's splashback results decrease for the last $\approx 0.2{t}_{\mathrm{dyn}}$ (Paper I). Thus, it is generally advisable to consider the splashback results at z ≈ 0.1 rather than at z = 0. In the following sections, we combine data from all simulations of a given cosmology.

5.3. SO Definitions: All-particle versus Bound-only

We now consider the ratios between different mass and radius definitions quantitatively, starting with SO definitions. Both all-particle and bound-only definitions are commonly used, for instance, in studies of the halo mass function and merger trees, respectively. While it is widely known that unbinding and the density threshold can have a significant effect, these differences are rarely quantified.

In Figure 6, we show histograms of the ratio between all-particle and bound-only SO definitions for halos from all simulations in our WMAP7 sample for which both were measured and for which the bound-only radius contains at least 200 particles. We use the ${R}_{200{\rm{m}},\mathrm{bnd}}$ definition to split hosts and subhalos. Furthermore, we crudely split the sample by peak height into low-mass (0.5 < ν < 1) and higher-mass halos (1.5 < ν < 2). Similarly, we plot the ratio at z = 0 and z = 2 to highlight the general redshift trends. When referring to SO definitions, we use radii and masses interchangeably as they are uniquely coupled via M ∝ R³. The logarithmic mass ratios translate into three times smaller logarithmic radius ratios.

**Figure 6.** Ratio of all-particle and bound-only SO masses for host halos (top row) and subhalos (bottom row), low and high redshift, and low and high masses (defined as $0.5\lt \nu \lt 1$ and $1.5\lt \nu \lt 2$ ). In each panel, the colored lines show histograms of the radius ratio for four mass definitions. Since $M\propto {R}^{3}$ for SO definitions, the logarithmic differences in radius are three times smaller. For host halos, the definitions agree for the vast majority of halos, but there is a tail toward high ratios caused by halos that are close to another, larger halo. In such cases, the all-particle mass can encompass much or all of the other halo's mass. As expected, this effect is more common for lower-threshold definitions such as ${R}_{200{\rm{m}}}$ , at low mass, and at high redshift. The all-particle masses of a large fraction of subhalos are ill-defined; the cutoff of the distribution around a ratio of 1000 is partly caused by Sparta's algorithm. The dependence on mass definition is even stronger for subhalos, with high-threshold definitions such as ${R}_{500{\rm{c}}}$ performing better. However, high thresholds also lead to more subhalos for which no mass can be found at all, which are excluded from this figure. See Section 5.3 for details.
Download figure:
Standard image High-resolution image

For host halos (top row of Figure 6), we notice a very small fraction where ${R}_{\mathrm{all}}\lt {R}_{\mathrm{bnd}}$ owing to numerical effects; in general, the bound-only mass is always smaller. We have checked that all-particle masses measured by Sparta and Rockstar agree perfectly (if Rockstar's strict SO option is activated), except in some rare cases where the FOF groups used by Rockstar are slightly incomplete at large radii. However, in TestSim100 we find that only about 1% of halos exhibit more than 1% difference in ${R}_{200{\rm{m}}}$ . Returning to the all-bound comparison, Figure 6 shows that unbinding plays almost no role for the vast majority of host halos. The median ratios are unity in all definitions, and the mean ratios deviate from unity by no more than 3%. The strength of the tail depends on the density threshold, with ${R}_{500{\rm{c}}}$ leading to fewer large ratios. For example, at z = 0, the all-bound ratio for ${M}_{500{\rm{c}}}$ is higher than 10% for only 0.5% of halos but for about 9% when using ${M}_{200{\rm{m}}}$ . These differences are caused by halos that are close to another, larger halo whose material is partially included in the all-particle mass. High density thresholds are useful in avoiding this proximity issue, but they also lead to more failures of the calculation.

While it is generally safe to use all-particle and bound-only masses interchangeably for hosts, there are noticeable differences for the SO radii in the lower-threshold definitions. For example, there is a standard deviation of 14% in ${R}_{\mathrm{vir}}$ in the z = 2 low-mass example. The corresponding tail toward large radius ratios highlights the importance of unbinding in the context of subhalo relations: even few halos with spuriously large radii could, in principle, create a larger number of subhalos. We investigate this difference in Diemer (2020a) and find that subhalo relations based on all-particle definitions lead to up to 5% more subhalos at the low-mass end.

We now turn to subhalos (bottom panels of Figure 6). Here, the ratios are peaked near unity, but a significant fraction of halos have larger ratios. The distributions begin to fall off around a mass ratio of 100, but this cutoff is not necessarily physical because Sparta stops looking for a solution when the all-particle radius gets too large. On the one hand, the main conclusion from Figure 6 is that all-particle masses are ill-defined for subhalos and should simply not be used. The median mass ratios vary between 1.2 and 5, increasing with redshift and decreasing with mass and overdensity threshold. Considering radii, on the other hand, a surprisingly large fraction of subhalos do have well-defined all-particle radii: the median radius ratio varies between 1.1 and 1.8. The large standard deviations of up to unity reflect that the result of the calculation will strongly depend on how close to the host center the subhalo is located. In summary, the bound-to-all ratio appears to increase in cases where halos are close to neighbors (for hosts) or to their host center (for subhalos).

We now explicitly confirm this picture by considering the mass evolution of subhalos in the WMAP7 cosmology in Figure 7. We use our new merger tree format to pick out all epochs where a halo becomes a subhalo and to extract its mass evolution before that time (while it was a host) and afterward (while it is a subhalo, as opposed to a ghost or backsplash halo). We define the time of infall as the last snapshot before each halo became a subhalo according to the ${R}_{200{\rm{m}}}$ definition (vertical dashed line in Figure 7). We require halos to contain at least 200 particles at infall, but this limit does not qualitatively change the results. Similarly, the mass evolution does not change much if we limit the infall redshift or peak height; the figure includes infalls at any redshift. We omit epochs where a mass definition cannot be measured, meaning that the median all-particle masses would be even larger if we could include the most extreme cases.

**Figure 7.** Mass evolution of subhalos before and after infall (left and right of vertical gray line). The lines show the median evolution of ${M}_{200{\rm{m}}}$ (blue) and ${M}_{500{\rm{c}}}$ (orange) in their bound-only (solid) and all-particle (dashed) versions, including only epochs where the respective mass could be measured. Time is measured in units of the dynamical time at infall. All masses are normalized to ${M}_{200{\rm{m}},\mathrm{bnd}}$ at infall, so that the solid blue line is unity at ${t}_{\mathrm{infall}}$ by construction. Before infall, the all-particle and bound-only masses grow in unison and with modest scatter in their evolution. Just before infall, the all-particle masses sweep up additional mass from the future host, including roughly 40% more mass on average for ${M}_{200{\rm{m}}}$ and about 10% for ${M}_{500{\rm{c}}}$ . After infall, the bound-only mass begins to decrease whereas the all-particle mass rapidly grows. However, around one dynamical (or crossing) time, the mass decreases again as the subhalos reach their apocenter.
Download figure:
Standard image High-resolution image

Figure 7 shows that, before infall, the all-particle and bound-only masses evolve more or less in unison. After infall, the median all-particle mass rapidly increases by a factor of up to 10. The large scatter highlights that the mass evolution strongly depends on the orbital parameters and on the internal structure of host and subhalo. Despite the scatter, we observe a dip in the median all-particle mass after one dynamical time, when many subhalos have reached their first apocenter in the less dense outskirts of their hosts. The corresponding dips from subsequent orbits are smoothed out by phase mixing. Focusing on the evolution around infall, the all-particle mass begins to deviate from the bound-only mass about half a dynamical time before infall. As expected, the increase over the bound-only mass is larger for low-threshold definitions such as ${M}_{200{\rm{m}}}$ because their larger radius includes more host material. While ${M}_{500{\rm{c}}}$ increases by only about 10% at infall, ${M}_{200{\rm{m}}}$ has increased about 40% on average. This mass increase is largely responsible for the tails in the host halo distribution in Figure 6.

5.4. SO Definitions: Unbinding Algorithm

As mentioned in Section 3.3, we found that bound-only masses depend strongly on the chosen algorithm, most notably on the initial set of particles. In Figure 8, we compare bound-only masses computed by Sparta to those from Rockstar. Here, we consider all particles within ${R}_{200{\rm{m}}}$ , defined as the all-particle radius for hosts and the tracer radius for subhalos. We do not iteratively unbind to match Rockstar's procedure and because iterating does not significantly alter our conclusions. The results in Figure 8 refer to TestSim100 because we have not included Sparta's bound-only masses in our fiducial catalogs.

**Figure 8.** Ratio of bound-only masses computed from an initial FOF group (by Rockstar) and from all particles within ${R}_{200{\rm{m}}}$ (by Sparta), at z = 0 in TestSim100. The bound masses agree almost exactly for host halos (dark blue). For subhalos (light blue), there is a large fraction where the masses agree but also an extensive tail toward large values (the shaded areas show the 68% and 95% intervals for subhalos). The tail is caused by halos where ${R}_{200{\rm{m}}}$ includes sufficient host particles to spuriously bind a lot of material. This case is more likely for ${R}_{200{\rm{m}}}$ than for ${R}_{500{\rm{c}}}$ because the latter relies on more strongly bound particles.
Download figure:
Standard image High-resolution image

**Figure 8.** Ratio of bound-only masses computed from an initial FOF group (by Rockstar) and from all particles within ${R}_{200{\rm{m}}}$ (by Sparta), at z = 0 in TestSim100. The bound masses agree almost exactly for host halos (dark blue). For subhalos (light blue), there is a large fraction where the masses agree but also an extensive tail toward large values (the shaded areas show the 68% and 95% intervals for subhalos). The tail is caused by halos where ${R}_{200{\rm{m}}}$ includes sufficient host particles to spuriously bind a lot of material. This case is more likely for ${R}_{200{\rm{m}}}$ than for ${R}_{500{\rm{c}}}$ because the latter relies on more strongly bound particles.
Download figure:
Standard image High-resolution image

As expected, the results agree very well for host halos, with median ratios of unity in all definitions and standard deviations between 2% in ${M}_{500{\rm{c}}}$ and 8% in ${M}_{200{\rm{m}}}$ . This agreement reconfirms that we can safely neglect boundedness for host halos except in some extreme merger scenarios. For subhalos, the estimates can differ significantly, with a long tail toward large ratios. The shaded areas in Figure 8 show the 68% and 95% intervals for subhalos. While the logarithmic scale of the figure highlights the tails, the median ratios vary between only 1.1 for ${M}_{500{\rm{c}}}$ and 1.35 for ${M}_{200{\rm{m}}}$ . For ${M}_{500{\rm{c}}}$ , the estimates agree to 10% or better for about half the halos; for ${M}_{200{\rm{m}}}$ that fraction goes down to 30%. However, the tails and standard deviations are extremely large in all definitions. The reason for these disagreements are halos where ${R}_{200{\rm{m}}}$ includes numerous host particles, rendering the entire distribution more bound than it would be if we considered only "true" subhalo particles. The 6D-FOF groups used for subhalos in Rockstar do not constitute a "correct" answer either, but they contain less host material because the velocity of the particles is considered. We thus provide the Rockstar bound-only masses in our catalogs. We continue the search for physically motivated definitions of subhalo masses in B. Diemer & P. S. Behroozi (2020, in preparation), where we further explore the concept of tracer masses.

5.5. Splashback Definitions

In Paper II, we investigated the splashback radii and masses computed by Sparta as a function of ${R}_{200{\rm{m}}}$ and ${M}_{200{\rm{m}}}$ , accretion rate, redshift, and cosmology. However, we did not show the distribution of ratios and did not consider subhalos. Figure 9 shows ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ but using the bound-only SO radii and masses from Rockstar to make the ratios more comparable to Figure 6 and to avoid the issue of spuriously large all-particle radii (Section 5.3). The distributions are clearly peaked around the median (shaded areas), but they do exhibit broad tails. Some of the most extreme ratios are caused by flyby encounters that can result in negative accretion rates and large splashback radii. The cutoff above a radius ratio of three is partly caused by Sparta's algorithm, which does not trace particles to arbitrary distances (Paper I). The distribution of ratios does not strongly depend on redshift, although the average splashback radius and mass slightly decrease with z (in agreement with Paper II). We choose ${R}_{\mathrm{sp},75 \% }$ as a representative definition in Figure 9; the distributions for the other percentiles look similar (although slightly shifted in radius and mass). One important difference between Paper II and our new catalogs is that Moria reconstructs splashback quantities by interpolation, extrapolation, and estimates from our fitting function. Those results are shown as dashed lines in Figure 9. The good agreement with the measured splashback quantities is somewhat by construction since we interpolate in ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ space, but it demonstrates that the interpolation does not introduce any unexpected features.

**Figure 9.** Ratio of the 75%-splashback radius and mass to ${R}_{200{\rm{m}}}$ (left) and ${M}_{200{\rm{m}}}$ (right), for z ≈ 0 (top) and z = 4 (bottom). The histograms include all halos in the WMAP7 sample with ${N}_{200{\rm{m}}}\gt 1000$ . The splashback radius depends on the accretion rate and mass, which broadens the distribution. The solid lines show splashback quantities computed by Sparta, the dashed lines those interpolated or estimated by Moria. For subhalos, only a small fraction has Sparta estimates. While the distribution has large tails, the 68% and 95% intervals for hosts (shaded areas) are well confined. The tails for the estimated values are limited by construction as described in Section 4.2. The tails toward high ratios are partially due to outliers with genuinely large splashback radii and partly due to differences between including all particles and the bound-only SO mass to which we are comparing. The broadness of the distributions also highlights that the splashback radius is a unique definition that cannot easily be guessed based on SO definitions. See Section 5.5 for details.
Download figure:
Standard image High-resolution image

For subhalos, Sparta does not compute particle splashbacks because orbits reflect the combined potential of host and subhalo and do not typically exhibit well-defined apocenters. However, if a halo becomes a subhalo only briefly, there may be enough past and future particle splashbacks to compute ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ . The corresponding distributions are similar to those of host halos (solid light-blue lines in Figure 9). The estimated subhalo splashback radii (dashed light-blue lines) refer to the time of infall and are thus not directly comparable to the current-time radius and mass according to the halo finder; they are given mostly for completeness.

Based on the mass evolution of SO halos shown in Figure 7, we might worry about the evolution of splashback radii and masses close to merger events. For example, if the radii were artificially inflated as a result of particles being stripped from the halo, that could lead to other halos in the vicinity falsely being identified as subhalos. However, we find no evidence for such an issue. Figure 10 shows the average radius and mass evolution before infall for all (future) subhalos with more than 1000 particles in the WMAP7 cosmology. As in Figure 7, we observe a smooth mass evolution in bound-only ${M}_{200{\rm{m}}}$ (blue lines) and a sudden growth of all-particle ${M}_{200{\rm{m}}}$ shortly before infall (purple lines); ${R}_{200{\rm{m}},\mathrm{all}}$ follows this growth by construction. The splashback radii (orange and red lines), on the other hand, do not exhibit any sudden growth. The splashback mass is independent of the radius and does include material in the vicinity of the host, leading to a gradual increase in ${M}_{\mathrm{sp}}$ during the last dynamical time before the merger. The mean (dashed lines) grows much more sharply than the median, indicating a strongly asymmetric distribution with a tail toward large mass increases. We conclude that our splashback radii and masses behave as expected in the vicinity of mergers, at least on average.

**Figure 10.** Evolution of SO and splashback radii (top) and masses (bottom) prior to infall into a larger halo. The figure corresponds to the left-hand side of Figure 7 for ${M}_{200{\rm{m}},\mathrm{bnd}}$ and ${M}_{200{\rm{m}},\mathrm{all}}$ (blue and purple lines), but we are not showing the evolution after infall because splashback radii are not physically meaningful for subhalos. The solid lines show the median evolution, the dashed lines the mean. For SO definitions such as ${M}_{200{\rm{m}},\mathrm{all}}$ , the inclusion of material belonging to the future host leads to a growth in both radius and mass. For the splashback definitions (red and orange lines), the situation is quite different: while the mean mass does grow significantly, the splashback radius follows a smooth trajectory that does not seem to be impacted much by the future merger.
Download figure:
Standard image High-resolution image

**Figure 10.** Evolution of SO and splashback radii (top) and masses (bottom) prior to infall into a larger halo. The figure corresponds to the left-hand side of Figure 7 for ${M}_{200{\rm{m}},\mathrm{bnd}}$ and ${M}_{200{\rm{m}},\mathrm{all}}$ (blue and purple lines), but we are not showing the evolution after infall because splashback radii are not physically meaningful for subhalos. The solid lines show the median evolution, the dashed lines the mean. For SO definitions such as ${M}_{200{\rm{m}},\mathrm{all}}$ , the inclusion of material belonging to the future host leads to a growth in both radius and mass. For the splashback definitions (red and orange lines), the situation is quite different: while the mean mass does grow significantly, the splashback radius follows a smooth trajectory that does not seem to be impacted much by the future merger.
Download figure:
Standard image High-resolution image

5.6. An Updated Model for the Splashback–SO Relation

As discussed in Section 3.1, we have made improvements to the Sparta code that change the splashback radii and masses by a few percent. Thus, we recalibrate the Paper II model including the following changes. We calculate mass accretion rates using Moria as described in Section 4.3. Including halos that were subhalos or did not exist one dynamical time ago helps to extend the high-z samples, especially at high accretion rates. We have confirmed that excluding backsplash halos makes a small difference and thus leave them in the sample. Since we are constraining ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ , we should use only halos for which ${R}_{200{\rm{m}}}$ and ${M}_{200{\rm{m}}}$ are reliably measured. In particular, we wish to avoid halos that include a significant contribution from a neighboring halo, which is relatively common at low masses (Sections 5.2 and 5.3). We could sidestep this issue by using bound-only masses, but there are good reasons to calibrate the relationship against all-particle quantities. First, bound-only masses cannot be measured observationally and depend on the unbinding algorithm; second, particle orbits (and thus ${R}_{\mathrm{sp}}$ ) react to the total mass inside their radius rather than the bound mass; and third, all-particle masses are more clearly related to the halo density profile and the steepening feature. Thus, we use all-particle radii and masses but exclude halos for which ${M}_{200{\rm{m}},\mathrm{all}}$ is more than 20% greater than ${M}_{200{\rm{m}},\mathrm{bnd}}$ . While this cut may seem strict, including halos with mass increases of up to 50% visibly increases the scatter in the lowest peak height bins, indicating that the splashback–SO relation is erratic in such halos. The higher peak height bins are barely affected by the cut. As in Paper II, we use only halos with ${N}_{200{\rm{m}}}\gt 1000$ and with successful ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ measurements from Sparta (as opposed to interpolated values).

With this halo sample in hand, we repeat the fitting procedure of Paper II. In particular, we fit the splashback results from both the WMAP7 and Planck cosmologies at redshifts 0.13, 0.3, 0.5, 1, 2, 4, and 8, avoiding z = 0 owing to the corrections at the final snapshots (Section 3.1). We bin ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ as a function of mass accretion rate ( $0\lt {{\rm{\Gamma }}}_{\mathrm{dyn}}\lt 12$ ) and peak height (with bin edges at 0.5, 1, 1.5, 2, and 3). We use a least-squares fit to minimize the difference between the binned relations and our fit. We combine the statistical uncertainty in each bin (due to the finite number of halos) with an additional 1% systematic error in quadrature. This systematic error prevents high-occupation bins from dominating the fit and captures a number of inaccuracies in our methodology. The fitting function remains the same as in Equations (5)–(7) in Paper II. In particular, the relationships between ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ and the splashback quantities ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ (which we summarily call ${X}_{\mathrm{sp}}$ ) are fit with the general function

$\begin{eqnarray}&&{X}_{\mathrm{sp}}={A}_{0}+{{Be}}^{-{{\rm{\Gamma }}}_{\mathrm{dyn}}/C},\end{eqnarray} \tag{ 3 }$

where A₀, B, and C are free parameters. The latter two are functions of peak height and ${{\rm{\Omega }}}_{{\rm{m}}}(z)$ ,

$\begin{eqnarray}\begin{array}{rcl}B & = & ({B}_{0}+{B}_{{\rm{\Omega }}}{{\rm{\Omega }}}_{{\rm{m}}})\times (1+{B}_{\nu }\nu )\\ C & = & ({c}_{0}+{C}_{{\rm{\Omega }}}{{\rm{\Omega }}}_{{\rm{m}}}+{C}_{{\rm{\Omega }}2}{{\rm{\Omega }}}_{{\rm{m}}}^{2})\times (1+{C}_{\nu }\nu +{C}_{\nu 2}{\nu }^{2}).\end{array}\end{eqnarray} \tag{ 4 }$

We separately fit ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ , as well as the mean-based definition and percentiles, leading to four sets of best-fit parameters. The dependence of the parameters on the percentile is parameterized in terms of p, the percentile divided by 100,

$\begin{eqnarray}\begin{array}{rcl}{A}_{0} & = & {a}_{0}+{a}_{{\rm{p}}}\times p\\ {B}_{0} & = & {b}_{0}+{b}_{{\rm{p}}}\times p\\ {B}_{{\rm{\Omega }}} & = & {b}_{{\rm{\Omega }}}+{b}_{{\rm{\Omega }}{\rm{p}}}\times \exp ({b}_{{\rm{\Omega }}{\rm{p}}2}\times p)\\ {B}_{\nu } & = & {b}_{\nu }+{b}_{\nu {\rm{p}}}\times p\\ {C}_{{\rm{\Omega }}} & = & {c}_{{\rm{\Omega }}}+{c}_{{\rm{\Omega }}{\rm{p}}}\times \exp ({c}_{{\rm{\Omega }}{\rm{p}}2}\times p)\\ {C}_{{\rm{\Omega }}2} & = & {c}_{{\rm{\Omega }}2}+{c}_{{\rm{\Omega }}2{\rm{p}}}\times \exp ({c}_{{\rm{\Omega }}2{\rm{p}}2}\times p)\\ {C}_{\nu } & = & {c}_{\nu }+{c}_{\nu {\rm{p}}}\times p\\ {C}_{\nu 2} & = & {c}_{\nu 2}+{c}_{\nu 2{\rm{p}}}\times p.\end{array}\end{eqnarray} \tag{ 5 }$

We also fit for the logarithmic scatter in the relation,

$\begin{eqnarray}&&{\sigma }_{\mathrm{sp}}={\sigma }_{0}+{\sigma }_{{\rm{\Gamma }}}{{\rm{\Gamma }}}_{\mathrm{dyn}}+{\sigma }_{\nu }\nu +{\sigma }_{{\rm{p}}}p.\end{eqnarray} \tag{ 6 }$

The new best-fit parameters are given in Table 4; some parameters are not necessary in all four fits as indicated by zeros. The new model is implemented in the publicly available Colossus code. The fit quality is very similar to that reported in Paper II, namely, better than 5% essentially everywhere in the Γ−ν− ${{\rm{\Omega }}}_{{\rm{m}}}$ parameter space where we have data. ${{\rm{\Delta }}}_{\mathrm{sp}}$ is derived from ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ , with a combined uncertainty of 15% or better. For a visual impression of these accuracies, we refer the reader to Figure 3 of Paper II. The model is valid for $0\lt {{\rm{\Gamma }}}_{\mathrm{dyn}}\lt 12$ and any percentile between 50 and 90 (higher percentiles become extremely noisy and were thus not computed by Sparta). We have checked the model against both the WMAP7 and Planck cosmologies and find similarly good agreement. The model does not describe self-similar simulations, although it comes close for slopes of n ≈ −2.5 that resemble ΛCDM (Paper II).

Table 4. Best-fit Parameters for the Splashback–SO Model

Parameter	${R}_{\mathrm{sp},\mathrm{mn}}$	${R}_{\mathrm{sp}, \% }$	${M}_{\mathrm{sp},\mathrm{mn}}$	${M}_{\mathrm{sp}, \% }$
Parameters for ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$

a₀	0.6597	0.3071	0.6962	0.2874
b₀	0.5562	0.2508	0.3736	0.6616
${b}_{{\rm{\Omega }}}$	0.1141	0.1527	0.3005	0.1321
${b}_{\nu }$	0.0698	0.1956	0	0
c₀	−0.8508	−1.2214	3.3445	4.5913
${c}_{{\rm{\Omega }}}$	18.4464	17.5374	1.3718	3.0928
${c}_{\nu }$	−0.3332	0	−0.0825	−0.1155
${c}_{{\rm{\Omega }}2}$	−10.0596	−10.3158	0	0
${c}_{\nu 2}$	0.0474	−0.0189	0	0

Meta-parameters for Dependence on Percentile

${a}_{{\rm{p}}}$	0	0.6428	0	0.8228
${b}_{{\rm{p}}}$	0	0.5074	0	−0.6567
${b}_{{\rm{\Omega }}{\rm{p}}}$	0	0	0	0.0032
${b}_{{\rm{\Omega }}{\rm{p}}2}$	0	0	0	4.9536
${b}_{\nu {\rm{p}}}$	0	−0.2128	0	0.2882
${c}_{{\rm{\Omega }}{\rm{p}}}$	0	0.0024	0	−0.6609
${c}_{{\rm{\Omega }}{\rm{p}}2}$	0	9.7115	0	−1.1050
${c}_{{\rm{\Omega }}2{\rm{p}}}$	0	−0.0005	0	0
${c}_{{\rm{\Omega }}2{\rm{p}}2}$	0	10.7626	0	0
${c}_{\nu {\rm{p}}}$	0	−0.4735	0	−0.3761
${c}_{\nu 2{\rm{p}}}$	0	0.0940	0	0.0784

Parameters for 68% Scatter (in dex)

${\sigma }_{0}$	0.0501	0.0419	0.0456	0.0224
${\sigma }_{{\rm{\Gamma }}}$	0.0035	0.0043	0.0017	0.0009
${\sigma }_{\nu }$	−0.0108	−0.0141	−0.0079	−0.0091
${\sigma }_{{\rm{p}}}$	0	0.0235	0	0.0454

Download table as: ASCII Typeset image

Figure 11 shows a visual comparison of the old and new fitting functions. The differences in ${R}_{\mathrm{sp}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{sp}}/{M}_{200{\rm{m}}}$ are below 5% for all redshifts, masses, and cosmologies that we tested and below 2% for most of the parameter space. The differences in ${{\rm{\Delta }}}_{\mathrm{sp}}$ (which is derived from ${R}_{\mathrm{sp}}$ and ${M}_{\mathrm{sp}}$ ) multiply to a maximum of 15% but are typically around 5%. There is no systematic upward or downward trend in the results. These small differences confirm that, despite numerous changes and bug fixes, Sparta's algorithm to determine ${R}_{\mathrm{sp}}$ is robust.

We have also updated our fit to the mass accretion rate as a function of peak height and redshift. This formula is convenient when computing splashback properties without any knowledge of the mass accretion rate,

$\begin{eqnarray}&&{{\rm{\Gamma }}}_{\mathrm{dyn}}=A\nu +B{\nu }^{3/2},\end{eqnarray} \tag{ 7 }$

where

$\begin{eqnarray}\begin{array}{rcl}A & = & 1.1721+0.3255z\\ B & = & -0.2565+0.0932z-0.0571{z}^{2}+0.0042{z}^{3}.\end{array}\end{eqnarray} \tag{ 8 }$

The differences to the fit from Paper II are minuscule, but the new fit is more consistent with the parameters in Table 4.

5.7. Splashback and the Radius of Steepest Slope

While we have extensively investigated the relationship between Sparta's dynamically determined ${R}_{\mathrm{sp}}$ and SO radii, we have not yet established a connection to the most commonly used definition of the splashback radius: the radius where the logarithmic slope of the spherically averaged density profile is steepest, R_steep (Figure 1). The splashback radius was first discovered as a result of the sharp drop in density (Diemer & Kravtsov 2014), first quantified according to this definition (More et al. 2015), and all observational detections have hitherto relied on the density profile (More et al. 2016, and following work). Moreover, in theoretical models, there is no ambiguity between R_steep and the radius where the most recently accreted particles reach their first apocenter (Adhikari et al. 2014; Shi 2016). In realistic halos, however, ${R}_{\mathrm{sp}}$ and ${R}_{\mathrm{steep}}$ are not equivalent because the apocenters are spread out over a significant radial range owing to nonsphericity, complex accretion histories, and interactions with other halos (Paper I). Similarly, the density drop represents a trade-off between the radially decreasing density of orbiting particles (or galaxies) and the stream of infalling material at large radii. As a result, it is not clear that there is a one-to-one correspondence between features measured in density profiles (or correlation functions) and the dynamically determined splashback radii from Sparta (Xhakaj et al. 2019a; Aung et al. 2020; Garcia et al. 2020).

We attempt to quantify the connection in Figure 12, where we show the percentiles that most closely match R_steep as approximated by the formula of More et al. (2015). This fitting function was derived from the averaged (median) density profiles of halos binned in accretion rate and peak height. To be comparable to observations, the profiles included subhalo particles. More et al. (2015) used the Diemer & Kravtsov (2014) definition of accretion rate, ${{\rm{\Gamma }}}_{\mathrm{DK}14}$ , rather than ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ . There, the time interval was defined by fixed redshift intervals instead of the dynamical time. To evaluate the fitting function, we crudely convert ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ to ${{\rm{\Gamma }}}_{\mathrm{DK}14}$ based on linear fits to the median relation in the WMAP7 sample, which is well approximated as

$\begin{eqnarray}&&{{\rm{\Gamma }}}_{\mathrm{DK}14}={{\rm{\Gamma }}}_{0}+\gamma {{\rm{\Gamma }}}_{\mathrm{dyn}}.\end{eqnarray} \tag{ 9 }$

At redshifts 0 and 0.5, the accretion rates are virtually identical because both methods choose the same redshift to compare the current masses to. For the higher redshifts defined in Diemer & Kravtsov (2014), z = [1, 2, 4], we find Γ₀ = [0.51, 1.43, 0.88] and γ = [0.74, 0.53, 0.78], respectively. We now obtain ${R}_{\mathrm{steep}}/{R}_{200{\rm{m}}}$ and ${M}_{\mathrm{steep}}/{M}_{200{\rm{m}}}$ from the More et al. (2015) fit at each z and ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ and find the percentile where the fitting function of Equation (3) most closely matches these values (if such a value exists between the 50th and 90th percentiles). We also evaluate the scatter using Equation (6) and find the range of percentiles where R_steep and M_steep lie within the 1σ scatter (shaded areas in Figure 12).

**Figure 12.** Percentiles of particle splashback events that best match the radius of steepest slope according to the formula of More et al. (2015). Results are shown for z = 0 (left) and z = 1 (right), as well as for radius (top) and mass (bottom). The dependencies on mass accretion rate, halo mass, and redshift are complicated, highlighting that there is no one percentile that corresponds to the radius of steepest slope in general.
Download figure:
Standard image High-resolution image

Clearly, the relationship between dynamical ${R}_{\mathrm{sp}}$ and ${R}_{\mathrm{steep}}$ depends on accretion rate, mass, and redshift in a complex fashion. For example, Figure 12 explains the trend we noticed in the example of Figure 1, where higher ${{\rm{\Gamma }}}_{\mathrm{dyn}}$ drives ${R}_{\mathrm{steep}}$ toward a lower percentile. The mass dependence is driven solely by Sparta's measurements because the More et al. (2015) formula does not depend on mass, a conclusion that should be revisited with improved simulation data. The ranges where the scatter around the medians matches ${R}_{\mathrm{steep}}$ are large, which reflects the fact that the differences between the percentiles are comparable to the scatter in the relation. Nevertheless, the median relations are statistically well defined.

We conclude that there is no one percentile value that can be used to mimic ${R}_{\mathrm{steep}}$ and ${M}_{\mathrm{steep}}$ . The exact conversion will depend not only on accretion rate, mass, and redshift but also on the exact fitting procedure used to derive R_steep and on whether the profile of dark matter particles or subhalos is considered (Xhakaj et al. 2019a). We leave a more detailed calibration of the conversion for future work. For practical purposes, percentiles such as those in Figure 12 can be obtained by numerically matching the fitting functions of More et al. (2015) and Section 5.6 for a given accretion rate, mass, and redshift, for example, using the implementations in Colossus.

6. Discussion: Opportunities and Limitations

We have presented publicly available halo catalogs and merger trees for the Erebos simulations with the purpose of enabling the community to explore the impact of mass and radius definitions. In this section, we discuss the advantages and limitations of this data set.

Our catalogs present new opportunities because they are, to the best of our knowledge, the first to contain SO definitions with multiple thresholds, bound-only and all-particle definitions, splashback radii and masses, and the new tracer mass definition for subhalos. Previously, exploring the impact of radius definition often meant running a halo finder with multiple density thresholds and manually matching the resulting catalogs (e.g., Villarreal et al. 2017). Our catalogs obviate the need for such complex procedures, as they contain multiple host–subhalo relations side by side.

Similarly, our new tree format makes it easy to analyze the evolution of halos. For instance, a few simple lines of Python code can pick out all epochs where subhalos fall into their hosts and plot the evolution of their masses before or after such events. The uniform catalog format and nature of the Erebos simulations make them perfect for combining data from different box sizes, as demonstrated in Section 5 and in previous works. With box sizes spanning from 62.5 to $2000\,{h}^{-1}\,\mathrm{Mpc}$ at z = 0, even a strict limit of 1000 particles per halo results in a resolved mass range of more than five orders of magnitude (see also Mansfield & Avestruz 2020). The self-similar universes do not represent realistic cosmologies but allow the user to explore the behavior of structure in extreme, yet easy-to-understand, limits of the power spectrum shape (e.g., Diemer & Joyce 2019).

However, no set of simulations is appropriate for all types of investigation. The most obvious limitation of the Erebos simulations is their particle number. With 1024³ ≈ 1 billion particles per simulation, they yield more than sufficient number statistics at any given mass, but they are dwarfed by recent ultralarge simulations such as Millennium-XXL, DarkSky, Multidark-Planck, or Outer Rim (Angulo et al. 2012; Skillman et al. 2014; Klypin et al. 2016; Heitmann et al. 2019). Increasing the particle number does not necessarily add much to the already sufficient statistics on host halos, but larger simulations can resolve a wider range of subhalo-to-host mass ratios. For the particular use case of small subhalos in large hosts, combining simulations of different box sizes does not help. On the other hand, the more moderate particle numbers of Erebos mean that we can store 100 snapshots per simulation, enabling the kind of dynamical analysis performed by Sparta. Another limitation is that the Erebos suite contains only two ΛCDM cosmologies, meaning that it cannot be used for many cosmological analyses such as forecasts on parameter constraints. Large suites of simulations with different cosmologies exist for such purposes, but generally only a few snapshots are stored (e.g., Heitmann et al. 2010; DeRose et al. 2019; Nishimichi et al. 2019).

Finally, we caution the user regarding certain limitations of our catalogs. We have tried to exclude poorly resolved halos with our cut of ${N}_{200{\rm{m}},\mathrm{peak}}\geqslant 200$ , but different definitions converge at different resolutions. For example, splashback radii and masses are reliable for halos with at least 500, or ideally 1000, particles. The cutoff can also lead to selection effects at the low-mass end. We recommend to always compare simulations with different resolutions at fixed mass; any nonconvergence typically shows up as disagreements between the boxes (see, e.g., Diemer 2020b, 2020a; Mansfield & Avestruz 2020). Similarly, not all definitions make sense for all types of halos: all-particle SO and splashback masses and radii should be trusted for hosts only, while bound-only and tracer masses are applicable to subhalos. Due to a correction applied to splashback radii in the final snapshots of a simulation, the splashback data should ideally be used at z ≥ 0.13 in the ΛCDM simulations, or about 0.2 dynamical times before the end of the simulation in general. We also note that the snapshot spacing in the self-similar simulations corresponds to only about four snapshots per dynamical time. Such sparse coverage can lead to biases of a few percent in the splashback radii (Paper I). Finally, due to a bug, the positions of ghost halos are missing for the final snapshot of each simulation. This error will be fixed in future iterations of the catalogs.

7. Conclusions

We have introduced Moria, an extension to the Sparta code framework that creates catalogs and merger trees with an arbitrary number of mass and radius definitions, as well as the corresponding host–subhalo relations. We present catalogs and merger trees that are based on Rockstar calculations but also include SO definitions with all and bound particles, splashback definitions, and subhalo masses computed using a novel particle tracking scheme. The Sparta code and our catalogs are publicly available at benediktdiemer.com. Our main conclusions are as follows:

1.
We have introduced a merger tree format based on 2D arrays in compressed hdf5 files, which facilitates the extraction of information at fixed time and for a particular halo.
2.
Our catalogs are almost entirely complete for all sensible SO masses and mostly complete for splashback definitions, partly due to Moria's algorithms for interpolation and model estimation.
3.
The halo boundary definition profoundly affects all aspects of halo masses, radii, and subhalo assignments. This statement holds both for the difference between SO and splashback definitions and for different SO thresholds.
4.
We confirm that gravitational unbinding is not important for host halos but critical for subhalos. We compare different unbinding algorithms and conclude that the results depend strongly on a prior determination of halo membership.
5.
We update our previous model of the splashback–SO connection. Despite numerous improvements to the splashback calculations, the results change by only a few percent, highlighting the robustness of the Sparta algorithm.
6.
We show that there is a complex relation between the dynamically determined splashback radii from Sparta and the steepest slope in the density profile.

The main purpose of this paper is to introduce new algorithms and data products; we have deferred most scientific questions to future work. In two companion papers, we investigate the impact of the halo boundary definition on mass functions (Diemer 2020b) and subhalo abundances (Diemer 2020a). We will also present our algorithms for subhalo particle tracking and ghost halos (B. Diemer & P. S. Behroozi 2020, in preparation) and analyze the correlation between splashback and other halo properties (T.-H. Shin & B. Diemer 2020, in preparation). We anticipate further investigations of the galaxy–halo connection, semianalytic models of galaxy formation, and assembly bias. Most importantly, however, we encourage the community to use our catalogs to probe the impact of mass definition on any area of structure formation.

I am deeply indebted to Peter Behroozi for making Rockstar publicly available, for generously allowing me to adapt his kd-tree algorithm in Sparta, and for many enlightening discussions. I am grateful to Han Aung, Andrew Hearin, Philip Mansfield, Daisuke Nagai, and Enia Xhakaj for valuable feedback on a draft and to Alexie Leauthaud, Rüdiger Pakmor, and Tae-Hyeon Shin for helpful conversations. This work was partially completed during the coronavirus lockdown and would not have been possible without the essential workers who did not enjoy the privilege of working from the safety of their homes. All computations were run on the Midway computing cluster provided by the University of Chicago Research Computing Center. This research made extensive use of the Python packages NumPy (Oliphant 2006), SciPy (Virtanen et al. 2020), Matplotlib (Hunter 2007), and Colossus (Diemer 2018). This research was supported in part by the National Science Foundation under grant No. NSF PHY-1748958. Support for program No. HST-HF2-51406.001-A was provided by NASA through a grant from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555.

The Splashback Radius of Halos from Particle Dynamics. III. Halo Catalogs, Merger Trees, and Host–Subhalo Relations

Article metrics

Permissions

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Abstract

1. Introduction

2. Simulation Data

3. New Algorithms in SPARTA

3.1. Computing the Splashback Radius

3.2. SO Masses (All Particles)

3.3. SO Masses (Bound Particles)

3.4. Ghosts and Tracer Masses

4. The MORIA Extension

4.1. General Code Design

4.2. Increasing the Completeness of Splashback Data

4.3. Mass Accretion Rates

4.4. Host–Subhalo Relations

4.5. Merger Tree Format

4.6. Limits on Mass and Halo Status

5. Results

5.1. Halo Catalogs and Merger Trees

5.2. Completeness

5.3. SO Definitions: All-particle versus Bound-only

5.4. SO Definitions: Unbinding Algorithm

5.5. Splashback Definitions

5.6. An Updated Model for the Splashback–SO Relation

5.7. Splashback and the Radius of Steepest Slope

6. Discussion: Opportunities and Limitations

7. Conclusions

Footnotes

The Splashback Radius of Halos from Particle Dynamics. III. Halo Catalogs, Merger Trees, and Host–Subhalo Relations

Article metrics

Permissions

Share this article

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Abstract

1. Introduction

2. Simulation Data

3. New Algorithms in SPARTA

3.1. Computing the Splashback Radius

3.2. SO Masses (All Particles)

3.3. SO Masses (Bound Particles)

3.4. Ghosts and Tracer Masses

4. The MORIA Extension

4.1. General Code Design

4.2. Increasing the Completeness of Splashback Data

4.3. Mass Accretion Rates

4.4. Host–Subhalo Relations

4.5. Merger Tree Format

4.6. Limits on Mass and Halo Status

5. Results

5.1. Halo Catalogs and Merger Trees

5.2. Completeness

5.3. SO Definitions: All-particle versus Bound-only

5.4. SO Definitions: Unbinding Algorithm

5.5. Splashback Definitions

5.6. An Updated Model for the Splashback–SO Relation

5.7. Splashback and the Radius of Steepest Slope

6. Discussion: Opportunities and Limitations

7. Conclusions

Footnotes