Probabilistic Association of Transients to their Hosts (PATH)

, , , , , , and

Published 2021 April 20 © 2021. The American Astronomical Society. All rights reserved.
, , Citation Kshitij Aggarwal et al 2021 ApJ 911 95 DOI 10.3847/1538-4357/abe8d2

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/911/2/95

Abstract

We introduce a new method to estimate the probability that an extragalactic transient source is associated with a candidate host galaxy. This approach relies solely on simple observables: sky coordinates and their uncertainties, galaxy fluxes, and angular sizes. The formalism invokes Bayes' rule to calculate the posterior probability $P\left({O}_{i}| x\right)$ from the galaxy prior P(O), observables x, and an assumed model for the true distribution of transients in/around their host galaxies. Using simulated transients placed in the well-studied Cosmic Evolution Survey field, we consider several agnostic and physically motivated priors and offset distributions to explore the method sensitivity. We then apply the methodology to the set of 13 fast radio bursts (FRBs) localized with an uncertainty of several arcseconds. Our methodology finds nine of these are securely associated to a single host galaxy, $P\left({O}_{i}| x\right)\gt 0.95$. We examine the observed and intrinsic properties of these secure FRB hosts, recovering distributions similar to those found in previous works. Furthermore, we find a strong correlation between the apparent magnitude of the securely identified host galaxies and the estimated cosmic dispersion measures of the corresponding FRBs, which results from the Macquart relation. Future work with FRBs will leverage this relation and other measures from the secure hosts as priors for future associations. The methodology is generic to transient type, localization error, and image quality. We encourage its application to other transients where host galaxy associations are critical to the science, e.g., gravitational wave events, gamma-ray bursts, and supernovae. We have encoded the technique in Python on GitHub: https://github.com/FRBs/astropath.

Export citation and abstract BibTeX RIS

1. Introduction

Transient phenomena offer terrific potential to explore astrophysical processes on the smallest scales and in the most extreme conditions. This includes spectacular explosions, intense magnetic fields, gravity in the strong limit, and the structure of dark matter. Given the very short timescales, the majority of these events are linked to compact objects, e.g., neutron stars and black holes (e.g., Fishman & Meegan 1995; Cordes & Chatterjee 2019; Gal-Yam 2019), and therefore they provide unique insight to the processes that generate and destroy these exotic bodies.

The three-dimensional location of transient sources is a critical aspect influencing the interpretation of their nature, allowing measured properties to be translated into absolute energetics and determining the nature of their environment. While many transient sources can be reasonably well localized on the sky, depending on the nature of the discovery instrument (and any sufficiently prompt follow-up), the third dimension of distance is often challenging to obtain. For some transient phenomena—supernovae (SNe), the afterglows of gamma-ray bursts (GRBs)—spectra of their electromagnetic emission can be used to identify their redshift (e.g., Blondin & Tonry 2007; Fynbo et al. 2009). Many other transients, however, encode no direct measure of the source redshift. This includes the enigmatic fast radio bursts (FRBs) whose dispersion measures (DMs) imply a cosmological origin (Lorimer et al. 2007), yet do not provide a precise redshift estimate (e.g., McQuinn 2014; Prochaska & Zheng 2019). Another example includes short-duration GRBs whose afterglows are too faint to record a high signal-to-noise ratio (S/N) spectrum for precise redshift estimation (e.g., Fong et al. 2010).

When a redshift cannot be measured based on the transient source itself, a suitable alternative is to associate the transient event with a galaxy and then measure the galaxy redshift (e.g., Tendulkar et al. 2017). This presumes, of course, that the transient is generated in (or at least near) a galaxy—a reasonable assumption for compact objects which, aside from exotic and unproven phenomena such as primordial black holes, are born in the dense regions of galaxies. Some progenitors of transient sources may travel considerable distances from their birth sites, for instance, via "kicks" during formation events or disruptions of stellar multiples. For most such cases, however, offsets of up to tens of kiloparsecs or several arcseconds on the sky can be expected for distant events (e.g., Fong & Berger 2013).

The process of associating a transient to its host galaxy is a nontrivial exercise. Primarily, this is influenced by the uncertainty in the transient localization combined with the relatively high surface density of galaxies on the sky, meaning that the allowed region for the transient site can potentially overlap with multiple galaxies. The initially unknown origins of many such phenomena—including the transients of interest to us, FRBs—further compounds the problem by introducing an uncertainty in the characteristic offset of a source from the center of a galaxy.

To date, host associations for transient sources have focused on the probability of chance association. Bloom et al. (2002) introduced the concept of a chance probability Pc to ascertain the likelihood that a given galaxy was a coincident association to a transient event. By inference, a galaxy with a very low Pc value might be considered the host, while galaxies with Pc ∼ 1 may be disfavored as unrelated sources. Tunnicliffe et al. (2014) advanced this approach by allowing for galaxy–galaxy clustering, which modifies the random incidence from a strictly Poisson process. Most recently, Eftekhari & Berger (2017) discussed this approach in the context of FRBs, emphasizing the need for subarcsecond localizations for secure host galaxy associations.

While we are primarily motivated by FRB science, the formalism introduced here is general, and we identify obvious applications to other transients, e.g., GRBs and gravitational wave (GW) events. Our guiding principles for the development of a new methodology to assess host associations are to:

  • 1.  
    Be driven by simple observables (which are defined below).
  • 2.  
    Assign a posterior probability to every candidate galaxy in consideration.
  • 3.  
    Develop an extendable framework that can evolve as the field matures. This includes incorporation of additional observational constraints and priors.
  • 4.  
    Accommodate transients both in the local (hundred megaparsecs) and very distant universe.
  • 5.  
    Allow for insufficient data, e.g., the non-detection of the host galaxy due to imaging depth.

In the following, we strive to limit the analysis to these easily attainable, direct observables:

  • 1.  
    The transient localization (R.A., decl., uncertainty ellipse): αFRB, δFRB, epsilonFRB.
  • 2.  
    The apparent magnitudes of the galaxy candidates: mi .
  • 3.  
    The candidate galaxy coordinates: αi , δi .
  • 4.  
    The angular size of the galaxy candidates: ϕi .

Future work will consider additional observables (e.g., the FRB DM) and also priors based on "secure" host associations.

This paper introducing the probabilistic association of transients to hosts (PATH) is organized as follows: Section 2 briefly reviews the historical approaches to associations, Section 3 introduces our new method, Section 4 defines the priors adopted for our FRB analysis, Section 5 presents analysis of simulated transients, and Section 6 applies the formalism to real FRBs. Throughout we adopt the Planck Collaboration et al. (2016) cosmology as encoded in astropy.

2. Historical: Chance Probability

A standard approach to associating transients to their host galaxies is through assessing the chance probabilities Pc of galaxies being located close to the transient position. We reintroduce the formalism for evaluating Pc here, propose a new variant, and comment further on its application and limitations.

2.1. Formalism

Figure 1 shows a Very Large Telescope/FOcal Reducer and low dispersion Spectrograph 2 (VLT/FORS2) g-band cutout image (30'' × 30'') of the field surrounding FRB 180924 with its localization marked by a red circle. As described by Bannister et al. (2019), this localization lies ≈1'' from the centroid of galaxy DES J214425.25−405400.81 and within 5'' of two additional galaxies. At their redshifts (Bannister et al. 2019), the galaxies all have projected separations of less than 40 kpc, i.e., separations less than the estimated radii of their halos. Therefore, while one may be predisposed to assign FRB 180924 to the brighter and closer galaxy, one should also consider the possibility that the FRB occurred in the stellar halo of one of the others.

Figure 1.

Figure 1. Cutout g-band image (VLT/FORS2) centered on FRB 180924 (small red circle indicates the 5σ localization). There are four extended sources with a separation of θ < 10'' marked as candidates for the host galaxy of the FRB. These are labeled (just above them, except for the brightest) by their θ, angular size ϕ, apparent magnitude m, and the chance probability Pc of an association.

Standard image High-resolution image

The chance probability approach is powerfully simple: estimate the Poisson probability of finding one or more galaxies as bright or brighter within an effective search area A around the FRB, with A determined from the angular size ϕ, the separation θ, and the FRB localization uncertainty σFRB. Namely, one defines the probability of a chance coincidence

Equation (1)

where $\bar{N}$ is the average number of sources in A. It is given by

Equation (2)

where Σ(m) is the angular surface density of galaxies on the sky with magnitude mmi , and θeff is the effective search radius ($A=\pi {\theta }_{\mathrm{eff}}^{2}$). For the former, we adopt the galaxy number count distribution of Driver et al. (2016), while the latter quantity bears some arbitrariness.

Previous works have considered several definitions for θeff. We introduce yet another more conservative one here: the quadrature sum of all three angular quantities with semi-arbitrary weightings,

Equation (3)

Adopting these, we estimate Pc = 0.01 for DES J214425.25−405400.81 and Pc > 0.1 for the other sources owing to their fainter magnitudes and larger angular separations. From this perspective, the probability of a chance association is nearly negligible for DES J214425.25−405400.81 and sufficiently large for the other sources that one is inclined to favor the former. However, this technique cannot assign a likelihood to this association, nor even a relative assessment of one source over another. Yet worse, if two or more candidates have low Pc (e.g., FRB 181112; Prochaska et al. 2019), one has no means to favor one over the other. Last, the Pc formalism does not naturally allow one to introduce additional observational measures as evidence (e.g., DM). Together, these considerations motivate our development of a full probabilistic treatment.

2.2. Nuisances and Nuances

There are several aspects of the Pc analysis that require further definition. For completeness, we describe these here, although Pc does not formally enter into the new formalism. First, one requires measurements of the galaxy centroids and angular size. We advocate a nonparametric approach owing to the complexity of galaxy morphology. In the following, we adopt the centroiding algorithm encoded in the photutils software package, and use the semimajor_axis_sigma parameter to estimate the angular size. In the following, we will refer it as a_image as it matches the definition of that parameter in the more widely used SExtractor package.

Another issue is Galactic extinction. FRBs are detected across the sky including on sightlines that show large Galactic extinction (E(BV) > 0.1 mag). In contrast, the number count analyses have intentionally been derived from high-latitude fields with low Galactic extinction. Therefore, the apparent magnitudes of the galaxy candidates should be corrected for Galactic extinction prior to the probability estimation.

We provide the following set of recommendations to optimize the detection and characterization of both faint and bright sources. Namely, we recommend observations in the r band, which provide a trade-off between the mapping of stellar content and extinction. We further recommend an image depth of mr = 25.5 (5σ), corresponding to the limiting magnitude for spectroscopy, and sufficient for probing 0.01 L* galaxies at z ∼ 0.5. Finally, given the subarcsecond accuracy of many FRB localizations, we recommend better than 1'' seeing, and a >1' field of view for background estimation.

Similarly, source detection should employ a nonparametric approach to accommodate galaxies of varying morphologies. For consistency in this work, we recommend the use of the a_image parameter to estimate the angular size of sources, and note that all galaxies should be corrected for Galactic extinction prior to applying the Bayesian formalism.

The formalism of Equations (1) and (2) ignores galaxy clustering. Since matter in the universe is not uniformly distributed, the probability of observing either no galaxies, or a large number of them, in proximity to a random direction is enhanced compared to the probability of observing one or a few. Tunnicliffe et al. (2014) showed that including clustering decreases the probability of a nearby random galaxy by 25%–50% in the case of a random direction. Our primary concern however is not whether or not all the observed galaxies are merely chance coincidences, but rather which of the observed galaxies is the true host. Clustering is discussed further in this context in Section 4.2. For now, we remark that given that FRBs truly are associated with galaxies, clustering acts to increase, not decrease, the probability of a chance association, since the FRB observation has preferentially selected a direction of the universe in which there is a cluster of matter.

Furthermore, in this work we ignore for the sake of simplicity the ellipticity of the prospective host galaxies. Our method will be readily adaptable however to such ellipticity, or indeed arbitrarily complex functions, since the approach described below does not rely on any particular functional forms.

3. Probabilistic Approach

Association of transients to galaxies is not like the usual cross identification for which probabilistic methods have been in place for over a decade (Budavári & Szalay 2008). Matching stars and galaxies typically involves asking whether a set of detections (across separate exposures, instruments, telescopes) are of the same celestial object. If they are, their true (latent) directions would have to be the same; see more in the review by Budavári & Loredo (2015).

In strong contrast to that, FRBs are presumed to simply originate from within (or at least near to) galaxies, hence, their true direction should not be required to coincide with the center of a galaxy. While this is admittedly a small difference for the faintest galaxies, resolved extragalactic source are expected to yield different results. Here we consider a general scenario where the shape of galaxies can be incorporated along with a geometric model about from where FRBs would originate within or around galaxies.

3.1. General Formalism

Since FRBs are sparse on the sky, we can study them separately, which also simplifies the following description of our approach. Let us consider a catalog of galaxies across the entire sky and a single FRB that either belongs to one of the many catalog objects (its host galaxy) or it does not, i.e., its host is not detected or not included in the catalog. If U is the event that the FRB's host is unseen, and Oi is the event that the FRB is from galaxy i, their prior probabilities must add up to 1,

Equation (4)

as there are no other possibilities. The single scalar quantity P(U) encodes all the complications that arise from the difference in the radial selection functions of the catalog and the FRB instruments. For now, we assume its value to be known, but note that it could be inferred in a hierarchical fashion when considering multiple FRBs. Also, one could assume a uniform prior for all observable Oi as the simplest possible scenario that essentially ignores any additional information about the galaxies, e.g., magnitude, color, and redshift. We leave such considerations to a future work.

Given a vector x representing all measured properties of the detected FRB, we ask what the posterior probabilities P(Ux) and $P\left({O}_{i}| x\right)$ are for all i. Using Bayes' rule, the unseen posterior is

Equation (5)

where p(xU) is the probability density of the FRB properties given the host is unseen. From hereon, we consider x to represent only the measured FRB direction. Without constraints, x could be anywhere on the sky, hence, it is natural to assume a uniform (isotropic) distribution with a value of 1/4π. Similarly, the posterior probability for object i is

Equation (6)

where p(xOi ) is the probability density function (PDF) of x given that the FRB comes from galaxy i. With data x, this is the marginal likelihood of Oi , which includes the galaxy geometry and the uncertainty of the FRB direction. This key component of the approach is discussed in-depth in the next paragraph. The normalizing constant must be

Equation (7)

to guarantee that these posteriors also add up to 1,

Equation (8)

3.2. Marginal Likelihoods

Let the 3D unit vector ω represent the true and unknown direction of the FRB on the sky. Given that it comes from a particular galaxy, the direction ω has to point somewhere near the host. The function p(ωOi ) captures the physical and geometric model for the FRBs specific to galaxy i, e.g., taking into account its type, distance, orientation, etc.

The observed FRB direction x is a measurement of ω with known uncertainty, represented by the localization error function, L(xω). Given p(ωOi ), p(xOi ) is calculated by integrating over the ω model directions to obtain the marginalized likelihood of the association hypothesis Oi ,

Equation (9)

which now accounts for all possibilities in various FRB origins as well as the astrometric uncertainty in the measurement. If a galaxy is unresolved, p(ωOi ) may become the Dirac-δ, and p(xOi ) is just the astrometric uncertainty, as it would be the case for matching point sources. Calculating the above quantity for all i completes the framework, which now provides posterior probabilities via Equation (6).

3.3. Limited Field of View

Previously we assumed a galaxy catalog over the entire sky, but catalogs typically have more limited footprints. It is interesting to think about a scenario when the field of view Ω is smaller but still large enough not to miss any possible counterparts to the FRB in question. Intuitively, galaxies very far away from the FRB should not have any effect on the association analysis.

Smaller sky coverage would mean fewer observed galaxies Nc , which in turn affect the P(Oi ) priors as there are fewer galaxies to choose from. Going with the previous uniform assumption, Equation (4) implies

Equation (10)

which captures the dependence.

Looking now back at Bayes' rule in Equation (5), the sky coverage seems to affect the $P\left({O}_{i}| x\right)$ posteriors, too. Fortunately, this is not the case. The denominator p(x) changes in accord due to the scaling in p(xU), which is uniform over the field of view,

Equation (11)

where the 1Ω(x) is the indicator function that takes the value 1 if x is within the field of view and 0 otherwise, and the denominator Ω normalizes the PDF. The framework not only matches common-sense expectations, but provides for more efficient computation where only candidates within close proximity of FRBs are considered.

Given the plethora of deep, wide-field imaging surveys, it is possible to consider very large Ω for each FRB in the analysis. In practice, however, sensible and conservative assumptions for p(ωOi ) will greatly limit the list of viable candidates Oi . To ease the analysis, we adopt for Ω the union of the area encompassing all galaxies within 10 half-light radii and the 99.9% FRB localization. As emphasized in the previous section, it is only necessary to consider a large enough area to be certain to include all possible hosts.

4. Priors and Assumptions

4.1. Undetected Prior P(U)

It is difficult to a priori assign a prior P(U) to the probability that the FRB host is undetected. Undoubtedly, P(U) is related to the depth of the imaging and the (unknown) source distance, i.e., redshift. 12 In the following, we consider an arbitrarily assigned value, and we advocate a low value, based on the paucity of our data and the set of confidently assigned associations reported to date (e.g., Heintz et al. 2020). In the analyses that follow, we typically assume P(U) = 0 (Occam's razor!) and discuss the impacts of increasing it.

4.2. Candidate Priors P(O)

For the set of host candidates Oi , absent any assumptions on the distance or the typical separations of FRBs from their host galaxies, any galaxy on the sky could be a viable candidate. Given the plethora of potential models for FRB progenitors and the limited existing constraints, we are motivated to consider (for now) simple approaches to the prior for the galaxy candidates. The most agnostic approach is to assign an "identical" prior to every galaxy in consideration, i.e., Equation (10).

Inspired by the chance probability calculations approach described in Section 2, we introduce an additional prior based on Pc . Specifically, we consider a prior that inversely weights by Σ(m).

Equation (12)

For this "inverse" prior, brighter candidates have higher prior probability according to their number density on the sky. The normalization of these priors is set by Equation (4). In Section 5, we also briefly consider two other priors with P(Oi ) ∼ 1/Σ(mi )ϕ (inverse1) and P(Oi ) ∼ 1/Σ(mi )ϕ2 (inverse2). We adopt the prior in Equation (12) in part because of its simpler form and also because of the results of simulated experiments (Section 5).

Figure 2 illustrates P(Oi ) for the two models for two example cases—(a) FRB 180924 and (b) FRB 190523—where we assume P(U) = 0. For this illustration, we have restricted the analysis to galaxies within 10'' of the FRB, which captures all of the viable candidates. As expected, the inverse priors for FRB 180924 significantly favor the brighter galaxy closest to the FRB. Perhaps less intuitively, the inverse priors favor the most distant (yet brightest) source near FRB 190523. This motivates the inclusion of the next ingredient—the offset function p(ωOi ).

Figure 2.

Figure 2. Illustration of the two approaches to candidate priors P(O) assumed in this manuscript: identical and inverse. The former assumes an identical prior for every galaxy with θi < 10ϕ or within the 99.9% localization error of the FRB. The latter adopts the inverse of the estimated chance probability Pc of these galaxies, which is equivalent to asserting the prior is proportional to the integrated probability that the other sources are all chance coincidences.

Standard image High-resolution image

4.3. Offset Function p(ω∣Oi)

The p(ωOi ) function for the probability of the true angular offset from the galaxy is unknown yet required for the analysis. In our formalism, we develop priors for p(ωOi ) based solely on the angular offset θ between the galaxy centroid and ω, and also normalized by the observed galaxy's angular size ϕ. This simultaneously accounts for galaxies of different intrinsic size and differing observed size owing to their distance.

As the predominance of models associate FRBs to stellar sources or compact objects (active galactic nucleus at the very centers of galaxies are currently disfavored; Bhandari et al. 2020b), one might expect the FRB events to track the stellar light. While we wish to remain largely agnostic to the underlying distribution of offsets, we are physically motivated to presume p(ωOi ) decreases with increasing θ. We assert this despite the fact that geometrical considerations do favor large ω, e.g., a model where FRBs occur with identical probability anywhere in a circular galaxy will have p(ωOi ) ∝ ω until one reaches the "edge" of the galaxy. Therefore, a uniform prior $p(\omega | {O}_{i})=1/\pi {\theta }_{\max }^{2}$ is formally one that assumes FRBs occur proportional to the galaxy radius. The other two models considered are a core model:

Equation (13)

which implies an approximately 1/r2 weighting, and an exponential model

Equation (14)

which assumes an underlying exponential distribution. All of these functions are normalized to unity when integrating to ${\theta }_{\max }$, ignoring the curvature of the sky.

For all of the p(ωOi ) priors, we assert a maximum offset ${\theta }_{\max }=6\phi ;$ this is especially important for the uniform prior. This value is arbitrary and was chosen to be large enough to accommodate prevailing models of FRBs without being too conservative. Applying an arbitrary cutoff to the exponential distribution is not strictly necessary, but we keep it for simplicity and consistency, and demonstrate in Section 6.3 that the results for an exponential distribution are insensitive to this choice.

Figure 3 shows the offset functions, normalized to have the same total probability. Clearly the exponential model favors FRBs located in the inner regions of galaxies. For comparison to the offset distributions of known transients, we note that long GRBs appear highly concentrated in the inner regions of their hosts relative to Type Ib/c and IIn SNe, which occur preferentially near their host half-light radii (Lunnan et al. 2015; Blanchard et al. 2016). Conversely, short GRBs exhibit significant offsets from their host centers, indicative of progenitors born in compact object mergers (Fong & Berger 2013).

Figure 3.

Figure 3. Three offset functions p(ωOi ) considered for the underlying angular distribution of FRBs relative to their host galaxies, normalized by the galaxy's half-light radius ϕ. Each is normalized to have identical integrated area to a maximum assumed offset of ${\theta }_{\max }=6\,\phi $.

Standard image High-resolution image

In practice, we treat the galaxies as "round," i.e., ignoring for now any ellipticity. Future works will advance this aspect.

5. Simulations

To explore the formalism introduced here, we have generated Monte Carlo simulations designed to faithfully reproduce the FRB experiment. We describe this first and then detail the results.

5.1. Sandboxes

Our Monte Carlo approach leverages the public catalog of the Hubble Space Telescope/Cosmic Evolution Survey (HST/COSMOS) field (Scoville et al. 2007b, 2007a), which provides over 1 million sources at high spatial resolution and faint fluxes (median AB magnitude mr ≈ 25.9). These galaxies are distributed over an ≈1 deg2 of sky, which captures a great variety of distributions but not the most extreme events (e.g., z < 0.5 galaxy clusters, nearby galaxies). Figure 4 shows the angular size (a_image) and apparent magnitudes for the catalog, restricted to the sources labeled as galaxies.

Figure 4.

Figure 4. Distribution of angular sizes (a_image parameter) vs. apparent magnitude mr for all of the galaxies in the COSMOS catalog (Scoville et al. 2007a).

Standard image High-resolution image

We generate a series Monte Carlo realizations of FRBs (referred to as a sandboxes, or SBs), using the following recipe:

  • 1.  
    Define the true distribution of FRBs from their host galaxies p(ωOi ) and magnitude mr .
  • 2.  
    Define a sample of potential host galaxies based on mr .
  • 3.  
    Define the distribution of localization errors for FRBs (σFRB).
  • 4.  
    Draw NFRB galaxies from the parent sample of potential host galaxies without duplicates.
  • 5.  
    Set the true FRB positions according to p(ωOi ).
  • 6.  
    Offset the FRBs to an observed coordinate according to σFRB.
  • 7.  
    Consider catalog galaxies within 30'' to represent an image.

For sandbox 5 (SB-5), 10% of the FRBs were randomly placed in the COSMOS field, i.e., without a host galaxy. We plan to use this sandbox to evaluate the performance of the framework when the host galaxies are unseen. Also, as COSMOS is a deep survey, we generate a magnitude-limited catalog of galaxies for each sandbox (last column of Table 1) on which we run the Bayesian framework. In the following subsection, we discuss results for five sandboxes (focusing primarily on SB-1) with the parameters described in Table 1.

Table 1. Sandboxes

Label p(ωOi ) NFRB Sample σFRB Catalog
    ('')Filter
SB-1 ${ \mathcal U }(0,2\phi )$ 100,0001
SB-2 ${ \mathcal U }(0,2\phi )$ 46,699 mr = [20, 23] ${ \mathcal U }(0.1,1)$ mr ≤ 23
SB-3core46,699 mr = [20, 23] ${ \mathcal U }(0.1,1)$ mr ≤ 23
SB-4exponential46,699 mr = [20, 23] ${ \mathcal U }(0.1,1)$ mr ≤ 23
SB-5 a ${ \mathcal U }(0,2\phi )$ 50,000 mr = [20, 25] ${ \mathcal U }(0.1,1)$ mr ≤ 25

Note.

a See text for details regarding the FRB selection for this sandbox.

Download table as:  ASCIITypeset image

5.2. Analysis and Results

We now analyze the sandboxes listed in Table 1 with a variety of priors and assumed p(ωOi ) functions (that generally do not match the true p(ωOi )). Table 2 lists the various priors assumed for each analysis performed.

Table 2. Sandbox Analysis

Sandbox P(O) P(U) p(ωOi ) ${\theta }_{\max }$/ϕ f(T+secure)TP
SB-1Inverse0Exp60.330.96
SB-1Inverse10Exp60.300.99
SB-1Inverse20Exp60.321.00
SB-1Identical0Uniform60.221.00
SB-1Inverse0.05Exp60.240.96
SB-2Inverse0Uniform20.861.00
SB-3Inverse0Core60.581.00
SB-4Inverse0Exp60.680.99
SB-5Inverse0.10Exp60.580.98

Download table as:  ASCIITypeset image

Figure 5(a) shows the posteriors for the candidates using the fiducial sandbox (SB-1) and listed in row 1 of Table 2 (also referred to as the adopted prior set; Table 3). Since most of the candidates defined in step 7 are very far from the offset FRB position, we restrict results to the ≈8% of candidates with $P\left({O}_{i}| x\right)\gt 0.01$. This distribution is multimodal, with the overwhelming majority of recovered $P\left({O}_{i}| x\right)\approx 0$ corresponding to unassociated galaxies and another peak at $P\left({O}_{i}| x\right)\approx 1$ corresponding to secure associations. Figure 5(b) shows the posterior value for the most probable candidate for each of the 100,000 FRBs. For this model and analysis, ≈35% of the FRBs have a high probability, $P\left({O}_{i}| x\right)\gt 0.95$ which we adopt as a "secure" association. Adopting such an arbitrary value to define secure is useful for including/excluding candidates for subsequent analyses that rely on knowing the correct FRB host, e.g., that by Macquart et al. (2020). However, we emphasize that it is in general better to consider all host associations as uncertain, with different levels of certainty according to the obtained posteriors.

Figure 5.

Figure 5. Analysis of SB-1: (a) PDF for the posterior probabilities for all of the candidates with $P\left({O}_{i}| x\right)\gt 0.01$. The PDF is nicely bimodal, with 96% having $P\left({O}_{i}| x\right)\lt 10 \% $ or $P\left({O}_{i}| x\right)\gt 90 \% $. (b) PDF of the maximum posterior for the 100,000 simulated FRBs. We find ≈35% of the sources has $P\left({O}_{i}| x\right)\gt 0.95$, which we define as secure.

Standard image High-resolution image

Table 3. Prior Sets

Set P(O) P(U) p(ωOi ) ${\theta }_{\max }$/ϕ
ConservativeIdentical0Uniform6
AdoptedInverse0Exp6

Download table as:  ASCIITypeset image

Table 4. FRBs Analyzed

FRB αFRB δFRB epsilona epsilonb epsilonPA Filter
 (deg)(deg)('')('')(deg)
FRB 12110282.9945833.147920.100.100.0GMOS_N_i
FRB 18091629.5031365.716750.000.000.0GMOS_N_r
FRB 180924326.10523−40.900030.110.090.0VLT_FORS2_g
FRB 181112327.34846−52.970933.250.81120.2VLT_FORS2_I
FRB 190102322.41567−79.475690.540.470.0VLT_FORS2_I
FRB 190523207.0650072.469724.001.50340.0LRIS_R
FRB 190608334.01987−7.898250.260.2590.0VLT_FORS2_I
FRB 190611320.74546−79.397580.670.670.0GMOS_S_i
FRB 19061465.0755273.706740.800.4067.0LRIS_I
FRB 190711329.41950−80.358000.400.3190.0GMOS_S_i
FRB 190714183.97967−13.021030.360.2290.0VLT_FORS2_I
FRB 191001323.35155−54.747740.170.1390.0VLT_FORS2_I
FRB 200430229.7064212.376891.070.300.0LRIS_I

Note. epsilona , epsilonb , epsilonPA define the total 1σ error ellipse for the FRB localization data are taken from Ravi et al. (2019), Day et al. (2020), Law et al. (2020), Tendulkar et al. (2017), Marcote et al. (2020), and Heintz et al. (2020).

Download table as:  ASCIITypeset image

Figure 6 evaluates, in 10 bins of equal number of FRBs, the maximum $P\left({O}_{i}| x\right)$ assigned to a candidate for each FRB and the percentage of correct associations assuming this is the host. The different colors indicate different choices for P(O). Our adopted inverse prior appears well calibrated in that $P\left({O}_{i}| x\right)$ yields an accurate estimate of the fraction of FRBs correctly assigned to their host galaxy.

Figure 6.

Figure 6. The points are equal number (10,000) bins of FRBs according to the maximum posterior probability of their host candidates for SB-1. For each set of 10,000, we determined the fraction of correct assignments if one adopts the candidate with max $P\left({O}_{i}| x\right)$ as the host. The one-to-one line indicates a perfectly calibrated algorithm. The colors indicate different choices for the prior P(O): (blue) P(O) ∝ 1/Σ(m), inverse; (orange) P(O) ∝ 1/Σ(m)ϕ, inverse1; and (green) P(O) ∝ 1/Σ(m)ϕ2, inverse2. Our adopted prior (inverse) is relatively well normalized in that $P\left({O}_{i}| x\right)$ yields an accurate estimate of the fraction of FRBs correctly assigned to their host galaxy.

Standard image High-resolution image

Figure 7 shows another set of results but for more different choices of priors (Table 2). The remarkably close correspondence between the two quantities indicates the posterior is well calibrated, at least for this pairing of sandbox and prior set. We also show results for prior sets where we assume P(U) = 0.05 and for the conservative prior set (Table 3). Each of these assigns systematically lower values to the true host galaxy yielding a higher percentage of correct cases at lower maximum $P\left({O}_{i}| x\right)$.

To characterize the behavior of our method under different simulated truths (i.e., sandboxes) and different priors, Table 2 lists the fraction f of all FRBs that are correctly identified, i.e., the true (T) host is securely identified ($P\left({O}_{i}| x\right)\gt 0.95$) and the fraction of secure identifications that are correct. In all cases, at least 96% of all secure associations find the true host, indicating that our method is trustworthy. The fraction of FRBs expected to have such a secure association however varies significantly, primarily as a function of the sandbox (variation of ±0.3 in $f(T+\mathrm{secure})$), with the analysis method on a given sandbox having a secondary effect (variation of ±0.05). An analysis of SB-1 demonstrates that the choice between different inverse priors has little effect, but produces a higher fraction of secure associations than a uniform prior. Interestingly, $f(T+\mathrm{secure})$ is much higher for the SB-4 analysis than for SB-2 and SB-3, despite all three using an assumed p(ωOi ) of equal shape to the true p(ωOi ), and being otherwise identical. However, while the SB-4 analysis assumes p(ωOi ) to be uniform out to θ = 6ϕ, the true distribution is fully contained within 2ϕ, unlike SB-2 and SB-3. We thus conclude that the dominant determinant of the fraction of securely (and hence, correctly) identified FRB hosts, in the case that the true host is observed, is the fraction of FRBs lying in close proximity to their hosts, irrespective of other considerations. We find it especially reassuring that the results are not highly sensitive to the analysis method, i.e., that our formalism yields greater sensitivity to the physical truth than to our choice of reasonable priors.

What about unseen hosts? If we use P(U) = 0, as typically assumed in this work, then P(Ux) = 0 always, and the method will tend to assign the highest posterior $P\left({O}_{i}| x\right)$ to the closest galaxy regardless of distance. Using the 10% of hostless FRBs from SB-5, the conservative and adopted prior sets from Table 3 find secure associations for the majority of FRBs (55% and 62% respectively). However, the typical radial offset for these secure associations is very large. In Figure 8, we show the cumulative distribution of θ/ϕ for such candidates. The probability of the most likely candidate being close to the FRB is small, with 10% or less of such falsely identified hosts having θ/ϕ < 6. The distributions for secure associations is almost identical to that from nonsecure hosts. We conclude that measuring a small θ/ϕ is a strong discriminant against unseen hosts irrespective of p(U).

Figure 8.

Figure 8. Cumulative distribution of θ/ϕ for candidate galaxies in SB-5 where the true host is unseen, using conservative (green) and adopted (blue) priors. Lower panel is a zoom-in on the region related to our adopted maximum offset.

Standard image High-resolution image
Figure 7.

Figure 7. Same as Figure 6, but for different prior assumptions as described in the legend.

Standard image High-resolution image

Buoyed by these results, we now proceed to apply PATH to real FRB observations.

6. Real FRB Analysis and Results

Informed by the results in the previous section, we proceed to apply the formalism to all of the published, well-localized FRBs (Table 4). We discuss the results for two sets of priors—conservative and adopted—as summarized in Table 3. We refer to the first as conservative because all galaxies within ${\theta }_{\max }$ are given an equal prior.

6.1. FRB Host Candidates

Central to the analysis is the identification and analysis of galaxy candidates in imaging data. The first step—source identification—is the most challenging and the most subjective. For every image, sources near the detection limit are subject to the precise methodology: background subtraction, thresholding, pixel grouping, and deblending. After experimenting with the routines encoded in the photutils package, we settled on the following key parameters: npixels = 9, deblend=True, xy_kernel = (3, 3), Gaussian2Dkernel, nsig = 3. (kernel), nsig = 1.5 (threshold), background = (50, 50), and filter_size = (3, 3), median background.

To test these choices, we independently analyzed the data with the SExtractor package using a standard set of input parameters. Namely, we set DETECT_MINAREA = 9 and DETECT_THRES = 1.5 for consistency with the photutils parameters. Images are filtered with the default convolution kernel (default.conv). To recover blended sources (see, e.g., FRB 180924 below), we set DEBLEND_MINCONT = 0.0001.

With this set of parameters, we recover the centroid positions within ≈2%, while the aperture sizes show scatter up to ≈15%. We note that slight differences between the photutils and SExtractor methodologies may be driving these discrepancies; namely, while SExtractor uses a multi-thresholding deblending technique, photutils utilizes a combination of multi-thresholding and watershed segmentation. Furthermore, the default.conv convolution kernel is equivalent to a 3 × 3 Gaussian kernel with an FWHM = 2, in slight contrast to the Gaussian kernel used above. Nevertheless, we find comparable results using the two methods.

Figure 9 shows the segmentation maps of FRB 180924 and FRB 190523. Note the three, blended sources near the localization of FRB 180924 that are known to be unique galaxies at distinct redshifts (Bannister et al. 2019). An image with shallower depth (e.g., DES-DR1) or a different choice of photutils parameters would lead to the non-detection of the fainter sources. This highlights the subjectivity of source identification that can affect the final results.

Figure 9.

Figure 9. Segmentation images for the sources in 30'' cutouts around FRB 180924 (left) and FRB 190523 (right). The multitude of sources (several erroneous) at the top of the FRB 180924 image is due to artifacts from a very bright star.

Standard image High-resolution image

The source identification packages offer an assessment of the source shape (e.g., ellipticity and size), which can be used to used to select and then ignore Galactic stars. For the analysis that follows, we have simply clipped bright stars according to their apparent magnitudes when necessary.

Provided with the segmentation map, one may perform aperture photometry and estimate ϕ from the derived elliptical apertures. All of the measurements for the galaxy candidates are provided in Table 5.

Table 5. Results for FRB Associations

FRBR.A.cand Decl.cand θ ϕ m Filter Pc P(O) P(Ox) P(U) P(Ux)
Conservative
FRB 12110282.994533.14790.20.2823.52GMOS_N_i0.00390.10001.00000.00000.0000
 82.994233.14722.90.2821.14GMOS_N_i0.01130.10000.00000.00000.0000
 82.993533.14733.90.2324.18GMOS_N_i0.24870.10000.00000.00000.0000
 82.996033.14854.70.2523.28GMOS_N_i0.18180.10000.00000.00000.0000
 82.993933.14924.80.1325.06GMOS_N_i0.57400.10000.00000.00000.0000
 82.992333.14697.90.2821.58GMOS_N_i0.11690.10000.00000.00000.0000
 82.996833.14907.70.2422.91GMOS_N_i0.31950.10000.00000.00000.0000
 82.994833.15038.50.1524.95GMOS_N_i0.91010.10000.00000.00000.0000
 82.991833.14709.20.2523.53GMOS_N_i0.60510.10000.00000.00000.0000
 82.993733.145310.00.3120.19GMOS_N_i0.05010.10000.00000.00000.0000
FRB 18091629.501265.71487.73.0316.16GMOS_N_r0.00050.05881.00000.00000.0000
 29.505465.714010.50.5321.42GMOS_N_r0.17280.05880.00000.00000.0000
 29.509365.717910.00.2122.00GMOS_N_r0.25540.05880.00000.00000.0000
 29.499865.713014.40.6620.96GMOS_N_r0.20680.05880.00000.00000.0000
 29.508465.713912.70.2422.71GMOS_N_r0.58880.05880.00000.00000.0000
 29.491365.717417.61.0220.91GMOS_N_r0.28160.05880.00000.00000.0000
 29.506065.721116.30.4121.29GMOS_N_r0.33120.05880.00000.00000.0000
 29.499665.720313.70.3919.83GMOS_N_r0.06560.05880.00000.00000.0000
 29.512965.719217.10.3720.03GMOS_N_r0.12020.05880.00000.00000.0000
 29.513665.718116.30.4219.42GMOS_N_r0.05980.05880.00000.00000.0000
 29.505265.712515.70.4318.15GMOS_N_r0.01410.05880.00000.00000.0000
 29.512265.717113.50.4719.02GMOS_N_r0.02740.05880.00000.00000.0000
 29.501565.712415.70.1621.13GMOS_N_r0.27480.05880.00000.00000.0000
 29.493365.713818.00.3320.19GMOS_N_r0.15410.05880.00000.00000.0000
 29.511065.713616.40.1922.17GMOS_N_r0.60120.05880.00000.00000.0000
 29.502165.713013.60.5921.73GMOS_N_r0.34800.05880.00000.00000.0000
 29.502965.721818.30.2821.64GMOS_N_r0.50530.05880.00000.00000.0000
FRB 180924326.1042−40.90022.90.8124.27VLT_FORS2_g0.19730.25000.71720.00000.0000
 326.1054−40.90020.81.3121.32VLT_FORS2_g0.01180.25000.27790.00000.0000
 326.1062−40.89933.80.5025.47VLT_FORS2_g0.53900.25000.00490.00000.0000
 326.1017−40.89989.50.4625.30VLT_FORS2_g0.98070.25000.00000.00000.0000
FRB 181112327.3486−52.97090.40.6721.49VLT_FORS2_I0.02270.25000.75880.00000.0000
 327.3496−52.96965.41.0619.10VLT_FORS2_I0.00730.25000.24110.00000.0000
 327.3484−52.97297.00.5822.01VLT_FORS2_I0.16460.25000.00010.00000.0000
 327.3467−52.97277.40.3224.05VLT_FORS2_I0.66120.25000.00000.00000.0000
FRB 190102322.4149−79.47560.50.8620.73VLT_FORS2_I0.00380.50001.00000.00000.0000
 322.4173−79.47735.90.5522.54VLT_FORS2_I0.16230.50000.00000.00000.0000
FRB 190523207.064272.47063.40.7122.13LRIS_R0.11580.33330.61160.00000.0000
 207.065472.46815.80.6122.82LRIS_R0.29860.33330.37120.00000.0000
 207.058972.46916.90.7220.78LRIS_R0.06640.33330.01730.00000.0000
FRB 190608334.0203−7.89882.51.6617.60VLT_FORS2_I0.00050.25001.00000.00000.0000
 334.0185−7.89865.00.3024.83VLT_FORS2_I0.53730.25000.00000.00000.0000
 334.0186−7.89696.60.2625.28VLT_FORS2_I0.84610.25000.00000.00000.0000
 334.0187−7.89599.50.4522.76VLT_FORS2_I0.40810.25000.00000.00000.0000
FRB 190611320.7429−79.39732.00.5022.35GMOS_S_i0.02670.09090.94800.00000.0000
 320.7495−79.39723.10.2725.87GMOS_S_i0.52970.09090.03590.00000.0000
 320.7439−79.39853.30.2624.91GMOS_S_i0.34460.09090.01400.00000.0000
 320.7539−79.39795.70.6523.63GMOS_S_i0.35180.09090.00190.00000.0000
 320.7383−79.39774.80.3623.44GMOS_S_i0.22670.09090.00010.00000.0000
 320.7346−79.39888.40.5323.36GMOS_S_i0.50330.09090.00000.00000.0000
 320.7541−79.39657.00.1826.52GMOS_S_i0.99360.09090.00000.00000.0000
 320.7569−79.39919.40.5223.65GMOS_S_i0.66430.09090.00000.00000.0000
 320.7364−79.39989.90.4124.56GMOS_S_i0.91660.09090.00000.00000.0000
 320.7319−79.39709.20.2725.04GMOS_S_i0.95510.09090.00000.00000.0000
 320.7587−79.39869.50.1726.91GMOS_S_i1.00000.09090.00000.00000.0000
FRB 19061465.074373.70681.30.4124.01LRIS_I0.05520.20000.60320.00000.0000
 65.073873.70642.20.4022.79LRIS_I0.03860.20000.39680.00000.0000
 65.070573.70755.70.3324.26LRIS_I0.49490.20000.00000.00000.0000
 65.081773.70797.50.1726.35LRIS_I0.99450.20000.00000.00000.0000
 65.069173.70818.20.2825.21LRIS_I0.93870.20000.00000.00000.0000
FRB 190711329.4194−80.35810.50.4622.93GMOS_S_i0.01080.20000.88210.00000.0000
 329.4187−80.35862.10.2624.88GMOS_S_i0.14710.20000.11790.00000.0000
 329.4143−80.35704.70.2224.69GMOS_S_i0.46540.20000.00000.00000.0000
 329.4117−80.35715.70.2924.88GMOS_S_i0.65450.20000.00000.00000.0000
 329.4190−80.35955.30.2523.97GMOS_S_i0.36820.20000.00000.00000.0000
FRB 190714183.9795−13.02121.00.9519.48VLT_FORS2_I0.00120.20001.00000.00000.0000
 183.9797−13.01936.10.5423.71VLT_FORS2_I0.39250.20000.00000.00000.0000
 183.9787−13.02297.40.6021.22VLT_FORS2_I0.07720.20000.00000.00000.0000
 183.9797−13.02307.00.3124.36VLT_FORS2_I0.64940.20000.00000.00000.0000
 183.9795−13.02348.70.5022.71VLT_FORS2_I0.34250.20000.00000.00000.0000
FRB 191001323.3525−54.74873.91.3617.82VLT_FORS2_I0.00090.33330.54120.00000.0000
 323.3492−54.74835.31.4717.85VLT_FORS2_I0.00150.33330.45880.00000.0000
 323.3501−54.74967.30.2725.11VLT_FORS2_I0.86900.33330.00000.00000.0000
FRB 200430229.706412.37660.90.7221.19LRIS_I0.00560.50001.00000.00000.0000
 229.708812.37788.90.3824.82LRIS_I0.91230.50000.00000.00000.0000
Adopted
FRB 12110282.994533.14790.20.2823.52GMOS_N_i0.00390.02451.00000.00000.0000
 82.994233.14722.90.2821.14GMOS_N_i0.01130.20260.00000.00000.0000
 82.993533.14733.90.2324.18GMOS_N_i0.24870.01440.00000.00000.0000
 82.996033.14854.70.2523.28GMOS_N_i0.18180.02980.00000.00000.0000
 82.993933.14924.80.1325.06GMOS_N_i0.57400.00730.00000.00000.0000
 82.992333.14697.90.2821.58GMOS_N_i0.11690.13320.00000.00000.0000
 82.996833.14907.70.2422.91GMOS_N_i0.31950.04080.00000.00000.0000
 82.994833.15038.50.1524.95GMOS_N_i0.91010.00790.00000.00000.0000
 82.991833.14709.20.2523.53GMOS_N_i0.60510.02420.00000.00000.0000
 82.993733.145310.00.3120.19GMOS_N_i0.05010.51520.00000.00000.0000
FRB 18091629.501265.71487.73.0316.16GMOS_N_r0.00050.82001.00000.00000.0000
 29.505465.714010.50.5321.42GMOS_N_r0.17280.00260.00000.00000.0000
 29.509365.717910.00.2122.00GMOS_N_r0.25540.00150.00000.00000.0000
 29.499865.713014.40.6620.96GMOS_N_r0.20680.00400.00000.00000.0000
 29.508465.713912.70.2422.71GMOS_N_r0.58880.00080.00000.00000.0000
 29.491365.717417.61.0220.91GMOS_N_r0.28160.00420.00000.00000.0000
 29.506065.721116.30.4121.29GMOS_N_r0.33120.00290.00000.00000.0000
 29.499665.720313.70.3919.83GMOS_N_r0.06560.01230.00000.00000.0000
 29.512965.719217.10.3720.03GMOS_N_r0.12020.01010.00000.00000.0000
 29.513665.718116.30.4219.42GMOS_N_r0.05980.01900.00000.00000.0000
 29.505265.712515.70.4318.15GMOS_N_r0.01410.07630.00000.00000.0000
 29.512265.717113.50.4719.02GMOS_N_r0.02740.02910.00000.00000.0000
 29.501565.712415.70.1621.13GMOS_N_r0.27480.00340.00000.00000.0000
 29.493365.713818.00.3320.19GMOS_N_r0.15410.00860.00000.00000.0000
 29.511065.713616.40.1922.17GMOS_N_r0.60120.00130.00000.00000.0000
 29.502165.713013.60.5921.73GMOS_N_r0.34800.00190.00000.00000.0000
 29.502965.721818.30.2821.64GMOS_N_r0.50530.00210.00000.00000.0000
FRB 180924326.1054−40.90020.81.3121.32VLT_FORS2_g0.01180.87230.98890.00000.0000
 326.1042−40.90022.90.8124.27VLT_FORS2_g0.19730.06830.01110.00000.0000
 326.1062−40.89933.80.5025.47VLT_FORS2_g0.53900.02780.00000.00000.0000
 326.1017−40.89989.50.4625.30VLT_FORS2_g0.98070.03160.00000.00000.0000
FRB 181112327.3486−52.97090.40.6721.49VLT_FORS2_I0.02270.07840.83000.00000.0000
 327.3496−52.96965.41.0619.10VLT_FORS2_I0.00730.86460.17000.00000.0000
 327.3484−52.97297.00.5822.01VLT_FORS2_I0.16460.04840.00000.00000.0000
 327.3467−52.97277.40.3224.05VLT_FORS2_I0.66120.00860.00000.00000.0000
FRB 190102322.4149−79.47560.50.8620.73VLT_FORS2_I0.00380.84251.00000.00000.0000
 322.4173−79.47735.90.5522.54VLT_FORS2_I0.16230.15750.00000.00000.0000
FRB 190523207.064272.47063.40.7122.13LRIS_R0.11580.19740.81530.00000.0000
 207.065472.46815.80.6122.82LRIS_R0.29860.10700.17770.00000.0000
 207.058972.46916.90.7220.78LRIS_R0.06640.69560.00700.00000.0000
FRB 190608334.0203−7.89882.51.6617.60VLT_FORS2_I0.00050.99301.00000.00000.0000
 334.0185−7.89865.00.3024.83VLT_FORS2_I0.53730.00100.00000.00000.0000
 334.0186−7.89696.60.2625.28VLT_FORS2_I0.84610.00070.00000.00000.0000
 334.0187−7.89599.50.4522.76VLT_FORS2_I0.40810.00530.00000.00000.0000
FRB 190611320.7429−79.39732.00.5022.35GMOS_S_i0.02670.33240.99900.00000.0000
 320.7495−79.39723.10.2725.87GMOS_S_i0.52970.02060.00060.00000.0000
 320.7439−79.39853.30.2624.91GMOS_S_i0.34460.04120.00040.00000.0000
 320.7539−79.39795.70.6523.63GMOS_S_i0.35180.11160.00010.00000.0000
 320.7383−79.39774.80.3623.44GMOS_S_i0.22670.13040.00000.00000.0000
 320.7346−79.39888.40.5323.36GMOS_S_i0.50330.13920.00000.00000.0000
 320.7541−79.39657.00.1826.52GMOS_S_i0.99360.01330.00000.00000.0000
 320.7569−79.39919.40.5223.65GMOS_S_i0.66430.11020.00000.00000.0000
 320.7364−79.39989.90.4124.56GMOS_S_i0.91660.05360.00000.00000.0000
 320.7319−79.39709.20.2725.04GMOS_S_i0.95510.03730.00000.00000.0000
 320.7587−79.39869.50.1726.91GMOS_S_i1.00000.01030.00000.00000.0000
FRB 19061465.074373.70681.30.4124.01LRIS_I0.05520.19440.58250.00000.0000
 65.073873.70642.20.4022.79LRIS_I0.03860.53350.41750.00000.0000
 65.070573.70755.70.3324.26LRIS_I0.49490.15930.00000.00000.0000
 65.081773.70797.50.1726.35LRIS_I0.99450.03500.00000.00000.0000
 65.069173.70818.20.2825.21LRIS_I0.93870.07790.00000.00000.0000
FRB 190711329.4194−80.35810.50.4622.93GMOS_S_i0.01080.47820.99950.00000.0000
 329.4187−80.35862.10.2624.88GMOS_S_i0.14710.10100.00050.00000.0000
 329.4143−80.35704.70.2224.69GMOS_S_i0.46540.11630.00000.00000.0000
 329.4117−80.35715.70.2924.88GMOS_S_i0.65450.10100.00000.00000.0000
 329.4190−80.35955.30.2523.97GMOS_S_i0.36820.20360.00000.00000.0000
FRB 190714183.9795−13.02121.00.9519.48VLT_FORS2_I0.00120.79981.00000.00000.0000
 183.9797−13.01936.10.5423.71VLT_FORS2_I0.39250.01560.00000.00000.0000
 183.9787−13.02297.40.6021.22VLT_FORS2_I0.07720.13930.00000.00000.0000
 183.9797−13.02307.00.3124.36VLT_FORS2_I0.64940.00930.00000.00000.0000
 183.9795−13.02348.70.5022.71VLT_FORS2_I0.34250.03610.00000.00000.0000
FRB 191001323.3525−54.74873.91.3617.82VLT_FORS2_I0.00090.50740.71740.00000.0000
 323.3492−54.74835.31.4717.85VLT_FORS2_I0.00150.49200.28260.00000.0000
 323.3501−54.74967.30.2725.11VLT_FORS2_I0.86900.00050.00000.00000.0000
FRB 200430229.706412.37660.90.7221.19LRIS_I0.00560.95661.00000.00000.0000
 229.708812.37788.90.3824.82LRIS_I0.91230.04340.00000.00000.0000

Download table as:  ASCIITypeset images: 1 2 3

6.2. FRB Assignments

We then applied the PATH framework. The primary results are displayed in Figure 10, which summarizes the $P\left({O}_{i}| x\right)$ values for the candidates in each field for each set of priors. We find very similar results for the two prior sets with one obvious exception: FRB 180924. In this case, there are two galaxies with separation $\theta \lt {\theta }_{\max }$ from this precisely localized FRB. These are treated similarly by the conservative approach. We demonstrate below, however, that a uniform p(ωOi ) function with ${\theta }_{\max }=6\phi $ is disfavored by the data. Imposing the exponential offset model yields a higher $P\left({O}_{i}| x\right)$ for the primary candidate. Furthermore, if we allow for the great difference in apparent magnitude by invoking the inverse P(O) prior, the posterior $P\left({O}_{i}| x\right)$ raises to near unity for the host reported by Bannister et al. (2019).

Figure 10.

Figure 10. Posterior probabilities $P\left({O}_{i}| x\right)$ for the most likely host (solid bars) and all other candidates (open bars) with $P\left({O}_{i}| x\right)\gt {P}_{\mathrm{secure}}=0.95$, as a function of prior set (green=conservative; black=adopted). With ${P}_{\mathrm{secure}}=0.95$ as the probability for a secure association, there are currently nine FRBs satisfying this criterion using the adopted prior set. The nonsecure hosts occur for a variety of reasons as described in the text.

Standard image High-resolution image

Based on the results from analysis of mock fields (Section 5), we adopt a probability threshold ${P}_{\mathrm{secure}}=0.95$ above which we consider a host association to be highly secure. The results in Figure 10 indicate that nine of the FRBs are associated with a single galaxy with $P\left({O}_{i}| x\right)\gt 0.95$ for the adopted prior set (and eight for the conservative set with FRB 180924 the difference).

The nonsecure hosts deserve individual consideration, in part to understand the dependence of the formalism to observed variations on the sky. FRB 181112 shows two bright galaxies near the FRB with the brighter assumed to lie in the foreground (Prochaska et al. 2019). The PATH analysis for the purported host gives $P\left({O}_{i}| x\right)\approx 0.7\mbox{--}0.9$, depending on the choice of priors (Table 3). We emphasize that the majority of FRB sightlines that intersect a massive foreground halo (estimated to be a few percent for FRBs at z > 0.5), will tend to have a maximum $P\left({O}_{i}| x\right)\lt {P}_{\mathrm{secure}}$. Given the terrific scientific value of probing such halos with FRBs (Prochaska et al. 2019), one may need to introduce additional criteria/priors to confidently pursue this science.

The next, nonsecure FRB association is FRB 190523 whose larger localization error incorporates several candidates. The analysis, however, does favor the purported host reported by Ravi et al. (2019). Third is FRB 190614, which lies near (≲2'') two faint galaxies with unknown redshifts (Law et al. 2020). As the FRB with the highest DM and therefore the highest presumed redshift of the sample, this result emphasizes the likely challenges of associating high-z FRBs to galaxies. In particular, given the host itself is likely faint, the incidence of additional, chance associations with comparable $P\left({O}_{i}| x\right)$ will be higher. Last is FRB 191001, which sits next to two bright galaxies known to have a common redshift (Bhandari et al. 2020a). Therefore, the redshift of the FRB is secure, but the host offset and its internal properties (e.g., stellar mass) are currently based on the assumption that the closer galaxy is the host, and indeed, it exhibits a 3× higher $P\left({O}_{i}| x\right)$ value for the adopted prior set.

6.3. Toward Additional Priors

Having established a set of nine secure, $P\left({O}_{i}| x\right)\gt {P}_{\mathrm{secure}}\,=0.95$, host associations we may test the assumed p(ωOi ) functions imposed in the analysis. Figure 11 shows the offset distribution for the secure hosts for the three p(ωOi ) priors of the analysis (Section 4.3). Note that modifying the choice of p(ωOi ) could include/exclude FRBs as being secure. The figure also shows the values of θ/ϕ derived for all candidates from the full set of FRBs, where we have weighted the θ/ϕ value of each candidate by $P\left({O}_{i}| x\right)$. Overall, the posteriors lend reasonable credibility to the set of p(ωOi ) functions. On the other hand, a comparison of the secure distribution with the priors yields a one-sided Kolmogorov–Smirnov (K-S) probability PK-S ≲ 0.1 and we rule out the uniform prior to ${\theta }_{\max }=6\phi $ at > 99%. The data appear to favor a p(ωOi ) function that favors a central concentration for FRB locations. Additionally, such small values of θ/ϕ are unlikely when the true host galaxy is unseen (see Figure 8), being <3% for the seven secure hosts with θ/ϕ < 1.5. Since all FRBs have most likely candidates with θ/ϕ < 5, we conclude that no more than one of the FRBs considered can have an unseen host (p ≲ 0.01).

Figure 11.

Figure 11. The solid histogram shows the distribution of separations for the secure host galaxies, in units of their angular size (ϕ). The gray histogram is for all of the candidate galaxies but weighted by their posterior probabilities $P\left({O}_{i}| x\right)$. These results were derived by assuming the adopted prior set (Table 3) and by varying the offset function p(ωOi ), as labeled in each panel. Overplotted on the histogram is the offset function both before (semitransparent) and after convolving with the FRB localization error (solid). The data rule out the uniform offset function that extends to ${\theta }_{\max }=6\phi $ at ≈99% CL (using a one-sided K-S test).

Standard image High-resolution image

Encoded in every FRB is its DM, the path integral of free electrons along the sightline weighted by the cosmological scale factor. The first ≈10 FRBs localized have established a firm correlation between DM and redshift, now termed the Macquart relation (Macquart et al. 2020). This relation derives from the ionized plasma that permeates the cosmic web. Because redshift (i.e., distance) affects observed properties, one should consider incorporating the DM into the association analysis. A full and proper treatment, however, requires including the intrinsic luminosities and spectral slopes of FRBs convolved with instrumental sensitivity and even the triggering software (C. James et al. 2021, in preparation). Further, we emphasize that adopting the Macquart relation as a prior would likely require estimating the redshift for every galaxy candidate; this will be intractable for many FRBs.

In lieu of a direct application of the Macquart relation, we propose to leverage the luminosity distribution (Figure 12) together with DM. Figure 13 presents the apparent magnitudes of each galaxy candidate against an estimate of the cosmic DM, DMcosmic, with the point size proportional to $P\left({O}_{i}| x\right)$. For DMcosmic, we adopt a simple estimation:

Equation (15)

with DMMW,ISM the estimated ISM DM (Cordes & Lazio 2002) and the factor of 100 DM units accounts for the Galactic halo and the host galaxy (see Prochaska & Zheng 2019; Macquart et al. 2020). The locus of data exhibits a clear correlation reflecting the decrease in observed galaxy flux with increasing distance.

Figure 12.

Figure 12. (Top) scatter plot of apparent magnitudes vs. redshift for the nine host galaxies. These show an expected decrease in flux with increasing distance. The dashed line marks the approximate apparent magnitude for an L* galaxy. (Bottom) estimated galaxy luminosity relative to the characteristic luminosity L* at the host redshift. The secure hosts have a median L/L* ≈ 1/4 and an rms scatter of 0.5 dex.

Standard image High-resolution image
Figure 13.

Figure 13. Using the adopted prior set (Table 3), we show the apparent magnitudes of each candidate with $P\left({O}_{i}| x\right)\gt 0.01$ scaling the point size by $P\left({O}_{i}| x\right)$. These are plotted against DMcosmic, an estimate of the dispersion measure of the FRB due to the cosmic web (see text for details). The black curve shows the estimated mr for an L = L*/4 galaxy by converting DMcosmic to redshift (using the Macquart relation). The fact that the mr values track this curve implies an underlying Macquart relation.

Standard image High-resolution image

Converting the DMcosmic estimates to redshift 13 , we may convert a given Lr luminosity to mr ; this is illustrated as the black curve in Figure 13 , where we assumed a fiducial L = L*/4 based on Figure 12. The good correspondence between the data and this curve reveals the Macquart relation without having used any direct redshift measurements.

In principle, one could construct a prior P(mr ∣DM) to include in the analysis. This will, however, be subject to scatter in the DMcosmic (Macquart et al. 2020), DMhost, and the intrinsic luminosities of the host galaxy population (Figure 12). It will also be subject, however, to the S/N considerations that affect any prior related to DM (James et al. 2020).

7. Future Directions and Analyses

This paper and the accompanying code base 14 provide a new methodology to make probabilistic associations of transients to hosts (PATH). While we were motivated by FRB science, the general framework is agnostic to transient type. Therefore, we anticipate it will be applied to GRBs, GW events, Type Ia SNe, and many other transients. We stress further that because it is fully probabilistic, the outputs may be coupled to other likelihood frameworks developed to constrain, e.g., progenitor models or cosmology.

Applied to 13 well-localized FRBs, our results identify nine secure host galaxies, with posterior probabilities >0.95 of being the true host. We have shown using a suite of sandbox simulations that this identification is reliable under a wide range of true FRB host galaxy distributions. This allows a reliable data set to be used when analyzing host galaxy properties, or using FRBs for cosmology. Furthermore, by assigning a quantitative probability to individual hosts, we allow even nonsecure hosts associations to be used for statistical purposes, with appropriate weighting.

Using these data sets, we tentatively identify relations between FRB DMs, and host galaxy redshifts, magnitudes, and luminosities. Our results disfavor FRBs as having large offsets from their host galaxies, and we exclude more than one FRB considered as having an unseen host (p ≲ 0.01). Thus, we can conclusively answer the oft-asked question "could the true host galaxies be missed?" with a "no."

Regarding FRBs, future work will include: (i) leveraging the next set of ∼10 FRBs to further refine the priors and offset function; and (ii) expanding the formalism to include additional observables. The latter may require obtaining additional data or performing additional analyses (e.g., photo-z estimates) than the simple flux and angular sizes considered here. One may also introduce and test priors motivated by progenitor models. Last, the analysis can inform observing strategies to optimize the probability of a secure association as a function of anticipated FRB redshift, localization error, and imaging quality.

We thank C. Kilpatrick and J. Bloom for helpful discussions. The Fast and Fortunate for FRB Follow-up team acknowledges support from NSF grants AST-1911140 and AST-1910471. A.T.D. is the recipient of an ARC Future Fellowship (FT150100415). K.A. acknowledge support from NSF grant AAG-1714897. T.B. gratefully acknowledges support from the NSF via grants AST-1909709 and AST-1814778. C.W.J. acknowledges the support of the Australian Government through the Australian Research Council's Discovery Projects funding scheme (project DP210102103). We thank S. Ryder and L. Marnoch for sharing the reduced VLT/FORS2 image around FRB 190608 in advance of publication. This work is partly based on observations collected at the European Southern Observatory under ESO programmes 0102.A-0450(A), 0103.A-0101(A), 0103.A-0101(B), and 105.204W.001.

Footnotes

Please wait… references are loading.
10.3847/1538-4357/abe8d2