1 Introduction

In the past 20 years more than 4000 exoplanets have been detected using space and ground based surveys, and many more are expected to be discovered in the coming years thanks to space missions such as TESS [1], CHEOPS [2], PLATO [3] and GAIA [4], and to ground instrumentation such as HARPS [5], HATnet [6], WASP [7], KELT [8], OGLE [9], NGTS [10] and many others.

Planets have been found to be ubiquitous in our Galaxy, have been detected around almost every type of star and [11] infer that on average every star in our Galaxy hosts one planetary companion. The exoplanets detected thus-far show a diversity in their masses, sizes, orbits, and, presumably, physical and chemical conditions unseen among the planets in our own Solar System.

However, the essential nature of these exoplanets remains elusive. We have little idea whether the planet chemistry is linked to the formation environment or whether the type of host star drives the physics and chemistry of the planet’s birth, and evolution [12].

Atmospheric spectroscopy holds the key to unlock the mysteries of the chemical and physical conditions of these alien worlds as well as their formation and evolutionary histories. Multi-band photometry and spectroscopy of transiting exoplanets [13] is currently one of the most effective observational techniques for revealing the chemistry and thermodynamics of exoplanet atmospheres [14,15,16,17,18,19]. Photometric and spectroscopic light-curves of transiting exoplanets provide a measurement of the transmission (transit) or emission (eclipse) spectrum of an exo-atmosphere, and can be used to reveal chemical constituents, as well as the pressure and temperature profile, using retrieval techniques originally developed for the study of the Earth and Solar System planets, and adapted to the new field of investigation [e.g., 20,21,22,23].

Existing astronomical instrumentation has allowed to study spectroscopically a few tens of exoplanets, selected among those that are more easily observable based on their sizes and temperatures, and over a limited spectral range [e.g. 18, 24]. A significantly larger population study is now required in order to decipher the secrets of the exoplanets and their diversity.

The Atmospheric Remote-Sensing Infrared Exoplanet Large-survey, Ariel, has been selected by ESA as the next medium class mission of the Cosmic Vision programme to spectroscopically characterise the atmospheres of a large and diverse sample of exoplanets. Ariel will largely focus on warm and hot exoplanets, taking advantage from their well mixed atmospheres that show minimal condensation and sequestration of high atomic weight metals such as C, O, N, S, Si. The Ariel science payload uses a 1-m class telescope to feed a multi-band photometer and spectrometers covering the wavelength range from 0.5 μ m to 7.8 μ m, to sample both the peak thermal emission of the exoplanet atmospheres, and the spectral signature of all major atmospheric gases (e.g. H2O, CO, CO2, NH3, CH4, HCN, H2S, TiO, VO) and condensed species.

The Ariel payload design is investigated using detailed simulations of the astrophysical detection, that take into account mission design parameters such as flight duration and sky availability, and payload and astrophysical uncertainties. Margins are used on each estimate to ensure all performance predictions are derived under reasonably conservative assumptions.

In this work we describe ArielRad, the Ariel radiometric model simulator used to assess the payload science performance and to demonstrate its compliance with the science requirements. ArielRad is the third simulation tool developed to assess the mission performance and follows ExoSim [25], and the Ariel ESA Radiometric Model (AERM), developed by the Space Agency [26] to support the flow down of science requirements to instrument requirements the radiometric model during the Ariel phase/A study [12].

ExoSim is a end-to-end, time-domain simulator of Ariel observations. It evaluates photometric and spectroscopic light curves implementing a detailed description of the instrument design, source of uncertainties, and systematics of instrument and astrophysical origin. As such, the simulated observations produced by ExoSim are similar to those expected with Ariel and require a full data reduction pipeline to detrend the observations and reconstruct the planet spectrum. ExoSim has been fully validated using noise modelling as well as real measurements obtained with Hubble Space Telescope. ExoSim simulations allow us to study effects of spatially and temporally correlated noise processes such as the photometric uncertainties arising from the jitter of the line of sight, or from the activity of the host star. However, ExoSim analyses are computationally intensive and it is currently impractical to conduct studies on more than a few targets, until a fully automated data reduction pipeline is developed and validated in the next phase of the project. AERM overcomes this limitation implementing a simplified approach based on a radiometric modelling of the detection and of the uncertainties. These simplifications mean it is capable of assessing the confidence limit on the detection of emission and transmission spectra of hundreds of exoplanet targets. AERM implements a noise model calibrated using ExoSim estimates and delivers compatible estimates on test targets. However, the noise model implemented in AERM is a two parameters model that falls short in capturing the complexity of the Ariel payload design. Consequently, AERM provides an overly pessimistic prediction on some targets which makes this simulator unsuitable in assessing instrument design solutions.

ArielRad has been written to derive payload requirements from science requirements through detailed error budgeting, to validate the compliance of the payload design with the science requirements, and to optimise the payload design evaluating instrument design solutions over a proposed target list comprising about 1000 exoplanets. ArielRad overcomes the limitations of AERM by implementing a detailed payload model, similar to that used by ExoSim, capable of describing all major instrument components. ExoSim computing limitations are overcome in ArielRad by implementing radiometric estimates of the uncertainties of the detection. Noise contributions that need to be estimated in the time domain, such as the photometric noise arising from the jitter of the line-of-sight, are imported in ArielRad from pre-computed ExoSim estimates. ArielRad is used to create and maintain the top level payload performance error budgets, allowing a balanced allocation of resources across the payload.

In this work we describe ArielRad, the models implemented, their validation, and provide examples of how ArielRad can be used to support the Ariel mission development, leaving to a future work a detailed assessment of the Ariel mission performance.

2 Ariel instrument design and observational strategy

The Ariel payload design is briefly described in this section with more details available in [12, 27, 28], and in the Ariel Assessment Study ReportsFootnote 1. The telescope is an off-axis 0.63m2 Cassegrain with an elliptical primary mirror, that provides diffraction limited performance at wavelengths longer than 3μm; there is no need for imaging capabilities and the telescope is cooled to less than 70K. A refocusing mechanism actuates the secondary mirror to correct possible misalignments, that can occur at launch and during thermalisation.

The flux collected by the primary aperture feeds two separated instrument modules. A dichroic mirror splits the light into two beams, one at wavelengths shorter than 1.95μm and the second at wavelengths between 1.95μm and 7.8μm. The first beam is fed to an instrument module containing three photometers (VISPhot, 0.5μm − 0.6μm; FGS-1, 0.6μm − 0.80μm; FGS-2, 0.80μm − 1.1μm) and a slit-less prism spectrometer (NIRSpec, 1.1μm − 1.95μm) with spectral resolving power > 15. The two photometers, FGS1 and FGS2, operate as Fine Guidance Sensors (FGS), providing both photometric and pointing information for the attitude and orbital control system (AOCS). The second instrument module, fed by the longer wavelength beam, hosts the Ariel Infrared Spectrometer (AIRS), which consists of two prism-dispersed channels: Channel 0 (CH0) covering the 1.95μm − 3.9μm band with a spectral resolving power larger than 100, and Channel 1 (CH1) covering the 3.9μm − 7.8μm band with a spectral resolving power larger than 30. The spectrometers have field stops (slits) at an intermediate image plane, that are wider than the telescope Point Spread Function (PSF) and are used to limit the stray-light and the diffuse emission from reaching the focal plane.

During its four years primary mission, Ariel will observe \(\sim 1000\) planets. To maximise the scientific return, the mission implements a three-tier observational and data analysis approach, where three different sample spectra are analysed with optimised spectral resolutions, wavelength intervals and Signal-to-Noise ratios (SNR). A summary of this observational strategy is given in Table 1 with the first tier (Tier 1) being a low spectral resolution reconnaissance survey of \(\sim 1000\) planets to address science questions on a large population of targets, such as to the fraction of planets covered by clouds or of small planets that have retained a H/He atmosphere. The second tier, Tier 2, which consists of \(\sim 50 \%\) of the planets from Tier 1, are studied with a higher spectral resolution, merging the data collected in two juxtaposed spectral bins. Tier 2 analysis searches for potential correlations between atmospheric chemistry and basic parameters such as planetary size, density, temperature, stellar type and metallicity. It allows investigations of chemical abundances, cloud characterisation and elemental composition. Finally, \(\sim 10 \%\) of Tier 1 planets are re-observed multiple times in Tier 3, and the data are analysed using the full spectral resolution provided by the payload to gain detailed knowledge of the planetary chemistry dynamics and temporal variability of the exoplanet atmospheres.

Table 1 Summary of the science addressed in each tier

3 The ArielRad simulator

ArielRad is a radiometric simulator of the Ariel payload, it is implemented in Python and it is maintained by the Ariel Consortium. The software package comes with an exhaustive documentation and the primary inputs are contained in a XML configuration file describing the payload, an XML configuration file describing the mission parameters, and a CSV or spreadsheet table containing a list of target exoplanetary systems with their parameters. An instrument independent version of the radiometric simulator called ExoRad is publicly available on GitHubFootnote 2.

The simulator evaluates the payload science performance by estimating the expected experimental uncertainties on measured exoplanet atmospheric spectra in emission and transmission. Each simulation starts with the generation of the source signal from the star. The signal is then propagated through the instrument to the detector focal planes (assuming no field stars), accounting for the transmission of each optical component and the dispersion of the prism spectrometers.

Uncertainty estimates account for detector noise (readout noise and gain noise, the latter arising from variations in acquisition electronic chain), dark current, photon noise from the star, the zodiacal background and includes instrument emission, and jitter noise. Margins are included to account for uncertainties in the current noise estimates and instrument performances. The simulation continues by estimating the planet atmospheric signal and thus the SNR of the detection when observing the target in transit or eclipse. Every candidate target observation is considered to last 2.5 times the planet transit time, i.e. the time between the first and the last contact between the planetary and the stellar disks, T14. This strategy allows the collection of data both in and out of transit for the light curve fit and the transit depth estimation. This parameter can be configured by the user as well as all other parameters described later. The data is then binned according to the tier resolution, as will be done in Ariel data analysis. Tier 1 uses a low resolution spectroscopy (4 spectral resolution elements covering 1.1 − 7.80μm); Tier 2 has spectral resolution \(R \sim 10\) for 1.1 < λ < 1.90μm, \(R \sim 50\) for 1.95 < λ < 3.90μm and \(R \sim 15\) for 3.90 < λ < 7.80μm; Tier 3 uses the full R, that means R = 100,30 for AIRS-CH0 and AIRS-CH1 respectively, and for NIRSpec, as the requirement is R > 15, we use R = 20. Finally, ArielRad estimates the number of observations required to achieve a desired SNR for each planet (e.g. SNR = 7) and from this the total observing time required on target. This process is summarised graphically in Fig. 1 and each stage described in detail in the following sections.

Fig. 1
figure 1

Simulator work flow. The simulation starts with two input files: a payload configuration file and a candidate planet list. ArielRad propagates the target host star signal thought the payload, then evaluates the noise. Finally, the simulator estimates the transit or eclipse observation and the resultant SNR

3.1 Source model

ArielRad models the target star using a grid of synthetic Phoenix spectra from [29]. Given a target, ArielRad selects the Phoenix spectrum that refers to a source with the most similar temperature, surface gravity and metallicity to the target star. The Spectral Energy Density (SED) of the target star at the telescope input is evaluated as

$$ S(\lambda_{k}) = \frac{1}{\lambda_{k} - \lambda_{k-1}}\frac{R_{\star}^{2}}{D^{2}}{\int}_{\lambda_{k-1}}^{\lambda_{k}}S_{Ph}(\lambda)d\lambda $$
(1)

where \(\lambda _{k} = \left (1+ \frac {1}{6000} \right )^{k} \lambda _{min}\) is a logarithmic spaced wavelength grid, determined starting from the sample spectral resolution of 6000 and the minimum wavelength λmin. This binning reduces the spectral resolution of the input spectra for computational efficiency, while preserving the total power. R is the target star radius, D is the distance and SPh the input Phoenix spectrum. The units of S(λk) are those of a spectral energy density. A table for acronyms and symbols used in the text is available in Appendix 1, Table 3.

3.1.1 Diffuse radiation

ArielRad includes contributions from the Zodiacal background, and instrument thermal emission.

The Zodiacal background emission is modelled using a modified version of the JWST-MIRI Zodiacal model [30], scaled according to the target position in the sky and the Zodi model of [31]:

$$ I_{zodi}(\lambda) = A \cdot 3.5 10^{-14} \cdot BB(\lambda, 5500K) + B \cdot 3.58 10^{-8} \cdot BB(\lambda, 270K) $$
(2)

where BB is the Planck law, and A and B are the fitted coefficients. Typically, A and B evaluate to \(\sim 0.9\) at ecliptic poles, \(\sim 8\) at a Solar elongation angle of 55 deg and at an ecliptic latitude of 0 deg, and \(\sim 2.5\) on average. Therefore A = B = 2.5 is assumed when target position coordinates are not available [32].

Instrument thermal emission, Iinst, is estimated by modelling each of the optical element as a Planck law at the element operating temperature and modified by a wavelength dependent emissivity. All the reflective surfaces are made in aluminium, and we conservatively assume a 3% emissivity. Same emissivity is assumed for the refractive components. All optical elements are passively cooled to 60K. Because of the almost isothermal Ariel design, the total instrument emission from optics is computed from the emission of one component, multiplied by the number of optical components along the light path.

The exception is the contribution of the detector box, IInner, as the flux is not coming from the instrument field of view (FoV) but instead directly irradiates the pixels from all directions. This contribution is referred to as the “Inner Sanctum” and it is estimated as the radiation emitted from a black body cavity at the detector operating temperature (i.e. unit emissivity is assumed).

3.2 Payload configuration

The payload is modelled using the current best estimates from the instrument system engineers. The transmission ΦY(λ), where Y is one of VISPhot, FGS1, FGS2, NIRSpec, AIRS-CH0 or AIRS-CH1, is obtained by simulating the light path from the telescope to the detectors through the optics. The detector quantum efficiency QEY(λ) is also dependent on wavelength and defined for each channel. The photon conversion efficiency (PCE) is the product between transmission and quantum efficiency and it is shown in Fig. 2. The lower PCE observed in VISPhot and FGS1 with respect to FGS2 and NIRSpec is caused by a lower detector QE at short wavelengths, while a similar PCE reduction at AIRS wavelengths is mainly a consequence of the refractive materials used in the mid-infrared. The Point Spread Functions (PSF) are estimated as a function of the wavelength using external software and included, allowing for wavelength interpolations.

Fig. 2
figure 2

Total photon conversion efficiency, ΦY(λ)QEY(λ), as used in ArielRad

3.3 Signal model

The Point Response Function (PRF) is the PSF, normalised to unit volume and convolved with the pixel response, which is assumed to be a top-hat function, Hpix(x,y). The effects of a non-flat pixel response function are only relevant for pointing jitter. As discussed later, this effect is estimated with ExoSim and imported in ArielRad as an additional noise contribution (see Section 3.4.3). Hence,

$$ PRF(x_{l}, y_{m}, \lambda) = \int \int PSF(x,y,\lambda)H_{pix}(x_{l}-x, y_{m}-y) dx dy $$
(3)

where l and m are detector pixel indices, and \({\sum }_{l,m} PRF(x_{l}, y_{m},\lambda )=1\).

For each photometric channel, the signal of the target star T is modelled as the sum of the signals sampled by all detector pixels within an elliptical aperture, the size of which depends on the PRF. Hence, this is estimated as

$$ S_{T, phot}(\lambda_{s}) = A_{tel}\underset{x_{l}, y_{m} \in A_{p}}{\sum} PRF(x_{l}, y_{m},\lambda_{s}) \int QE_{Y}(\lambda) {\varPhi}_{Y}(\lambda) S(\lambda) \frac{\lambda}{hc} d \lambda $$
(4)

where λs is the central wavelength of the photometric band (e.g. 0.55, 0.70 and 0.90μm for VISPhot, FGS1 and FGS2 respectively). Atel is the telescope effective collecting area. Ap is an elliptical aperture used for aperture photometry. Typical aperture sizes are chosen to encircle either 83.8% or 91.0% of the energy in the PSF as ExoSim models indicate a reduction of jitter noise to negligible levels compared to other noise sources for these aperture sizes [33]. The aperture, Ap, is defined over x,y pixels with coordinate indices l,m.

The case of a spectroscopic channel is similar and the signal sampled by the detector is estimated as

$$ S_{T, spec}(\lambda_{s}) = A_{tel} \underset{x_{l}, y_{m} \in A_{s}}{\sum} PRF(x_{l}, y_{m}) QE_{Y}(\lambda) {\varPhi}_{Y}(\lambda) S(y_{m}) \frac{ \lambda}{hc}{\Delta} \lambda_{m} $$
(5)

where As is now a rectangular box aperture of the size of the spectral bin along the y-axis, assumed to be parallel to the dispersion direction. Apertures are spaced such as to obtain the desired spectral binning R, which is 20, 100, 30 for NIRSpec, AIRS-CH0 and AIRS-CH1 respectively. Δλm is the wavelength interval sampled by the pixel at coordinate ym, and λs is the wavelength at the centre of the spectral bin sampled by the spectrometer, defined as

$$ \lambda_{s} = \frac{1}{2} \left( \lambda_{j} + \lambda_{j+1} \right) $$
(6)

where λj is the wavelength at the bin edge defined by the recursive relation

$$ \lambda_{j+1} = \lambda_{j} \left( 1+ \frac{1}{R} \right) $$
(7)

and the focal plane wavelength solution maps pixel coordinates to wavelength (\(\lambda _{j+1}, \lambda _{j} \rightarrow y_{i+1}, y_{i}\)).

For the Zodiacal light contribution, the effect is proportional to the pixel solid angle, \({\varOmega } = \left (\frac {{\Delta }_{pix}}{f_{eff}} \right )^{2}\), where Δpix is the pixel size and feff is the effective focal length. For the photometers and the slit-less NIRSpec spectrometer this contribution is given by

$$ S_{zodi,phot}(\lambda_{s}) = A_{tel} {\varOmega} \int QE_{Y}(\lambda) {\varPhi}_{Y}(\lambda) I_{zodi}(\lambda) \frac{\lambda}{hc} d \lambda $$
(8)

However, the AIRS instrument includes a field stop that being wider than the input PSF has no effects on a point source target, but acts as a slit for diffuse radiation. Therefore, Zodiacal light must be modelled differently for AIRS channels. ArielRad simulates the signal incoming to the detector as the convolution between the Zodiacal light and the field stop/slit. If the slit width expressed in number of pixels at the focal plane is L, and the spectral resolving power computed at a certain λ0 is R(λ0), the detector receives diffuse radiation over the wavelength range \(\left (\lambda _{j} - \frac {L \lambda _{0}}{4R(\lambda _{0})} , \lambda _{j} + \frac {L \lambda _{0}}{4R(\lambda _{0})} \right )\), and not over the full range of wavelengths accepted by the filter, so

$$ S_{zodi,spec}(\lambda_{s}) = A_{tel} {\varOmega} {\int}_{\lambda_{c} - \frac{L \lambda_{0}}{4R(\lambda_{0})}}^{\lambda_{c} + \frac{L \lambda_{0}}{4R(\lambda_{0})}} QE_{Y}(\lambda) {\varPhi}_{Y}(\lambda) I_{zodi}(y_{m}) \frac{ \lambda}{hc} {\Delta} \lambda_{m} $$
(9)

where λ0 is 1.95 and 3.90μm and R(λ0) is R(1.95) = 100 and R(3.90) = 30 for AIRS-CH0 and AIRS-CH1 respectively.

ArielRad models the diffuse light coming from instrument emission by substituting Izodi(λ) with n𝜖Iinstr(λ) in (8) and (9), where we are assuming that all the n optical elements (lenses, mirrors, etc.) have the same emissivity, 𝜖. If the components are modelled with different emissivity, \({\sum }_{i=1}^{n} \epsilon _{i} I_{instr}(\lambda )\) is used instead.

The Inner Sanctum contribution is the same for all detector focal planes, as it originates from the emission of their enclosures, and it is proportional to the pixel surface:

$$ S_{Inner}(\lambda_{s}) = {\Delta}_{pix}^{2} \left( \pi - {\varOmega} \right) \int QE_{Y}(\lambda) I_{Inner}(\lambda) \frac{\lambda}{hc}d \lambda $$
(10)

Diffuse backgrounds add a DC offset in the measured stellar signal. ArielRad assumes that these are removed using standard techniques, e.g. aperture photometry, and only their (uncorrelated) contribution to the noise budget is considered later in this work.

The resultant total signal in one spectral bin is the sum of all the previous contributions

$$ S_{Y}(\lambda_{s}) = S_{T,Y}(\lambda_{s}) + S_{zodi,Y}(\lambda_{s}) + S_{instr,Y}(\lambda_{s}) + S_{Inner}(\lambda_{s}) $$
(11)

where in our notation Y can be one of VISPhot, FGS1, FGS2, NIRSpec, AIRS-CH0 or AIRS-CH1. SY(λs) has units of counts (electrons) per second.

3.4 Noise model

ArielRad simulates the noise for each spectral bin from the signal estimate. In a real instrument, noise sources act at every stage of the detection chain and can be stationary and non-stationary. ArielRad estimates the contributions of noise components that are stationary random processes, such as Poisson noise and detector noise. It also includes Jitter noise (provided from an ExoSim simulation), margins for other noise contributions, and a noise floor. Figure 3 shows a noise tree including all the noise sources considered.

Fig. 3
figure 3

Noise tree diagram listing all noise contributors considered in (12). The Poisson Noise term depends on signals collected by the telescope (in yellow the target and the Zodiacal background photon noise), and on signals of instrument origin (in blue, photon noise from instrument and inner sanctum self emission). The detector noise contributors are from dark current, gain and read noise. The Jitter noise term is discussed separately from the rest of the noise terms as it is imported as a model from ExoSim simulations. The payload noise floor prevents noise from integrating down indefinitely

The variance of each spectral bin is modelled as

$$ VAR[S_{Y}(\lambda_{s})] = \begin{cases} (1+\chi_{Y}) VAR_{P}[S_{Y}(\lambda_{s})] \frac{1}{\Delta t_{int}} + & {(Poisson noise term)} \\ +VAR_{D}[S_{Y}(\lambda_{s})] \frac{1}{\Delta t_{int}} + & {(Detector noise term)} \\ +VAR_{J}[S_{Y}(\lambda_{s})] \frac{1}{ {\Delta} t_{int}} + & {(Jitter noise term)} \\ +\left[ p_{0} S_{T,Y}(\lambda_{s}) \right]^{2} & {(Payload noise floor)} \end{cases} $$
(12)

where all quantities except p0 are estimated at a 1 hr integration time and at longer time scales are assumed to decrease as the inverse of Δtint, the integration time in hours. Time scales longer than 1 hr are expected to be above the correlation time scale of coloured noise processes considered. χY is a margin added to include noise sources that cannot be simulated with ArielRad, such as correlated noise, time-dependent effects, and unknown unknowns. Typically, χ = 0.4 is assumed, allowing for approximately 20% margin on top the photon noise that, from one side, it is conservatively larger than what has been achieved with Hubble/WFC3 and Spitzer/IRAC [\(\sim 15\)%, 34], but from the other it is adequate for the current stage of the mission design.

The Payload noise floor, p0, models low frequencies instabilities (e.g. Brownian or pink noise processes) that prevents noise to integrate down indefinitely with time. Following [35], we use p0 = 20 ppm of the incoming signal ST,Y(λs). See also [36, 37] for an example of this approach.

3.4.1 Poisson noise

The Poisson noise term includes all the photon noise contributions: target source, zodiacal background, inner sanctum and instrument emission. Hence, the Poisson variance at 1 hr time scale is

$$ VAR_{P}[S_{Y}(\lambda_{s})] = \frac{1 \text{hr}}{3600 \mathrm{s}} \frac{1 }{\eta } \frac{6(n^{2}+1)}{5n(n+1)} S_{Y}(\lambda_{s}) $$
(13)

The factor 3600s normalises the variance at 1 hr time scale and the parameter \(\eta \lesssim 1\) is an efficiency accounting for detector reset events needed to prevent saturation and includes the temporal gap occurring between the end of the reset event and the first read. As the detector is read in n consecutive non-destructive reads (NDRs) during each exposure, (13) is multiplied by \(\frac {6(n^{2}+1)}{5n(n+1)}\) as discussed in [38] (MULTIACCUM readout strategy, see their (1)). This is appropriate as the number of group elements is 1 in the Ariel baseline payload design. The minimum value of n is 2, i.e. correlated double sampling (CDS) required to remove the detector KTC noise; \(\frac {6(n^{2}+1)}{5n(n+1)} \) evaluates to 1 when n = 2. NDRs are read at a constant cadence and n depends from the detector saturation time

$$ T_{sat} = \frac{f_{WD} \cdot WD}{S_{Y, max}} $$
(14)

where WD is the detector well depth, fWD is the fraction of well depth used (e.g. fWD = 0.75, [39]), and SY,max is the largest pixel signal in the channel Y, that is estimated by ArielRad, but is not explicitly discussed here for the sake of conciseness. V ARP[SY(λs)] has units of s− 2hr.

3.4.2 Detector noise

The detector noise variance can be divided in three terms, read noise, gain noise and dark current. The dark current signal depends on the detector pixel dark current, Idark, and on the number of pixels, Npix, included in the photometric apertures Ap or As defined earlier, with Sdarkcurr = NpixIdark. The detector noise variance at 1 hr is

$$ \begin{array}{@{}rcl@{}} VAR_{D}[S_{Y}(\lambda_{s})] & =& \frac{1 \text{hr}}{3600 \mathrm{s}} \frac{12(n-1)}{n(n+1)} \frac{N_{pix}\sigma_{rd,Y}^{2}}{\eta^{2} T_{sat}} + \\ & +& \sigma^{2}_{gain, Y} {S_{Y}^{2}}(\lambda_{s}) + \\ & +& \frac{1 \text{hr}}{3600 \mathrm{s}} \frac{1 }{\eta } \frac{6(n^{2}+1)}{5n(n+1)} S_{dark curr} \end{array} $$
(15)

where \(\sigma _{rd, Y}^{2}\) is the noise variance on each individual NDR and has no units. Following [38], the factor \(\frac {12(n-1)}{n(n+1)}\) accounts for a line fit to the NDR ramp in one exposure, and decreases as n increases.

The gain noise , σgain,Y in units of \(\sqrt {\text {hr}}\), accounts for instabilities of the electronic acquisition chain (amplifiers, digitisers, etc.), assumed post-processing, i.e. after common modes are removed using housekeeping information. With Tsat in units of seconds, V ARD[SY(λs)] has units of s− 2hr.

3.4.3 Jitter noise

Pointing drifts and jitter of the line-of-sight (LoS) manifest themselves in the observed data product via two mechanisms: 1) the drifting of the spectrum along the spectral dispersion axis of the detector; 2) the drift of the spectrum along the cross-dispersion (spatial) direction. The effect of jitter on the observed time series is the introduction of noise, characterised by the power-spectrum of the telescope pointing (usually not stationary). It is correlated in time, as the power-spectrum is not constant in frequency. The amplitude of the resultant photometric scatter depends on the amount of spectral/spatial displacement of the spectrum, the monochromatic PSF of the instruments, the detector pixel response function (intra-pixel response) and the amplitude of the inter-pixel variations (i.e. QE variations across the focal plane detector array). Modelling the complexity of the jitter noise effect is beyond the capabilities of a radiometric model, therefore ArielRad imports jitter noise models from ExoSim simulations that provides variance vs wavelength at a timescale of 1 hr, i.e. V ARJ[SY(λs)] in (12). At longer timescales, jitter noise is to good approximation time uncorrelated and can be therefore scaled at any desired observing time longer than 1 hr.

3.4.4 Additional noise contributions

As ArielRad is a radiometric model of the Ariel payload performance, it cannot simulate non-stationary noise processes, or processes that require a more sophisticated simulation strategy, i.e. time-domain. One example of the latter is the already discussed pointing jitter, that is accounted for in ArielRad using external modelling with ExoSim.

Detector persistence is an additional potential systematic that cannot be modelled with ArielRad, but HST observations have shown that it can be effectively corrected in data analysis [40]. Further, this effect is expected to be negligible in Ariel observations compared to HST as Ariel stares continuously at a target from an L2 orbit ensuring that detectors reach a steady state in a relatively short amount of time (few minutes at most).

Detector non-linear behaviour is well characterised in detector testing activities before launch and during commissioning. Coupled with a stable line-of-sight provided by the AOCS at the sub-pixel level, detectors pixels sample the same optical power level during an observation, therefore dwarfing, relatively to other sources of experimental uncertainties, this type of systematic that is also known to be amendable [41].

Throughput variations can be caused by thermoelastic deformations that can also induce temporal variations in the shape of the PSF. To minimise this effect, the Ariel payload design uses aluminium structures, including the telescope mirrors. Along with the thermal stability enjoyed by spacecraft in L2 orbits, this prevents driving any significant thermal gradient on the payload structure and subsystems. It is therefore expected by design that thermoelastic throughput variations during Ariel observations will be significantly below any detection limit.

3.4.5 Output format

ArielRad outputs the noise integrated over 1 hr of observation relative to the target signal:

$$ \sigma_{Y,1hr}(\lambda_{s}) = \frac{\sqrt{(1+\chi_{Y})VAR_{P} + VAR_{D} + VAR_{J}}}{S_{T, Y}(\lambda_{s})} $$
(16)

The units are hr1/2. The relative noise achieved during one observation is

$$ \sigma_{Y, {\Delta} t}^{2}(\lambda_{s}) = \frac{\sigma_{Y,1hr}^{2}(\lambda_{s})}{\Delta t_{int}} + {p_{0}^{2}} $$
(17)

where Δtint is the observing time in hours, and the payload noise floor p0 is added in quadrature to prevent the noise to integrate down indefinitely as discussed earlier.

The payload (relative) noise budget is shown in Fig. 4 for a typical bright target star (HD 209458) and for a faint target star (GJ 1214). These stars are representative of respectively the bright and faint flux limits of the payload design requirements. The noise budget has been estimated at 1 hr integration assuming that detectors are analysed in CDS samples, i.e. n = 2. The budget is sampled at Tier 3 resolution and it allows comparison of noise contributions to the total noise in any individual Ariel channel. A comparison of sensitivity among channels for any given target needs to take into account the different spectral binning used that is higher in CH0 (R = 100) compared to NIRSpec (R = 20) and CH1 (R = 30). A strong increase with wavelength is seen in the read and dark current noise components as a consequence of a decreasing stellar SED toward the red.

Fig. 4
figure 4

ArielRad noise budget example of the Ariel payload at 1 hr integration time, using Tier 3 binning. Two targets are considered: GJ 1214, a faint target for the Ariel mission, and HD 209458, a typical bright target. The total noise estimate is in black and includes a 20% margin in excess of the target photon noise (green). Also shown are the detector dark current and read noise (purple and yellow, respectively), photon noise from the zodiacal background (blue), gain noise (brown), and pointing jitter noise (gray). A noise floor is indicated by the horizontal dashed line. The pointing jitter noise model is estimated using ExoSim [33] simulations with identical payload parametrization. Gain noise is assumed to be equal to 40 ppm \(\sqrt {\text {hr}}\). Photon noise from instrument and inner sanctum emission are omitted because are negligibly small and out of the scale of the diagram. . The Ariel payload model used in this budget is that at the mid of phase B1, it includes a noise floor of 20ppm and identical detector median dark current noise across all channels, that might result in an overestimate of up to a factor of 3 in all channels, but CH1

As it is evident from the noise budget, the Ariel payload design reaches photon noise limited performances, allowing observations of even dimmer targets.

3.4.6 Signal-to-noise ratio

In a transit or eclipse observation, the observable is the wavelength-dependent contrast-ratio that is the difference between the flux incoming from the extrasolar system star plus planet when the planet is moving in front (transit) or behind (secondary transit, i.e. eclipse) the star and when it is not:

$$ CR(\lambda) = \frac{S_{OOT}(\lambda)-S_{IT}(\lambda)}{S_{OOT}(\lambda)} $$
(18)

where the labels OOT and IT identify signals measured respectively out of transit and in transit.

For a primary transit the contrast ratio is evaluated from the comparison between the SED of the star S(λ) plus that of the planet Sp(λ), SOOT(λ) = S(λ) + Sp(λ), and the SED measured during the transit, that up to the first order depends on the radii ratio [42, 43]. ArielRad simulates the contrast ratio in each spectral bin for the primary transit, considering also the portion of the star light passing through the planet atmosphere, as described in [13] and [44]:

$$ CR_{tot}(\lambda) = \left( \frac{R_{p}}{R_{\star}}\right)^{2} + 2 {\Delta} z(\lambda) \frac{R_{p}}{R_{\star}^{2}} $$
(19)

where Rp and R are the planet and star radii respectively and Δz(λ) is the atmospheric height. Because the interest is on the detection of the exoplant atmosphere, the “signal” is just the rightmost quantity [34, 45] in the expression above, i.e.

$$ CR(\lambda) = 2 {\Delta} z(\lambda) \frac{R_{p}}{R_{\star}^{2}} $$
(20)

The atmospheric height is proportional to the scale height H = kBT/μg where kB is the Boltzmann constant, T is the temperature of the atmosphere, μ is the mean molecular weight and g is the gravitational acceleration. The wavelength dependent constant of proportionality can be provided by atmospheric models or, following [46], set to 5 independently from the wavelength for a simple, yet representative performance estimate.

For the eclipse case, the contrast ratio is simply CR(λ) = Sp(λ)/S(λ) [14, 47], and the planetary SED is part due to the planet thermal emission, Sem(λ), and part due to the reflected star light, Al(λ). Thus,

$$ CR(\lambda) = \frac{S_{em}(\lambda) + A_{l}(\lambda)}{S_{\star}(\lambda)} $$
(21)

The simulator estimates the contrast ratio in each spectral bin modelling the planet emission as a Black Body at the planet temperature, and computes the reflected light component according to [48] as \(A_{l} (\lambda )= \alpha (\lambda ) \left (\frac {R_{p}}{a} \right )^{2} S_{\star }(\lambda ) \) where α(λ) is the geometric albedo and a is the semi-major orbital axis.

ArielRad can estimate these contrast-ratios considering the observation of SIT lasting the time between the first and last contact, T14, and SOOT lasting γT14, with γ = 1.5, topically, from current Ariel science requirements. Using (17) and 18, the noise variance estimate on a contrast ratio measurement is

$$ VAR(CR_{Y}, \lambda_{s})=\sigma^{2}_{Y,1hr}(\lambda_{s}) \left[ 1 +\frac{1}{\gamma} \right] \frac{1}{T_{14}} + {p_{0}^{2}} $$
(22)

and the Signal-to-Noise Ratio in each spectral bin is

$$ SNR_{Y}(\lambda_{s}) = \frac{CR_{Y}(\lambda_{s})}{\sqrt{VAR(CR_{Y}, \lambda_{s})}} $$
(23)

where we can substitute CR(λs) with the estimated contrast ratio for transit or eclipse observations. The label Y indicates that quantities are integrated over the photometric band or spectral bin of interest. Therefore ArielRad estimates three sets of SNR, one for each of the three Ariel tier discussed in Section 2.

The SNR achieved in multiple, Nobs observations of the same target extrasolar system is assumed to scale as \(\sqrt {N_{obs}}\), under the assumption that disturbances originate from stochastic processes that are uncorrelated over the time scale separating two observations, and longer. If this were not the case, it would be more appropriate to assume that the noise floor is not reduced by averaging multiple observations. However, there is currently no evidence supporting this as Hubble/WFC3 and Spitzer/IRAC observations have not yet reached a noise floor [34] .

4 Validation

ArielRad has been validated by comparing its estimates to those of ExoSim and AERM introduced in Section 1.

As ExoSim has been extensively validated against real astrophysical observations [25], the comparison between ArielRad and ExoSim estimates are the most interesting to investigate. For this, we chose to compare the predictions made of Tsat (14). A consistent result between EsoSim and ArielRad on this parameter implies that ArielRad captures the complexity of the payload design as thoroughly as ExoSim does, and validates the implementation of the radiometric algorithms. The comparison is therefore done implementing the same baseline Ariel model in the two simulators, and Tsat is evaluated for the three target stars GJ 1214 (M4.5, mag\(_{K} \simeq 8.8\)), HD 209458 (G0V, mag\(_{K} \simeq 6.3\)) and HD 219134 (K3V, mag\(_{K} \simeq 3.3\)) that cover a range in brightness and temperature representative of potential Ariel targets [e.g. 49]. The comparison is given in Table 2 For the visible photometer and the infrared spectrometer; it is found that the two models agree to better than 5% on all targets.

Table 2 Comparison between Tsat estimates with ExoSim and ArielRad

The comparison between ArielRad and AERM provides validation of the SNR calculations summarised by (23), that means a validation of all calculations implemented concerning the estimate of the exoplanet atmospheric signatures observable during a transit or an eclipse and uncertainties. [26], and reference therein, detail the algorithms implemented by AERM, that estimates the noise during 1 s of integration as

$$ \sigma_{Y,Total} = \sqrt{[S_{T, Y}(\lambda_{s})+ S_{zodi, Y}(\lambda_{s})] \cdot (1+\chi_{Y}) + N_{min,Y}(\lambda_{s})} $$
(24)

The main difference with the ArielRad implementations is the Nmin(λs) term that combines the noise variance in a spectral bin or photometric band from detector dark current and instrument emissions. For the photometric channels, Nmin,Y(λs) is set to 400 s− 1. For the prism spectrometers, \(N_{min,Y}(\lambda _{s})\propto {\lambda _{s}^{3}}\), and it is set to 17 s− 1μm− 3,20 s− 1μm− 3, and 5 s− 1μm− 3 at the blue end of NIRSpec, AIRS-CH0 and AIRS-CH1, respectively. There is no provision for other noise sources in AERM, including detector readout and gain noise, nor pointing jitter. These effects are accounted for in the term χY that is set to 0.2 for the photometers and NIRSpec, and to 0.3 for both AIRS channels. AERM further assume a photon conversion efficiency that, in each photometric or spectroscopic channel is wavelength independent. For this validation exercise, the ArielRad input configuration is adapted to match that in AERM, in terms of photon conversion efficiency, and matching dark currents to the equivalent Nmin,Y(λs). Furthermore, the parameters in (13) are chosen such that n = 2 (i.e. AERM assumes CDS), η = 1 (i.e. AERM assumes 100% sampling efficiency) and the photon noise margin parameter χ are set to those in use in AERM. V ARD, V ARJ, and p0 are set to zero in (12). With this, AERM and ArielRad implement effectively the same instrument model, and are run to evaluate the SNR achieved on one transit or one eclipse over 2500 candidate Ariel targets [49]. Figure 5 shows the comparison of the two model estimates of the average SNR over spectral bins in each spectrometer. We find that, on average, the SNR estimates agree to better than 2 or 3% for AIRS channels. For NIRSpec we find that on average ArielRad predicts SNR that are up to 8% smaller than AERM’s predictions. A further investigation has revealed the root cause of this discrepancy in the way AERM estimates the signals: while ArielRad integrates SEDs over wavelengths in a spectral bin, AERM uses the SED estimated at the blue end of each spectral bin. This always implies that signal in AERM are systematically larger resulting is smaller uncertainties. The effect is more evident in NIRSpec because the larger spectral bin width (R = 10, as impleneted in ESArad) compared to AIRS channels (R = 100 and R = 30 in CH0 and CH1, respectively).

Fig. 5
figure 5

Comparison between ArielRad and AERM SNR estimates in one transit (left) and one eclipse (right) observation of a sample of 2500 candidate exoplanet atmospheres. Each datapoint is the average SNR across the NIRSpec (green), AIRC-CH0 (red) and AIRS-CH1 (blue) bands, binned at the Ariel tier 3 spectral resolution. Trend lines are shown with slope and intercept given in the text annotation in each panel. A black dotted line with unity slope is shown. While estimates for CH0 and CH1 are in good agreement among both models, AERM provides SNR estimates in NIRSpec that systematically and unrealistically larger than ArielRad. This is due to a limitation in the AERM algorithms implemented and it is further discussed in Section 4

5 Use of ArielRad

ArielRad has been developed to support the Ariel phase B study, leading to Mission Adoption by ESA in the Autumn 2020. ArielRad allows to evaluate the payload science performance over a large target list of thousands of potential exoplanetary targets and to assess the compliance of the payload design with the science requirements briefly discussed in Section 2, with a more detailed discussion in [12] and in the Ariel yellow bookFootnote 3. ArielRad is the main tool used to assess the Ariel payload design solutions and provides a guide to optimise the payload to achieve requirements, and to maximise the Ariel science return beyond requirements, when possible.

The observed diversity of exoplanets can only be investigate by surveying a large parameter space in planetary radii and masses, thermodynamic conditions, chemical properties and host star types. Ariel is designed to provide the first large survey of the atmospheres of about 1000 diverse planets and ArielRad is the tool used to craft a target list that is compliant with this science mandate. [49] used ArielRad simulations to provide a preliminary mission reference sample (MRS) of 1000 planets. While the MRS is expected to evolve during the next phases of the project until launch in 2028, the ArielRad performance analysis demonstrates that the atmospheres of planets in the MRS can be characterised with a SNR > 7 during the 4 year nominal mission lifetime.

As discussed in Section 3.4, ArielRad provides a detailed description of the Ariel noise budget (Fig. 4) on individual targets. The analysis can be extended to provide a comprehensive description of the payload performance over all targets in the MRS to show how Ariel achieves a photon noise limited performance on all targets, as shown in Fig. 6. The MRS of [49] is used. It lists both exoplanets already discovered and expected TESS yields. At AIRS wavelengths (λs > 1.95μ m) photon noise is the dominant source of uncertainty, dwarfing all other noise contributions. At shorter wavelength, photon noise is less important for a small, but significant number of targets, with detector gain noise playing a larger role in the noise budget. These are largely TESS targets around M-type (cold) host stars, demonstrating the power of the Ariel IR bands for these type of targets.The Ariel telescope is diffraction limited at a wavelength of 3 μ m and significant optical aberrations degrade the image quality at shorter wavelengths. However, Ariel works as a light bucket, and image quality is not relevant to achieve its performance requirements. This aspect is also investigated in Fig. 6 where the analysis is done for both a diffraction limited instrument and using estimates of aberrated PSF from engineering optical modelling, corresponding to a wave front error of 250 nm RMS at the VISPhot, FGS1, FGS2 and NIRSpec focal planes, and 280 nm RMS at the AIRS focal planes. The differences are negligible and it can be noted from the figure that an aberrated PSF behaves slightly better than a diffraction limited PSF, despite the former requires a larger number of pixel in each photometric or spectral bin aperture. However, because the aberrated PSF dilutes the signal more, pixels take longer to saturate, there are fewer exposure in a given observing time, hence read noise has overall a smaller impact.

Fig. 6
figure 6

Total noise to photon noise ratio in one hour integration for all Ariel MRS targets. Solid and dashed lines mark median values across all targets evaluated for diffraction limited (red) and aberrated (blue) optics. Blue area represents the dispersion of 95% of simulated planets for the aberrated optics configuration. As the MRS contains exoplanets already discovered as well as expected TESS yields, these are separated respectively in the left and right panels. As discussed in the text, the apparent excess noise above photon noise at visible-nearIR wavelengths is due to the presence of cold M-type stellar targets, that are more numerous among TESS targets, and the assumed gain noise contribution. For these type of targets, observations in the nearIR-midIR are more sensitive showing the power of Ariel’s IR bands that are always shot noise limited

ArielRad uncertainties estimates support the the science community to optimise the science of Ariel. For instance [50] used ArielRad estimates to investigate the capability of Ariel in retrieving pressure-dependent chemical profiles from predictions of observed atmospheric spectra. Science analyses using ArielRad performance estimates are ongoing, include aspects related to atmospheric retrieval, phase curve detection, transit timing variations, etc., and will be reported in Ariel phase B study report later in 2020 ahead of Mission Adoption: the Ariel “Red Book”.

6 Conclusions

In this work we have discussed the algorithmic implementation of ArielRad, the Ariel radiometric simulator, used for the optimisation of the payload design and to evaluate the science performance of the ESA M4 space mission. ArielRad accounts for all relevant sources of uncertainties on the detection of exoplanetary atmospheres with Ariel, that are: photon noise (of astrophysical origin and from the instrument self emission), detector noise and electronic noise, and jitter noise. All other potential systematic of instrument origin are expected to be made negligible by a careful instrument design as detailed by [12] and further discussed in the Ariel Yellow Book.

ArielRad has been extensively validated against two alternative models, ExoSim and AERM, always showing excellent agreement at a few percent level across all Ariel bands.

Ariel will perform the first statistical survey of the atmospheres of a large and diverse sample, observing about 1000 exoplanets during its life time, as discussed in [49] with the help of ArielRad. For all exoplanet observations, the photon noise of their host stars is the dominant source of uncertainties, as revealed by an ArielRad assessment.

ArielRad performance estimates, in the form of noise vs wavelength achieved on a given set of exoplanetary targets, are a product distributed upon request. As the phase-B continues, the payload design is optimised in an iterative process that aims at building the most performant space mission within the envelope provided. An advancement in payload design does not however imply the need to modify the algorithmic implementation of ArielRad thanks to its parametric description of the payload model. As a consequence, the simulator itself is planned to be released at the freeze-out of the payload design, to occur after mission adoption.