Abstract
Phosphine could be a key molecule in the understanding of exotic chemistry that occurs in (exo)planetary atmospheres. While phosphine has been detected in the Solar System's giant planets, it has not been observed in exoplanets to date. In the exoplanetary context, however, it has been theorized to be a potential biosignature molecule. The goal of our study was to identify which illustrative science cases for PH3 chemistry are observable with a space-based mid-infrared nulling interferometric observatory like the Large Interferometer for Exoplanets (LIFE) concept. We identified a representative set of scenarios for PH3 detections in exoplanetary atmospheres that vary over the whole dynamic range of the LIFE mission. We used chemical kinetics and radiative transfer calculations to produce forward models of these informative, prototypical observational cases for LIFEsim, our observation simulator software for LIFE. In a detailed, yet first order approximation, it takes a mission like LIFE: (i) about 1 h to find phosphine in a warm giant around a G star at 10 pc, (ii) about 10 h in H2 or CO2 dominated temperate super-Earths around M star hosts at 5 pc, (iii) and even in 100 h it seems very unlikely that phosphine would be detectable in a Venus-Twin with extreme PH3 concentrations at 5 pc. Phosphine in concentrations previously discussed in the literature is detectable in 2 out of the 3 cases, and it is detected about an order of magnitude faster than in comparable cases with James Webb Space Telescope. We show that there is a significant number of objects accessible for these classes of observations. These results will be used to prioritize the parameter range for the next steps with more detailed retrieval simulations. They will also inform timely questions in the early design phase of a mission like LIFE and guide the community by providing easy-to-scale first estimates for a large part of detection space of such a mission.
Introduction
Phosphine is an essential molecule for our understanding of atmospheric chemistry in planets inside and outside our Solar System. It has been observed in the upper atmospheres of Jupiter and Saturn (e.g., Bregman et al., 1975; Fletcher et al., 2009; Larson et al., 1977; Weisstein and Serabyn, 1996), and it is also expected (not detected yet) to be similarly present in exoplanet atmospheres (e.g., Wang et al., 2017). If phosphine is indeed abundant in a wide range of (exo)planetary environments, it is imperative to study its observability. Chemical kinetic models have demonstrated that phosphine can form in the deeper and hotter layers of giant planets, where the temperature and pressure are sufficiently high to make it the primary, thermochemically stable phosphorus-bearing species. Due to thermal disequilibrium, it is then dredged up to the upper layers of the atmosphere as a consequence of quenching and rapid vertical mixing and thus becomes observable (Visscher et al., 2006). This molecule is also expected to be the primary carrier of phosphorus in warm exoplanet atmospheres, where similar processes might happen. Even though the recent tentative detection of a trace of phosphine in the cloud decks of Venus has been challenged (Akins et al., 2021; Bains et al., 2021; Encrenaz et al., 2020; Greaves et al., 2021a, 2021b, 2020; Lincowski et al., 2021; Snellen et al., 2020; Thompson, 2021; Trompet et al., 2021; Villanueva et al., 2021), the interest of the community for this molecule, including its detectability in exoplanet atmospheres, has increased. First estimates by Sousa-Silva et al. (2020) showed that PH3 at ppm levels could be observed with the James Webb Space Telescope (JWST) when using the technique of transit and eclipse spectrophotometry in H2 and CO2 dominated super-Earths orbiting even active late type stars at distances up to 5 pc. Wunderlich et al. (2021) discussed the detectability of PH3 and other relevant potential biosignature gases for the case of the habitable-zone super-Earth LHS 1140 b with JWST and the Extremely Large Telescope.
One of the major achievements of exoplanet research in the first half of this century will be the investigation of atmospheric properties for a statistically significant number of temperate, terrestrial exoplanets (see, e.g., Fujii et al., 2018). This is mostly driven by the search for habitable conditions and the opportunity to identify potential bio- or even techno-signatures, based on a general understanding of the diversity of planetary bodies. Currently, we are at the very beginning of understanding whether there are any “smoking gun” biosignatures, and we are not certain about those numbers that will make a statistically significant, representative data set of the planet atmosphere zoo. In the next two decades, the first milestones on our roadmap will be achieved in conjunction with selected or proposed ground-based projects (see, e.g., Ben-Ami et al., 2018; Lovis et al., 2017; Snellen et al., 2013) and space-based missions such as the Atmospheric Remote-sensing Infrared Exoplanet Large-survey (Tinetti et al., 2018),* the Large UV/Optical/IR Surveyor (The LUVOIR Team, 2019), † the Habitable Exoplanet Observatory (Gaudi et al., 2020), ‡ and The Origins Space Telescope (Meixner et al., 2019). § These next generation observatories will likely deliver a first comprehensive, consistent, large observational set of exoplanetary atmospheres. A complementary approach to the most frequently discussed large space-based coronagraphic missions or the starshade concept is to separate the light emitted by an exoplanet from that of its host star by means of an interferometer.
Large Interferometer for Exoplanets (LIFE) is such a project that was initiated in Europe with the goal to consolidate various efforts and define a roadmap that would eventually lead to the launch of a large, space-based mid-infrared (MIR) nulling interferometer (Quanz et al., 2021; Quanz et al., 2019; Quanz et al., 2018). This mission will have the capability to facilitate investigation of atmospheric properties of a large sample of (primarily) temperate terrestrial exoplanets. Centered on these clear and ambitious scientific objectives, the project will define the relevant science and technical requirements. As a next step, the status of key technologies will be reassessed, and further technological development will be coordinated.
For a detailed discussion of PH3 in the context of more conventional biosignatures, such as the combination of oxygen and methane, we refer to the standard literature (e.g., Des Marais et al., 2002; Seager et al., 2016; Schwieterman et al., 2018; Krissansen-Totton et al., 2018; and references therein), as well as the LIFE articles (Alei et al., 2022; Konrad et al., 2022), which address the detection of astrobiologically relevant species in Earth twins. A survey of the presence of phosphine in a large variety of exoplanetary contexts is needed to understand phosphine's potential as a biosignature molecule. If PH3 is used as a biomarker in super-Earths' planets with a H2-dominated atmosphere, we need to understand how the processes and chemical pathways work, that is, the abiotic production of it in temperate (sub-) giants. As shown in Fig. 1, phosphine has a relatively unique signature in the MIR that can be distinguished from other relevant species in this wavelength regime at the spectral resolutions provided by LIFE. The O3, NH3, and PH3 bands cover a similar spectral range but feature significantly different shapes. However, it is still important to interpret these spectral signatures in the context of other system parameters and atmospheric constituents, when performing full spectral retrievals (see also Section 4) that can, for example, derive whether the atmosphere is oxidizing. Some combinations can only exist in different chemical regimes and would not interfere with one another. For example, PH3 is theorized to be present only in anaerobic environments (i.e., no O3 present simultaneously). In this study, we present a first order study of the exploration space that is accessible to a mission like LIFE for an inventory study of phosphine in extrasolar planetary environments.

Comparison of the opacities of phosphine with other relevant species in the mid infrared. While some other species are optically active at similar wavelengths, it has a very distinct shape that helps to distinguish it.
LIFEsim
For the modeled observations presented here we use LIFEsim, which simulates observations based on a detailed model of an interferometric exoplanet characterization observation and accounts for noise from stellar leakage, local zodiacal, exo-zodiacal, and dust emission, but no instrumental noise (see Dannert et al., 2022).
Following the same approach as Wang et al. (2017) and Sousa-Silva et al. (2020) in their analysis of the observability with JWST, we used LIFEsim to produce synthetic observations of the outlined exoplanet cases with and without phosphine present in their atmospheres. For the presented output spectra, LIFEsim is configured with four apertures of 2 m diameter each, a broadband wavelength range of 4–18.5 μm, and a spectral resolution of R = 50. We assume an exo-zodi level of thrice the local zodi density (following the results from the Hunt for observable signatures of terrestrial systems survey for the expected median level of emission by Ertel et al., 2020) and an interferometric baseline setup between 10 and 100 m (Table 1).
Overview of Simulation Parameters Used in LIFEsim
Overview of Simulation Parameters Used in LIFEsim
These are the same standard values as, for example, used in Quanz et al. (2021).
The main goal of this analysis was to identify the dynamic range of the LIFE observatory's exoplanet characterization potential by testing a few extreme, yet representative, science cases. Fortunately for us the literature already contained studies of phosphine in hot to warm Jupiter sized planets (Wang et al., 2017), as well as various configurations of super-Earths (Sousa-Silva et al., 2020). In our scheme, the Jupiter case represents one of the least challenging types of observations with LIFE, whereas the potentially habitable super-Earth case represents one of the “sweet spots” in terms of instrumental requirements, as well as astrobiologically relevant observations. The additional scenario of the Venus twin was chosen to explore the extreme limits of an experiment like LIFE. The two cases from the literature were modeled with dedicated pipelines tailored to these setups, while the Venus case was modeled with a standard, open (online) tool. In the next subsections, we describe in more detail these pipelines and refer to the original articles and the Planetary Spectrum Generator (PSG) documentation for all specifics. We decided for this approach since it would not have been trivial (or even impossible) to derive all discussed cases from each one or all of the used pipelines. For example, the pipeline of Wang et al. (2017) was optimized for warm Jupiters, while the methods of Sousa-Silva et al. (2020) focused on Super-Earths. Alei et al. (2022) show that even in a full Bayesian retrieval using two different modeling pipelines has only minor impact on the retrieved abundances. Therefore, we assume that this will have little to no effect on the very general feasibility/detectability study we present here. For a more detailed study of the retrieval of chemical and physical properties of an Earth-twin planet that could be obtained with LIFE, we refer to Konrad et al. (2022). The characterization of the main features of an Earth-twin at various stages of its evolution (prebiotic, Great Oxygenation Event, modern atmosphere) is also discussed by Alei et al. (2022).
Venus twin
Within the group of modeled science cases with the LIFE observatory concept, the detection and characterization of a Venus-like planet represent one of the most challenging observations. This case was chosen to explore the edge of the parameter space of possible detections with LIFE. In this study, we test whether our local Solar System science case of potential phosphine in Venus would be detectable remotely. We use models from the PSG (
Overview of Simulation Parameters Used in the Planetary Spectrum Generator Model of a Venus Twin
Overview of Simulation Parameters Used in the Planetary Spectrum Generator Model of a Venus Twin
PSG = Planetary Spectrum Generator.
We modeled various cases, including 0, 5–20 ppb (the current estimate for Venus from Greaves et al., 2020c) and up to 310 ppm concentration [same as the super-Earth case in Section 2.2.2 from Sousa-Silva et al. (2020), minimum concentration for remote JWST detection there]. For the analysis here, we compare the detectability of the extreme 310 ppm scenario against one without any phosphine.
The models for the cases of H2- and CO2-dominated super-Earths around M star hosts are taken from Sousa-Silva et al. (2020) (Table 3). The atmospheric composition for these scenarios is based on calculations with the photochemical model of Hu et al. (2012) (for more details please see there and references therein). These model super-Earths with radii of 1.75 R⊕ and masses of 10 M⊕ are orbiting a 0.26 RS active 3000K M-dwarf star at 5 pc distance. The assumed phosphine concentrations for the “with PH3” cases are (H2-dominated) 220 ppb and (CO2-dominated) 310 ppm. To maintain equitable, temperate surface conditions (288K), the H2- and CO2-dominated cases with no PH3 emissions are separated from their M dwarf host star by 0.042 and 0.034 AU, respectively. Sousa-Silva et al. (2020) used a photochemical model to explore how PH3 concentrations responded to two different scenarios: an “active” M-dwarf UV flux model based on observations of GJ1214 and a “quiet” M-dwarf for which the UV fluxes shortward of 300 nm are 0.1% of the “active” scenario fluxes. The “quiet” scenario resulted in PH3 accumulating to roughly 100 times higher concentrations for the same emission fluxes compared to the “active” M-dwarf scenario. The PH3 fluxes that would lead to the potential detection of PH3 are larger than the average terrestrial PH3 emission estimates, although comparable to the rates at which other biosignature gases are produced, and they are well below observed site-specific PH3 fluxes on Earth (Sousa-Silva et al., 2020). The reader is directed to the work of Sousa-Silva et al. (2020) for more details on the modeling and the interactions between phosphine and other atmospheric constituents.
Overview of Simulation Parameters Used in the Super Earth Models
Overview of Simulation Parameters Used in the Super Earth Models
For more details, see Sousa-Silva et al. (2020).
We use models of Wang et al. (2017) (Table 4) for a Jupiter-like planet around a G-type star using chemical kinetics to calculate the abundances of different species, which were then input to the Smithsonian Astrophysical Observatory 1998 radiative transfer code (Kaltenegger and Traub, 2009; Traub and Stier, 1976) to calculate the transmission and emission spectra. Their chemical calculations show that PH3 is the main phosphorous carrier for exoplanets between 400K and 2000K.
Overview of Model Parameters for Case 3
Overview of Model Parameters for Case 3
“Solar” abundances refer to Asplund et al. (2009).
In these models, the effects of vertical mixing are considered but those of photochemistry in the upper atmosphere of these planets are neglected. The rationale for this is that, while photochemistry is relevant in very hot planets with a hydrogen dominated atmosphere of solar composition close to their stars, their effect on warm planets is limited (e.g., Miguel and Kaltenegger, 2014); in addition, vertical mixing effects are the most relevant ones in the pressures probed by transmission spectroscopy (at the millibar level). Wang et al. (2017) used the solar elemental (atomic) abundances of C, H, O, N, S, and P (Asplund et al., 2009) and let the code calculate the molecules that will be formed given the conditions in the planetary atmosphere calculation. We note that the Wang et al. (2017) calculation was done without considering the effect of photochemistry, which might affect abundances in the upper parts of the atmosphere (pressures <1e−4 bar) but will most likely have a small effect on the detectability of different species, in particular in MIR emission (vs., e.g., transit transmission in the optical, near infrared).
Estimated yields
In a first step of our study, we checked the availability of potential targets by analyzing the occurrence of the modeled cases in the Solar neighborhood accessible for the theorized observations. Therefore, we checked how many of these potential targets are already known and calculated how many of these LIFE would be able to detect in its detection phase alone even if no other methods would have found those by the time of LIFE's launch.
For the yield simulations, sections of the parameter space that correspond to the described cases are selected. These restrictions to the parameter space are shown in Table 5. We used planet occurrence rates inferred from NASA's Kepler mission to estimate the total expected number of exoplanets in the solar neighborhood and computed the fraction of these, which is detectable with LIFE within 10 h of integration time. For this task, we create synthetic exoplanet populations using the NASA ExoPAG SAG13 occurrence rates (Kopparapu et al., 2018) for all spectral types with P-pop ** (Kammerer and Quanz, 2018) and select different subsamples of planets (Venus twins, super-Earths, and warm Jovians) according to Table 5. For more details on the yield calculations, see Quanz et al. (2022).
Restrictions in Parameter Space for Exoplanet Categories
Restrictions in Parameter Space for Exoplanet Categories
(a) For the eHZ cases we defined it by the “Recent Venus” and “Early Mars” scenarios for 1 M⊕ planets from Kopparapu et al. (2014).
For each of these subsamples, we then compute the number of planets which could be detected with an signal-to-noise ratio (SNR) >7 in 10 h of integration time using LIFEsim (Dannert et al., 2022) (we require an SNR >7 to leave a margin for additional instrumental noise). Table 6 shows planets within 20 pc around FGK stars, and Table 7 shows planets within 10 pc around M stars. We limit the M star case to within 10 pc because the survey efficiency (number of detections per observed target) becomes low further out (Quanz et al., 2022). For the case of the super Earth around an M star at 5 pc, the on-sky separation between the planet and its host star lies at 6.8 mas. The detection and characterization of the planet at this close proximity of about 0.3 lambda/Bnulling in terms of the interferometric array properties illustrate the capability of LIFE to directly image close-in targets. This is made possible by the suppression of the central null being a continuous function down to the line-of-sight, which results in the inner working angle of the observatory being a strong function of the target planet luminosity at a fraction of lambda/Bnulling. It should be noted that an absolute lower bound of the inner working angle is given by the fringe spacing due to the imaging baseline, that is, 0.5 lambda/Bimaging.
Number of Potential Targets in the Respective Cases Within 20 pc for FGK Host Stars
Column one shows the planets already known from the NASA Exoplanet Archive, column two the number of planets expected from Kepler statistics (using P-pop with the SAG13 occurrence rates), and column three the number of planets detectable with LIFE within 10 h of integration time.
LIFE = Large Interferometer for Exoplanets.
Same as Table 6 But for M Star Hosts Within 10 pc Only
The limits of the “boxes” shown here are taken to be wider than the particular cases described in our examples and are chosen to be aligned with those of Kopparapu et al. (2018). The point we are making is that there will be dozens of planets in this parameter space to explore, which will allow us to derive the presence or nonpresence of phosphine within the order of integration times used in our examples.
Figure 2 shows the results of a simulated observation of a Venus twin around a Sun twin at 10 pc. We compare a simulated 100 h observation of an atmosphere with 310 ppm of phosphine to a phosphine free model. The significance is only ∼0.15 sigma, and the phosphine is thus not detectable.

Detectability of phosphine in the emission spectrum of a Venus Twin planet, after 100 h of observation with LIFE. Top: planet flux for atmospheres with and without PH3. The blue area represents the 1-σ sensitivity; the gray error bars show an individual simulated observation. Bottom: Statistical significance of the detected difference between an atmospheric model with and without phosphine.
Figures 3 and 4 show the results of simulated observations of H2- and CO2-dominated super-Earths orbiting an M star host at 5 pc (from Sousa-Silva et al., 2020; see Section 2.2.2). We modeled a 10 h observation of these atmospheres with 220 ppb and 310 ppm of phosphine, respectively, and compared them to phosphine free models for the same cases. We get a solid detection of up to 5 σ per spectral channel in the bands between 8 and 12 μm that show the biggest difference between a PH3 bearing atmosphere and one without.

Detectability of phosphine in the emission spectrum of a super Earth exoplanet (10 M⊕; 1.75 R⊕) with a hydrogen-rich atmosphere orbiting an active M-dwarf, after 10 h of observation with LIFE. Top: planet flux for atmospheres with and without PH3, respectively; blue and red curves represent a modeled atmosphere with a PH3 mixing ratio of 310 ppm and an atmosphere without PH3. The blue area represents the 1-σ sensitivity; the gray error bars show an individual simulated observation. Bottom: Statistical significance of the detected difference between an atmospheric model with and without phosphine.

Detectability of phosphine in the emission spectrum of a super Earth exoplanet (10 M⊕; 1.75 R⊕) with a carbon dioxide-rich atmosphere orbiting an active M-dwarf, after 10 h of observation with LIFE. Top: planet flux for atmospheres with and without PH3, respectively; blue and red curves represent a modeled atmosphere with a PH3 mixing ratio of 310 ppm and an atmosphere without PH3. The blue area represents the 1-σ sensitivity; the gray error bars show an individual simulated observation. Bottom: Statistical significance of the detected difference between an atmospheric model with and without phosphine.
Figure 5 shows the results of a simulated observation of a 500K giant planet orbiting at G star at 10 pc. We get a very solid detection of up to 50 σ per spectral channel in the band around 4.5 μm and up to 20 σ per spectral channel in the bands between 8 and 12 μm that show the biggest difference between a PH3 bearing atmosphere and one without.

Detectability of phosphine in the emission spectrum of 500K warm Jupiter like planet, after 1 h of observation with LIFE. Top: planet flux for atmospheres with and without PH3. The blue area represents the 1-σ sensitivity; the gray error bars show an individual simulated observation. Bottom: Statistical significance of the detected difference between an atmospheric model with and without phosphine.
By applying a detailed mission and instrument simulator for a future MIR space interferometer to map the detectability space of phosphine in three different, extreme planet types and scenarios we find the following: Observations of extrasolar PH3 in a Venus-like planet around G-type stars are very challenging. Even after 100 h of simulated observation, the difference in two extreme forward models (no vs. 310 ppm of PH3) of phosphine is not detectable. For the cases of H2 and CO2 dominated super-Earths around M star hosts, a mission like LIFE can produce highly informative observations in a relatively small amount of time. Applied to a suite of models for these planets taken from the literature, LIFEsim confirms that phosphine concentrations of 220 ppb (H2 case) and 310 ppm (CO2 case) can clearly be distinguished from PH3 free atmospheres with integration times in the order of 10 h. Giant planets, such as the example of the 500K Jupiter, can quickly and easily be characterized in the order of 1 h. LIFE will allow us to make—if present—detections of phosphine in many giant exoplanets and help understand its relevance and occurrence.
In Table 8, we show a comparison of the observation times needed to get a significant detection of phosphine in the described cases. In comparison to the studies of JWST observability of the same scenarios at comparable signal to noise ratios of Sousa-Silva et al. (2020) and Wang et al. (2017), we can report that a concept like LIFE will perform more than an order of magnitude faster: the super-Earths scenarios take 131 h (for a H2-dominated atmosphere) and 48 h (for a CO2-dominated atmosphere) of JWST secondary eclipse emission spectrophotometry (Sousa-Silva et al., 2020) in comparison to about 10 h of observation time needed with LIFE. While the warm Jupiter scenario of Wang et al. (2017) is observable only in transit transmission spectrophotometry within about 30 h of time with JWST, it takes only 1 h with LIFE for a solid detection of phosphine. It is also important to note that the JWST observations require multiple visits to add up the times from a number of eclipse/transit events, while the LIFE observation can be done in a single visit and on nontransiting systems as well.
Overview of Observation Times Needed to Detect the Modeled Levels of PH3
JWST = James Webb Space Telescope.
A caveat of the presented analysis is that it was done for presumably extreme cases and for the comparison of forward models with and without phosphine in the respective atmospheres. Therefore, the next steps for us will be to simulate more examples and run full Bayesian retrievals to get to more quantitative assessments also on the limits of phosphine detectability. The analysis presented here will help to prioritize the parameter range to go for in the next deeper dives into these atmospheres with more detailed retrieval simulations. These results will inform crucial questions and decisions in the early design phase of a mission like LIFE. In combination with other early results of the LIFE initiative, these results will help to get a better idea on how to handle the overall planet statistics in the planning phase of large future missions for terrestrial exoplanet characterization by giving a first order estimate of the realistically explorable parameter space. To build upon these estimates, the LIFE team is currently working on a comprehensive simulation of the consequences on scheduling for the survey and characterization phases of the LIFE mission. At this end, we are also investigating novel machine learning methods that are necessary to expand and augment our front-to-end simulations of the full LIFE survey, including full retrievals (see, e.g., Gebhard et al., 2022). These results will deliver timely information needed for the early definition of sensitivity, wavelength coverage, and spectral resolution requirements on the technology side and set important limits for the modelers to reduce the computing load for upcoming future work such as calculations of spectral/photochemical grids.
Footnotes
Acknowledgments
The authors thank the referees, as well as the participants, of AbSciCon 2022 for very useful feedback on this work.
Authors' Contributions
D.A.: Conceptualization, methodology, data curation, writing-original draft, writing-review and editing, visualization, supervision, project administration. M.O.: Software, methodology, visualization. F.D.: Software, methodology, formal analysis, data curation. Y.M. and C.S.S.: Methodology, writing-review and editing. J.K.: Methodology, software, formal analysis, writing-review and editing. F.M., B.S.K., and H.S.W.: Writing-review and editing. E.A.: Writing-review and editing, visualization. S.P.Q.: Writing-review and editing, supervision.
Author Disclosure Statement
No competing financial interests exist.
Funding Information
Part of this work was supported by SNF, NCCR PlanetS, and the Heising-Simons Foundation.
