PHOTOMETRIC ESTIMATES OF REDSHIFTS AND DISTANCE MODULI FOR TYPE Ia SUPERNOVAE

Richard Kessler; David Cinabro; Bruce Bassett; Benjamin Dilday; Joshua A. Frieman; Peter M. Garnavich; Saurabh Jha; John Marriner; Robert C. Nichol; Masao Sako; Mathew Smith; Joseph P. Bernstein; Dmitry Bizyaev; Ariel Goobar; Stephen Kuhlmann; Donald P. Schneider; Maximilian Stritzinger

doi:10.1088/0004-637X/717/1/40

1. INTRODUCTION

To investigate the expansion history of the universe, increasingly large samples of high-quality type Ia supernova (SN Ia) light curves are being used to measure luminosity distances as a function of redshift (the SN Ia Hubble diagram). Expected SN Ia samples will be in the thousands for the Dark Energy Survey (DES; Bernstein et al. 2009) and in the hundreds of thousands for the surveys to be carried out by the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS)¹⁸ and by the Large Synoptic Survey Telescope (LSST; Ivezić et al. 2008; LSST Science Collaborations 2009). For the latter two, only a small fraction of the SNe will have spectroscopically determined redshifts from SN or host-galaxy spectra for the foreseeable future. To make use of these large SN samples, the redshifts will have to be determined photometrically using both the SN light curves and the host-galaxy photometric observables.

Methods for estimating galaxy photo-z's have been developed over many years (for a review, see, e.g., Abdalla et al. 2008). They generally fall into two categories: (1) an empirical approach, in which one translates observed colors, magnitudes, or other photometric observables into a redshift estimate, training the algorithm on a subset of galaxies with spectroscopic redshifts and (2) template fitting, in which observed colors are matched to redshifted template galaxy spectral energy distributions until the best match for galaxy redshift and type is obtained. The development of photo-z methods using SN data is more recent. Some have followed and adapted the empirical approach to galaxy photo-z estimation, e.g., using observed SN colors near the epoch of peak brightness to estimate the redshift (Wang 2007; Wang et al. 2007) or early-epoch colors to select SNe in particular redshift regions (Dahlen & Goobar 2002).

In this paper, we present and describe a method of SN Ia photo-z estimation that is analogous to the template fitting method for galaxy photo-z's. In models used to fit SN Ia light curves, one typically uses a spectroscopically determined redshift, and the fit parameters are usually taken to be the epoch of peak brightness, the light curve shape or stretch, a color or dust extinction estimate, and the distance modulus. In our approach, we extend the usual methods of fitting light curves to include the redshift as a fifth fit parameter. We apply this "lcfit+z" method to determine photo-z's for the spectroscopically confirmed Sloan Digital Sky Survey-II (SDSS-II; Frieman et al. 2008; Kessler et al. 2009a) and Supernova Legacy Survey (SNLS; Astier et al. 2006, hereafter A06) SNe, and compare the resulting photo-z precision to that obtained from simulated samples. Once we have verified the reliability of the simulations by comparison with the data, we apply the lcfit+z method to simulated LSST SN observations. In all cases, we use both mlcs2k2 (Jha et al. 2007; Kessler et al. 2009a) and saltii (Guy et al. 2007) light curve fitting models.

Variants of the light curve fit approach to SN photo-z's have been used before, by both the SNLS (Sullivan et al. 2006) and SDSS (Sako et al. 2008) surveys, to select SN Ia candidates for spectroscopic follow-up after only a few photometric epochs. Kim & Miquel (2007) used the saltii model and a Fisher matrix analysis to estimate uncertainties on photometric redshifts and distances. Using a technique similar to our lcfit+z method, Gong et al. (2010) studied LSST simulations, focusing mainly on contamination from non-Ia SNe and the resulting precision on cosmological parameters. In contrast, we focus here on the precision and bias for the photo-z and distance modulus. We also use more realistic simulations based on the LSST cadence for both the deep and wide surveys and illustrate some differences between these two components of the LSST survey. Our estimates of non-Ia contamination and cosmological precision will be presented in a future work. Recently, Palanque-Delabrouille et al. (2009, hereafter PD09) employed the light curve fit photo-z technique within the saltii framework. For nearly 300 SNe Ia from the SNLS, they evaluated both the photo-z precision and the fraction of catastrophic redshift outliers. The PD09 SN sample is by far the largest to date used to study SN photo-z methods; our SDSS-II sample, at lower redshifts, is of comparable size.

The plan of this paper is as follows. We introduce the spectroscopically confirmed SN data samples (SDSS-II and SNLS) in Section 2. In Section 3, we describe the simulation, and we present the lcfit+z method in detail in Section 4. The photo-z precision and fit parameter correlations for the data samples, along with the corresponding results from simulated samples, are presented in Section 5. We use the simulation to make predictions for the LSST survey in Section 6. In Section 7, we make direct comparisons with the color-based photo-z method presented in previous works.

As described in Kessler et al. (2009b), all light curve fitting and simulation software is publicly available in the SNANA package.¹⁹

2. THE SDSS-II AND SNLS DATA SAMPLES

To test the lcfit+z method, we use the full three-season sample from the SDSS-II Supernova (SN) Survey (Frieman et al. 2008), and the publicly available sample from the first season of the SNLS (A06). Below we give a brief description of these samples.

The SDSS-II SN Survey used the SDSS camera (Gunn et al. 1998) on the SDSS 2.5 m telescope (Gunn et al. 2006) at the Apache Point Observatory to search for SNe in the Fall seasons (September 1 through November 30) of 2005–2007. This survey scanned a region (designated stripe 82) centered on the celestial equator in the Southern Galactic hemisphere that is 2 fdg 5 wide and runs between right ascensions of 20^h and 4^h, covering a total area of 300 deg² with a typical cadence of every four nights per region. Images were obtained in five broad passbands, ugriz (Fukugita et al. 1996), with 55 s exposures and processed through the PHOTO photometric pipeline (Lupton et al. 2001). Within 24 hr of collecting the data, the images were searched for SN candidates that were selected for spectroscopic follow-up observations in a program involving about a dozen telescopes. The SDSS-II SN Survey discovered and spectroscopically confirmed a total of ∼500 SNe Ia. A larger sample of photometrically identified but spectroscopically unobserved SNe Ia was also compiled, and host-galaxy redshifts for several hundred of these photometric candidates have been obtained to date. The SDSS-III Survey (Schlegel et al. 2009), as a small part of its early program, is in the process of measuring host-galaxy redshifts for more than 1000 of these photometrically identified SNe Ia. The telescope aperture, focal plane, and exposure time of the SDSS system (York et al. 2000) were ideal for discovering SNe in the previously underexplored redshift range 0.1 < z < 0.3. Details of the SDSS-II SN Survey are given in Frieman et al. (2008) and Sako et al. (2008), the procedures for spectroscopic identification and redshift determinations are described in Zheng et al. (2008), and the SN photometry is described in Holtzman et al. (2008). A condensed summary of the SDSS-II survey, SN typing, redshift determination, photometry, and calibration can be found in Kessler et al. (2009a).

The SNLS was a five-year survey covering 4 deg² using the MegaCam imager on the 3.6 m Canada–France–Hawaii Telescope (CFHT). Images were taken in four bands similar to those used by the SDSS: g_M, r_M, i_M, z_M, where the subscript M denotes the MegaCam system. The SNLS exposures were ∼1 hr in order to discover SNe at redshifts up to z ∼ 1. The SNLS images were processed in a fashion similar to the SDSS-II so that spectroscopic observations could be used to confirm the identities and determine the redshifts of the SN candidates. We use the publicly available sample from their first year of operations that ended 2004 July 15. Detailed information about the SNLS can be found in A06 and references therein.

Since high-quality light curves are needed for the lcfit+z method, we apply the following selection requirements to the photometric data for inclusion in our analysis samples: (1) spectroscopic confirmation of type Ia, (2) a measurement with T_rest < −3 days, (3) a measurement with T_rest > + 10 days, (4) measurements in at least three observer-frame filters have signal-to-noise ratio (S/N)>8, and (5) the probability corresponding to the fit-χ²/N_dof (Section 4) is $P_{\chi ^2}> 0.02$ . Here, T_rest is the epoch in the SN rest frame relative to peak brightness in the B band, and we note that it depends on the fitted photo-z value. For the SDSS-II, we use only the gri passbands. The number of SNe Ia satisfying these selection requirements is nearly 300 and 40 for the SDSS-II and SNLS, respectively; the exact numbers of SNe depend on the fitting model (mlcs2k2 or saltii) and will be given in Section 5. The selection criteria above are not based on optimizations, but are instead motivated by the strong correlation between redshift and color. Requiring two colors and at least one measurement in each of three passbands with S/N>8 explicitly ensures good color measurements. The T_rest requirements (<−3 and > + 10 days) ensure a good determination of the time of peak brightness (t₀); since SNe become redder after peak light, a mismeasurement of t₀ translates directly into a mismeasurement of color, and hence redshift.

Here we provide some additional motivation for the above requirements. Relaxing the S/N requirement (No. 4 above) to S/N>5 results in about 10% more SNe Ia in the SDSS-II sample, and a 20% degradation in the photo-z precision. Making a more restrictive cut of S/N>10 results in a 20% loss of events and a negligible improvement in the precision. We have therefore chosen S/N>8 as a reasonable compromise between sample statistics and precision. To motivate the sampling requirements, we have applied the lcfit+z fitting method (Section 4) to the SDSS-II sample in which all measurements prior to peak brightness have been rejected; the resulting precision and bias on the fitted t₀ and photo-z are significantly degraded. The sampling requirements above (Nos. 2 and 3) are therefore designed to ensure a good determination of t₀.

3. SIMULATIONS

We use the SNANA simulation code to generate realistic SN Ia light curves that can be analyzed in exactly the same manner as the data. The simulation is used to compare with the data, to compare with previous photo-z studies based on simulations, and to make predictions for LSST. All SN simulations are based on a standard ΛCDM cosmology (w = −1, Ω_M = 0.3, Ω_Λ = 0.7), and they are generated and fit using the same light curve model: mlcs2k2 or saltii. This strategy explicitly assumes that the light curve model is correct and will therefore yield the most optimistic results. As discussed below, the models are adjusted to account for the anomalous Hubble scatter. However, these models have not been adjusted to account for the discrepancies in the ultraviolet region (Kessler et al. 2009a, hereafter K09). Details of the simulation are described in Kessler et al. (2009b) and in Section 6 of K09; here we give a brief overview.

For mlcs2k2, rest-frame model magnitudes are generated from light curve templates and then dimmed by host-galaxy dust extinction. Using SN Ia spectral templates from Hsiao et al. (2007), K-corrections (Nugent et al. 2002) are used to transform the rest-frame (UBVRI) model magnitudes into observer-frame magnitudes. For saltii, the model is based on a time sequence of rest-frame spectra, and observer-frame magnitudes are computed by convolution with the appropriate filter-response curves.

Since light curve models are defined over a specific wavelength range in the rest frame, one usually checks the rest-frame wavelength $\bar{\lambda }_{\rm f}/(1+z)$ , where $\bar{\lambda }_{\rm f}$ is the mean wavelength of the observer-frame filter and z is the redshift. If the rest-frame wavelength is outside the valid range of the model, then the corresponding filter is typically ignored. For photo-z applications, this method of ignoring filters clearly cannot be used, since the redshift is not known ahead of time. To be realistic, we should not use this filter-ignoring procedure in the simulations either. Therefore, our simulations use wavelength-extended models that generate fluxes for filters with $\bar{\lambda }_{\rm f}/(1+z)$ beyond the nominally defined wavelength range. For both mlcs2k2 and saltii, the rest-frame wavelengths are extended down to 2500 Å. For saltii, the 7000 Å upper limit has been raised to 8700 Å. We note that the simulation and fitter use the same wavelength-extended model, and therefore these tests do not probe potential problems if the extended part of the model is wrong.

We have implemented two models of intrinsic SN magnitude variations that produce "anomalous" scatter in the Hubble diagram. The source of anomalous scatter is unknown, and it is not clear if the scatter can be reduced with an improved light curve model, or if this scatter is due to some random physical process such as brightness variations as a function of viewing angle in an asymmetric explosion. For simulations with 10⁴ times the nominal exposure time and z < 0.5, i.e., for which photon noise is negligible, we define the anomalous scatter to be the rms (RMS_μ) of the difference between the fitted and generated distance modulus (μ_fit − μ_gen) from the four-parameter light curve fit using spectroscopically determined redshifts. The default model, called "color smearing," introduces an independent magnitude fluctuation in each passband, and the fluctuation is the same for all epochs within each passband. A random number r_f from a unit-variance Gaussian distribution is chosen for each passband f, and a magnitude fluctuation δm_f = r_fσ_f is added to the generated magnitude at all epochs. As described in Section 5, a scatter of σ_f = 0.1 mag is needed in order for the simulated photo-z precision to match that of the data. The resulting Hubble scatter is consistent with that seen in analyses of spectroscopically confirmed data samples; RMS_μ ≃ 0.16 for the SNLS (griz) simulations and RMS_μ ≃ 0.19 mag for the SDSS-II (gri) simulations. The simulated SNLS scatter is slightly smaller because this survey results in larger S/N values. The second model of intrinsic variations is called "coherent luminosity smearing": a coherent random magnitude shift, drawn from a Gaussian with σ_coh ∼ 0.15 mag, is added to the model magnitude for all epochs and passbands. In the coherent smearing method the intrinsic model colors are not varied, and the resulting anomalous scatter is RMS_μ = σ_coh.

Although both models of intrinsic magnitude variation result in the expected scatter in the Hubble diagram, only the color-smearing model can generate the observed photo-z precision in the SDSS-II and SNLS data samples (Section 5). We have not investigated simulations in which both models of intrinsic variation contribute, nor have we investigated variations in the color parameter (R_V for mlcs2k2 or β for saltii) that could also introduce anomalous scatter.

To simulate non-photometric conditions and varying time intervals between observations due to bad weather, actual observing conditions are used for an existing survey, or an estimate of such conditions for a planned (future) survey. For each simulated observation, the noise is determined from the measured point-spread function (PSF),²⁰ zero point, CCD gain, and sky background. Noise from the host-galaxy background is not included. The simulated flux in CCD counts is based on a mag-to-flux zero point and a random fluctuation drawn from the noise estimate. For the SDSS-II and SNLS surveys, a detailed treatment of the search efficiency, including spectroscopic selection effects, is described in Section 6.2 of K09.

The quality of the simulation is illustrated in Figures 1 and 2 for the SDSS-II and SNLS surveys, respectively, using the SN selection requirements described in Section 2. The parameters from each sample have been determined using the conventional fitting method with a fixed spectroscopically determined redshift. Each figure shows data-simulation comparisons for the distributions of spectroscopic redshift, color parameter (A_V for mlcs2k2, c for saltii), and shape–luminosity parameter (Δ for mlcs2k2, x₁ for saltii). The color and shape parameters are defined in Section 4. There is good overall consistency between the measured and simulated distributions.

**Figure 1.** For the SDSS-II SN Survey, comparison of data distributions (dots) with those from the simulation (histograms). Fitted parameters for mlcs2k2 (z, *A_V*, Δ) are on the left and for saltii (z, c, x₁) on the right. Fits were done with the redshift fixed to the true redshift. The simulated histograms are scaled to have the same number of entries as the data.
Download figure:
Standard image High-resolution image

**Figure 2.** Same as Figure 1 but for the SNLS survey, using the data sample from A06.
Download figure:
Standard image High-resolution image

4. THE lcfit+z METHOD

The general principle behind the SN Ia photo-z determination is illustrated in Figure 3 using colors at the epoch of peak brightness. Increasing the redshift causes uniform reddening at all wavelengths, while intrinsic reddening (or extinction) causes more reddening at bluer wavelengths. A time-dependent light curve model (e.g., mlcs2k2 or saltii) is used to account for the known color dependence on the light curve shape (commonly known as the stretch) and to use all epochs in order to maximize the photostatics in the photo-z measurement. Since SNe become redder with increasing epoch, any error in the epoch of peak brightness results in the wrong template colors, and hence an increased error in the photo-z.

For real observations, the ideal color–color bands in Figure 3 are smeared by photon statistics and by intrinsic SN color variations that are not described by the light curve model. With sufficient smearing, the colors of a very reddened SN at z ≃ 0.1 are degenerate with a blue SN at z ≃ 0.3 (see the region enclosed by dotted oval in Figure 3) and we indeed see this degeneracy in the SDSS-II photo-z measurements. From data-simulation comparisons, we find that smearing from photon statistics does not fully describe the photo-z precision, and we therefore propose that the modeling of intrinsic color variations is not adequate (Section 5).

While host-galaxy photometric redshifts depend on the determination of the 4000 Å break, a similar ∼2800 Å break in the SN spectrum has little impact on the photo-z measurement because this feature is either inaccessible at low redshifts, or it is poorly measured at higher redshifts. In current SN Ia models, fluxes at these very blue wavelengths are either ignored, or they are heavily down-weighted relative to the optical bands.

The basic idea of the lcfit+z method is to start with a light curve fit model that has four free parameters when the redshift is fixed to an accurately measured value, and simply float the redshift as a fifth fitted parameter. We refer to these methods as mlcs2k2+z and saltii+z, where mlcs2k2 and saltii refer to the conventional models in which the redshift is fixed to a precisely measured value. Compared to the color-based photo-z method, advantages of the lcfit+z method include a natural framework for tracking correlations between redshift and distance modulus (Section 5.2), and using all of the light curve information (instead of just peak flux) so that in principle the intrinsic SN color variations can be accounted for. The main advantage of the color-based method is that it works over a broader redshift range, and there is no need to worry about which observer-frame filters map into a valid wavelength range in the rest frame.

For mlcs2k2+z, the five fitted parameters (which we denote with the vector $\vec{x}_{5}$ ) are the time of maximum brightness in the (rest frame) B band (t₀), the shape–luminosity parameter (Δ), the host-galaxy extinction in the V band (A_V), the distance modulus (μ), and the redshift (z_phot). We use a flat A_V prior and R_V = A_V/E(B − V) = 2.2. For saltii+z the five parameters are the time of maximum brightness in the rest-frame B band (t₀), the shape–luminosity parameter (x₁), the B − V color (c), the flux normalization (x₀), and the redshift (z_phot). The following light curve fit χ² is minimized using minuit²¹:

$\begin{equation} \chi ^2= \sum _i \left\lbrace \frac{\left[F_i^{\rm data} - F_i^{\rm model}(\vec{x}_{5})\right]^2}{ \sigma _i^2 } + 2\ln (\sigma _i/\tilde{\sigma _i}) \right\rbrace, \end{equation} \tag{ 1 }$

and the corresponding probability $(e^{-\chi ^2/2})$ is used to marginalize as described in Appendix B. Here, F^data_i is the SN flux of the ith observation, $F_i^{\rm model}(\vec{x}_{5})$ is the predicted flux using the five model parameters $(\vec{x}_{5})$ , and σ²_i = σ²_i,stat + σ²_i,model is the quadrature sum of the measured and model uncertainties, respectively. The index i runs over all epochs and filters. The second term in Equation (1) accounts for model uncertainties that depend on the rest-frame passband and epoch, which in turn depend on the photo-z value. The model uncertainties are shown in Figure 4 for the rest-frame U and B passbands. The reference uncertainty $(\tilde{\sigma _i})$ from the first iteration of the fit is used in the denominator so that the second term is close to zero in the second iteration; although the $\tilde{\sigma _i}$ do not affect the minimization, these terms reduce the change in the calculated fit probability. As explained below, the χ² is minimized twice in order to include the appropriate filters and epochs. The minimized values and uncertainties are then used to estimate the integration ranges needed to obtain marginalized results.

**Figure 4.** Model uncertainty vs. rest-frame epoch for the U (solid) and B (dashed) passbands.
Download figure:
Standard image High-resolution image

In this study, we use the mlcs2k2 and saltii light curve fitters that have been implemented in the SNANA package. The main reasons for using the SNANA code are (1) exactly the same light curve model is guaranteed to be used in both the light curve fits and in the generated simulations, (2) there are several improvements to the mlcs2k2 light curve fitter as explained in K09, (3) the photo-z implementation is identical for both models, (4) the SNANA fitter is significantly faster than the original fitting software. To check the saltii implementation in SNANA, we have repeated the light curve fits and cosmology analysis for the six sample combinations in K09 and find that the dark energy equation of state parameter w is always within a few hundredths of the value obtained with the original code; these discrepancies are well below the statistical uncertainties.

Although it is straightforward to include the redshift as a free parameter in the light curve fit, there are subtle pre-fit issues related to the unknown redshift value: (1) SN selection criteria that depend on knowing T_rest = T_obs/(1 + z),²² such as requiring measurements with a minimum and maximum T_rest value; (2) as noted above, determining which observer-frame filters (with mean wavelength $\bar{\lambda }_{\rm f}$ ) have $\bar{\lambda }_{\rm f}/(1+z_{\rm phot})$ within the valid wavelength range of the fitting model; (3) determining the valid rest-frame epoch range for the fitting model; (4) as $\bar{\lambda }_{\rm f}/(1+z_{\rm phot})$ maps into a different rest-frame filter (for mlcs2k2) there is a discontinuous change in the model error, and therefore the χ² is not a continuous function of z_phot; and (5) determining robust initial fit parameter values. Our treatment of these issues is described in Appendix A.

We end this section with a discussion of the processing time. For the SDSS-II light curves, which have nearly 50 measurements on average, all of the minimization fit iterations take ∼1 s per SN using minuit. The marginalization (Appendix B) takes close to half a minute per SN using an integration grid of 11 points per fit parameter, or a total of 11⁵ integration cells. However, the integration ranges usually need adjustment after marginalizing, and therefore the marginalization typically runs twice, taking nearly a minute per SN. The processing time scales linearly with the number of measurements and as the fifth power of the number of grid points per fit parameter. For the integration grid above, the Monte Carlo Markov Chain technique requires about the same amount of processing time.

Using the SNANA implementation of mlcs2k2+z and saltii+z, we find that minuit gives adequate minimized values, but that the uncertainties are not reliable because of subtle discontinuities in the χ² derivative with respect to the photo-z. We must therefore marginalize in order to get useful uncertainties and covariances. To illustrate the computational issue more clearly, consider an LSST sample of 500,000 SNe Ia. The marginalization for all of the SNe in this sample requires a total of 1 CPU year. A factor of 100 is probably needed for code development and systematic studies and another factor of several for simulation studies. The total computing needs are therefore a few CPU centuries with today's processors, assuming that 11⁵ integration cells give sufficient accuracy. To use the minimization, which reduces the computing needs to a few CPU years, the light curve model magnitudes and errors must be continuous functions of redshift, as well as their derivatives.

5. RESULTS FOR SDSS-II AND SNLS

Here we present results for the three-season SDSS-II data and the first-season SNLS data described in Section 2, and we compare with results from simulations of those same samples. There are two fit minimizations (Section 4) to determine the appropriate filters to include. The photo-z and distance-modulus results are taken to be the mean of their respective probability distribution functions (pdf's) marginalized over the other fit parameters using a grid of 11⁵ integration cells (Appendix B). The uncertainty is taken to be the rms of the pdf. A prior on the host-galaxy photo-z or SN color could potentially improve the precision of the method and reduce the frequency of catastrophic SN photo-z outliers; we have not used such priors here in order to better illustrate the performance of the lcfit+z method on its own.

Following a commonly used practice in the literature, we characterize the precision of the lcfit+z photo-z precision with the quantity

$\begin{equation} \Delta _{z}\equiv (z_{\rm phot}- z_{\rm spec})/(1+z_{\rm spec}), \end{equation} \tag{ 2 }$

and we use ${\rm RMS}_{\Delta _{z}}$ to denote the root mean square of the distribution of Δ_z. The SDSS-II results are shown in Figures 5 and 6 for the mlcs2k2+z and saltii+z methods, respectively. There are fewer SNe in the sample fitted with saltii because the more restrictive rest-frame model wavelength range (2900–7000 Å) rejects i-band data for redshifts below about 0.1; without i band, these low-redshift SNe fail the requirement of three observer-frame filters. The SNLS results are shown in Figures 7 and 8. Distributions from the data and simulated samples are shown side-by-side in these figures, illustrating the reliability of the simulations in predicting the dispersions and bias. Each plot shows the number of SNe satisfying the selection criteria, ${\rm RMS}_{\Delta _{z}}$ (overall and versus z_spec), and the redshift bias (overall and versus z_spec). The ${\rm RMS}_{\Delta _{z}}$ values are ∼0.04 for both the SDSS-II and SNLS samples.

**Figure 5.** Redshift precision for SDSS-II sample using mlcs2k2+z. Left plots are for data; right plots for simulation. Top plots show the Δ_z distribution for the entire sample; the number of events, rms, and bias are indicated on the plot. Middle plots show Δ_z vs. z_spec. Bottom plots show the bias and ${\rm RMS}_{\Delta _{z}}$ in redshift bins.
Download figure:
Standard image High-resolution image

**Figure 6.** Same as Figure 5, but using saltii+z.
Download figure:
Standard image High-resolution image

**Figure 7.** Same as Figure 5, but using the SNLS sample.
Download figure:
Standard image High-resolution image

**Figure 8.** Same as Figure 5, but using the SNLS sample and saltii+z.
Download figure:
Standard image High-resolution image

For the mlcs2k2+z method applied to the SDSS-II sample (Figure 5), the overall Δ_z bias in the data is notably larger than in the simulation, and there is a redshift-dependent bias that is partially predicted by the simulation. There are two contributions to the Δ_z bias. First, there is a degeneracy between intrinsic reddening (due to extinction or color) and redshift. For redshifts z_spec > 0.3, the best-fit photo-z is sometimes near z_phot ∼ 0.15, and the best-fit color corresponds to a very red and intrinsically dim SN Ia. This degeneracy is sensitive to the S/N; the resulting bias is redshift dependent and is well modeled by the simulation.

To potentially identify fits with a catastrophic photo-z error resulting from a strong color–redshift degeneracy, we have looked for a second maximum in the one-dimensional marginalized pdf. For mlcs2k2+z, ∼9% of the fits have a second maximum in both the photo-z and color pdf, and for saltii+z the corresponding fraction is ∼4%. For both models, the photo-z precision is the same for the subset with a second maximum in the pdf, and therefore this approach cannot be used to identify photo-z outliers in the SDSS-II sample. We also find no correlation between the photo-z uncertainty and catastrophic outliers. Clearly, a reliable host-galaxy photo-z prior will help reduce catastrophic outliers, and this information will be used in future analyses that include spectroscopically unconfirmed SNe Ia in the Hubble diagram.

The second contribution to the Δ_z bias is related to the U-band anomaly discussed in K09, and this source of bias is not modeled in the simulation. The sub-sample with z > 0.2, where the observer-frame g band corresponds to the rest-frame UV region, has a bias larger than the average. The sub-sample with z < 0.2 has a much smaller bias.

For cosmological applications, it is of interest to study how the use of photo-z's in place of spectroscopic redshifts impacts the determination of SN distances. Distance modulus (μ) dispersions for fits with both spectroscopic and photometric redshifts are shown in Figure 9, where RMS_μ is the rms scatter of the distribution of μ_fit − μ_ref. The reference distance modulus (μ_ref) is calculated from the spectroscopic redshift and the same standard cosmology used in the simulation: w = −1, Ω_M = 0.3, Ω_Λ = 0.7. For mlcs2k2, μ_fit is the fitted distance modulus. For saltii, μ_fit is defined to be

$\begin{equation} {\mu }_{\rm fit}^{\rm SALT2} \simeq 30 - 2.5\log _{10}(x_0) + \alpha \cdot x_1 - \beta \cdot c, \end{equation} \tag{ 3 }$

where α = 0.11 and β = 2.6 are fixed parameters from the simulation. Note that the particular choice of α and β does not affect the μ dispersion. Compared to the four-parameter light curve fits using spectroscopic redshifts, the lcfit+z method increases RMS_μ by 0.1–0.2 mag.

**Figure 9.** RMS_μ for four-parameter mlcs2k2 and saltii fits (bottom of each arrow) and for five-parameter mlcs2k2+z and saltii+z fits (top of each arrow). The left panel shows RMS_μ for the data; the right panel shows the simulations.
Download figure:
Standard image High-resolution image

To gauge the appropriate level of intrinsic magnitude variations in the simulation, we compare the Δ_z precision in the data to that in two different simulated samples. The first sample is generated with 0.1 mag color smearing, and the second sample is generated with 0.2 mag coherent smearing. The width of a Gaussian fit to the Δ_z distribution $(\sigma _{\Delta _{z}})$ is used instead of ${\rm RMS}_{\Delta _{z}}$ to reduce sensitivity to outliers. The $\sigma _{\Delta _{z}}$ results are shown in Table 1 for the SDSS-II and SNLS data and for the simulated samples. Using coherent smearing, the simulated photo-z precision is significantly better than that of the data, while the color-smearing model matches the data well. For the small SNLS data sample fitted with mlcs2k2+z, the large uncertainty on $\sigma _{\Delta _{z}}$ is due to a statistical anomaly in the distribution that results in a poor fit to a Gaussian.

Table 1. Photo-z Precision $\sigma _{\Delta _{z}}$ ^a for Data and Simulations

Sample	SDSS-II	SDSS-II	SNLS	SNLS
	(mlcs2k2+z)	(saltii+z)	(mlcs2k2+z)	(saltii+z)
DATA	0.031 ± 0.003	0.027 ± 0.002	0.051 ± 0.020	0.032 ± 0.006
SIM^b	0.030 ± 0.001	0.030 ± 0.001	0.027 ± 0.002	0.028 ± 0.002
SIM^c	0.020 ± 0.001	0.017 ± 0.001	0.021 ± 0.001	0.014 ± 0.001

Notes. ^aΔ_z ≡ (z_phot − z_spec)/(1 + z_spec). ^bUses color-smearing model with σ_f = 0.1 mag. ^cUses 0.2 mag coherent mag-smearing.

Download table as: ASCII Typeset image

This empirical estimate of random intrinsic color dispersion needed in the simulation does not necessarily suggest that there are random color variations in SN light curves, but rather that there are additional sources of color variation that are not captured by the light curve models. Using the nearby SN Ia sample (z < 0.1), Nobili & Goobar (2008) also found evidence for intrinsic color dispersion. They fit the SNe with a light curve model that includes many more color parameters than mlcs2k2 or saltii, and their estimate of the color dispersion is considerably smaller than our empirical estimate based on matching the photo-z precision.

5.1. Comparison with Recent SNLS Photo-z Results

Using data and simulations for the SNLS, we compare our photo-z precision with recent results from PD09. They use the saltii+z method in a manner very similar to ours. The differences between our method and theirs are (1) they use all four griz filters, while we use only those filters that correspond to the valid rest-frame wavelength range; (2) their initial parameter scan is in redshift only (Δz = 0.1 bins), while our initial scan is over a two-dimensional grid of redshift (Δz = 0.04 bins) and color (Δc = 0.2 bins); (3) they impose priors on the color and redshift, while we do not use priors; and (4) they impose less restrictive light curve selection requirements, including the addition of photometrically identified SNe Ia (i.e., spectroscopically unconfirmed SNe). The trade-off between the use of priors versus selection criteria mainly affects the rate of catastrophic photo-z outliers. Our choice of using flat priors is intended to better illustrate the performance of the saltii+z method on higher-quality light curves. While the relaxed cuts in PD09 have the advantage of increasing the sample size, using a prior on color (or on host-galaxy extinction) requires a detailed understanding of the underlying color distribution and survey selection function. The optimal choice between priors and selection criteria is not addressed here and will need further study.

The PD09 sample is based on nearly 300 SNe Ia corresponding to the first three seasons of SNLS, while we use the publicly available sample from A06. To make the selection criteria more similar for this comparison, we have relaxed our requirement on the maximum S/N: three filters must have at least one measurement each with S/N >5 (instead of 8). Our modified selection results in 55 SNe Ia: 13 with z < 0.45 and 42 with z > 0.45.

To evaluate the photo-z precision, we use the PD09 metric σ_Δz/(1+z) ≡ 1.48 × median|Δ_z|, where Δ_z is defined in Equation (2). This quantity is much closer to the Gaussian sigma of Δ_z than to ${\rm RMS}_{\Delta _{z}}$ . To quantify the rate of catastrophic photo-z outliers, we define η_x as the fraction of SNe with a photo-z that satisfies |Δ_z|>x, and we follow PD09 in using x = 0.15.

Table 2 compares our precision metrics with those in PD09. We also give the σ_Δz/(1+z) breakdown for the low-redshift (z_spec < 0.45) and high-redshift (z_spec > 0.45) ranges. We caution that these comparisons are based on different data samples, different selection criteria, and different priors. For the data comparison, the two analyses are reasonably consistent in both σ_Δz/(1+z) and η_0.15 for both redshift ranges. For the simulation comparison there is a subtle disagreement. The PD09 simulation, which uses coherent mag-smearing, underestimates the scatter in the low-redshift range but accurately predicts the precision in the high-redshift range. Our simulation using coherent mag-smearing underestimates σ_Δz/(1+z) (in PD09) for both redshift ranges, but our simulation based on color smearing predicts σ_Δz/(1+z) fairly well, perhaps with a slight overestimate of the scatter. Our simulation supports our earlier conclusion that color smearing is needed to model the photo-z precision; the PD09 simulation supports our conclusion in the low-redshift range but not in the high-redshift range. Finally, our simulation underestimates the fraction of catastrophic outliers (η_0.15 ∼ 0.002), while the PD09 simulation gives good agreement with the data (η_0.15 ∼ 0.01).

Table 2. Photo-z Precision for SNLS Data and Simulation

Reference	σ_Δz/(1+z)	σ_Δz/(1+z)	η_0.15^a
	(z < 0.45)	(z > 0.45)	(all z)
PD09 using
DATA^b (three seasons)	0.016	0.025	0.014
SIM^b	0.006	0.027	0.010

This work using
A06 DATA	0.005(5)	0.036(7)	0.02(2)
SIM (coherent smear)	0.004(2)	0.016(1)	0.002(2)
SIM (color smear)	0.019(3)	0.030(3)	0(2)

Notes. Results for PD09 and this work are shown. Our results include the uncertainty in parentheses. ^aη_0.15 = fraction of SNe with |z_phot − z_spec|/(1 + z_spec)>0.15. ^bSee z_pho^b columns in Table 1 of PD09.

Download table as: ASCII Typeset image

5.2. Photo-z Correlations

Here we briefly discuss photo-z correlations that should be propagated in a Hubble diagram analysis. Using the lcfit+z results from the SDSS-II sample, Figure 10 shows reduced correlations (ρ) between the photo-z and each of the other four light curve fit parameters. The mlcs2k2+z photo-z correlation with t₀ and distance modulus (upper plots) are both peaked at large positive values, the correlation with extinction (A_V) is negative, and there is little correlation with the shape parameter Δ. For saltii+z (lower plots) the photo-z correlations with t₀, shape/stretch parameter (x₁), and color are qualitatively similar to those based on the mlcs2k2+z method. The ρ_(x0,z) correlation has a very broad distribution with an average near zero. For mlcs2k2+z, the simulated distributions match the data well, while for saltii+z there is a slight discrepancy in the distributions of ρ_(c,z) and ρ_x0,z. This discrepancy occurs only for z_spec > 0.2 and could be an artifact of the subset with smaller S/N.

**Figure 10.** Reduced correlations (ρ) between the photo-z (Z) and the other four light curve fit parameters as indicated in the title of each plot. Top plots are for the SDSS-II sample using mlcs2k2+z; bottom plots are for the saltii+z method. The data are shown by dots; the simulation is shown by the histogram.
Download figure:
Standard image High-resolution image

6. PREDICTIONS FOR LSST

Since we have demonstrated that the SNANA simulation can be used to reliably determine the photo-z precision for the SDSS-II and SNLS samples, we now turn our attention to forecasts for the LSST survey. LSST will discover far more SNe than spectroscopic resources can target, and photometric methods will be needed to determine both the redshift and SN type for the majority of events. Here we determine the precision (bias and rms) on the photo-z and distance modulus using the lcfit+z method on more than 10⁴ simulated SNe Ia (after selection requirements) corresponding to the LSST-DEEP and LSST-MAIN surveys. Contamination from non-Ia SNe and the resulting precision in cosmological parameters will be presented in a future work.

The DEEP survey comprises seven fields, each covering nearly 10 deg², that is densely time-sampled in all of the ugrizY LSST filters. After rejecting passbands with invalid $\bar{\lambda }_{\rm f}/(1+z_{\rm phot})$ , the average number of observations per SN Ia used in the light curve fit is 66. The MAIN survey covers more than 20,000 deg² but is not optimized for SN observations, so the light curve sampling often has large temporal gaps for each filter. The mean number of fitted observations per SN for the MAIN survey is 20, more than a factor of 3 fewer compared to the DEEP fields. More details about the LSST are given in Ivezić et al. (2008) and LSST Science Collaborations (2009). Due to the extensive computing resources needed to marginalize these large LSST samples, we have only performed the minimizations that are adequate for determining central values for the fitted parameters.

To simulate observing conditions, we use the output of version OPSIM1.29 of the LSST Operations Cadence Simulator (Delgado et al. 2006 and Section 3.1 of LSST Science Collaborations 2009) for the cadence, sky noise, and 5σ limiting magnitude for each measurement. For each observation, the SNANA simulation requires a zero point (Z_p.e.) to translate the simulated SN magnitude (m) into an observed CCD flux measured in photoelectrons, $F = 10^{-0.4(m-{Z_{\rm p.e.} })}$ . In terms of the OPSIM1.29 parameters, we calculate this zero point to be

$\begin{eqnarray} {Z_{\rm p.e.} }& = & 2{\cal M}_{5\sigma ps}- {\cal M}_{\rm sky}+ 2.5\log _{10}(A\cdot \rm{(S/N)}^2) \nonumber \\ && + 2.5\log _{10}[ 1 + A^{-1}\times 10^{0.4({\cal M}_{\rm sky}-{\cal M}_{5\sigma ps})} ], \end{eqnarray} \tag{ 4 }$

where ${\cal M}_{5\sigma ps}$ is the 5σ limiting magnitude, ${\cal M}_{\rm sky}$ is the Perry sky brightness (mag arcsec⁻²), A = [2π∫[PSF(r)]²rdr]⁻¹ = (1.51 · FWHM)² is the effective aperture (in arcsec²) where FWHM describes the seeing, and $\rm{S/N}=5$ is the signal-to-noise ratio corresponding to ${\cal M}_{5\sigma ps}$ .

For this study, we carry out the light curve fits both with and without a host-galaxy photo-z prior; no other priors are used. The host-galaxy prior is determined from the Bayesian Photometric Redshift Estimation (BPZ) technique (Benitez 2000) applied to a preliminary set of simulated galaxies. The galaxy colors and luminosities are generated to match observed distributions as a function of redshift. For more details, see Section 3.8 of LSST Science Collaborations (2009). The signal to noise as a function of apparent magnitude for the host-galaxy simulation is based on co-added exposures for 10 years of the MAIN survey, and photo-z's are determined for galaxies with r magnitudes down to 25. The average host-galaxy photo-z precision from BPZ is 0.02, and there are some variations with redshift as shown in Figure 11. The z_phot − z_gen bias versus redshift has wiggles of order 0.005, but the true bias could be larger if the SN host galaxies are not a random subset of the galaxies used for photo-z training. Note that in this LSST discussion of the host galaxy and SNe Ia, we characterize the photo-z precision in terms of z_phot − z_gen instead of Δ_z.

**Figure 11.** For 6 × 10⁴ simulated LSST host galaxies, BPZ bias (crosses) and rms (curve) of z_phot − z_gen vs. the true host-galaxy redshift (z_gen) in redshift bins of width Δz = 0.03.
Download figure:
Standard image High-resolution image

These host-galaxy photo-z values are stored in a library for the SNANA simulation. For each simulated SN with true redshift z_SN, the host galaxy with true redshift (z_gal) closest to z_SN is selected. The corresponding host-galaxy photo-z is then scaled by the ratio z_SN/z_gal to correct for the slight redshift mismatch between the SN and the host galaxy. The scaled host-galaxy photo-z and its uncertainty are used to impose a Gaussian prior in lcfit+z.

To ensure well-sampled light curves for the lcfit+z method, we apply the following selection requirements to the simulated LSST SN data: (1) at least two filters with a measurement satisfying T_rest < −5 days; (2) at least two filters with a measurement satisfying T_rest > + 20 days; (3) largest rest-frame gap (that overlaps −5 to +20 days) is <15 days; (4) at least three observer-frame filters have an epoch with S/N>10; and (5) the light curve fit probability satisfies $P_{\chi ^2}> 0.02$ . The requirement of at least two filters (cuts (2) and (3)) removes poorly sampled light curves predominantly from the MAIN survey. Using these requirements, the number of SNe per year is 1900 and 5 × 10⁴ for the DEEP and MAIN surveys, respectively. We note that these selection requirements are based on educated guesses rather than an optimization procedure. Example light curves from the DEEP and MAIN surveys are shown in Figure 12.

**Figure 12.** Left two panels show typical SN Ia light curves simulated for the LSST-DEEP fields; right two panels show typical light curves for the LSST-MAIN fields. The redshift is indicated on the top of each plot. Dots are simulated fluxes, the solid curve is the best-fit mlcs2k2+z model, and the dashed curves are the ±1σ error bands for the model.
Download figure:
Standard image High-resolution image

As a test of the fitting software, we first simulate ideal DEEP-field samples with no intrinsic mag-smearing and the exposure time artificially increased by a factor of 10⁴ compared to the nominal exposure. The resulting photo-z bias is less than 0.001 at all redshifts for both mlcs2k2+z and saltii+z, and the photo-z dispersion (rms) is less than 0.003. The bias on the distance modulus varies between 0 and 0.01 mag for mlcs2k2+z, and is less than ±0.005 mag for saltii+z. The distance modulus dispersion is ∼0.02 mag for both fitting models.

6.1. Results for LSST Simulations

The SN photo-z residuals as a function of redshift are shown in Figure 13 for a simulation without intrinsic magnitude fluctuations and for a simulation using the same intrinsic color fluctuations needed to match the photo-z precision for the SDSS-II and SNLS data samples (Section 5). Each panel shows the residuals as a function of fitting method (mlcs2k2+z or saltii+z), survey field (DEEP or MAIN) and redshift prior (flat or host-galaxy photo-z). With no intrinsic fluctuations, the most notable effects are (1) with a flat redshift prior, the extreme photo-z outliers extend up to |z_phot − z_gen| ∼ 0.2 and (2) the host-galaxy photo-z prior significantly reduces the number of outliers. With default color fluctuations, there is a notable increase in the photo-z outliers. In both cases, the redshift range is higher for saltii+z, because it extends to a lower rest-frame wavelength (2900 Å) than mlcs2k2 (3200 Å).

We quantify the rate of catastrophic outliers using η_0.15 and η_0.10 (see Section 5.1). With ∼10⁴ SNe per sample, the approximate uncertainty is $\sigma _{\eta } \simeq \sqrt{\eta }/100$ . Without intrinsic mag-smearing (Figure 13, left), η_0.15 = 0 in all cases. In this case, for the MAIN survey, η_0.10 ∼ 0.01 without a host-galaxy photo-z prior, and η_0.10 is ×10 smaller when the host-galaxy photo-z prior is used. Using the default color-smearing model in the simulation (Figure 13, right), and fitting without a host-galaxy photo-z prior, η_0.15 < 0.001 for the DEEP survey, and it is somewhat larger for the MAIN survey: 0.004 and 0.012 using the mlcs2k2+z and saltii+z methods, respectively. For η_0.10, the corresponding fractions are ×4 larger. Using the saltii+z method, the number of outliers is about ×3 larger compared to mlcs2k2+z; this difference could be related to the model, but it could also be an artifact of our implementation. We therefore make no claims that either method is more precise or has fewer catastrophic outliers. When the host-galaxy photo-z prior is used, η_0.10 = η_0.15 = 0 in all cases.

To quantify the precision of the lcfit+z method, we have evaluated the bias and rms spread as a function of redshift for both the photo-z and distance modulus (μ). Figure 14 shows the results based on simulations using the coherent mag-smearing model, and Figure 15 shows the results using the default color-smearing model. The main differences between using these two models of intrinsic variations are (1) the photo-z rms goes to nearly zero at low redshift for the coherent mag-smearing model, but has a floor of about 0.01 for the color-smearing model and (2) the rms is slightly larger at high redshift for the color-smearing model.

**Figure 14.** Bias (crosses) and rms (curve) on photo-z residual (z_phot − z_gen), and on distance modulus residual (μ_fit − μ_gen) vs. the true redshift (z_gen) in z-bins of width ΔZ = 0.03 for the LSST simulations. Each plot indicates DEEP or MAIN fields, and FLATZ (SN only) or HOSTZ (host-galaxy photo-z prior). The simulation uses coherent (0.15 mag) brightness fluctuations in all filters and is fitted with mlcs2k2+z (left) and saltii+z (right). For z < 0.15, the saltii curves drop to zero because there are only two valid filters (gr), and the data fail the three-filter requirement. The dashed horizontal lines in the μ_fit − μ_gen plots show the rms contribution from intrinsic variations.
Download figure:
Standard image High-resolution image

Here we briefly summarize the precision based on simulations using the default color-smearing model and fitting without a host-galaxy photo-z prior (see "FLATZ" panels in Figure 15). In the DEEP survey, the photo-z rms precision is ∼0.01 at low redshifts and rises to about 0.04 at the highest redshifts. The RMS_μ precision is near the 0.15 mag floor at low redshifts and roughly doubles at the highest redshifts. In the MAIN survey, the photo-z precision is about ×2 worse compared to the DEEP field survey. The corresponding RMS_μ precision is about 0.2 mag at the lowest redshifts and also roughly doubles at the highest redshift. The photo-z bias in the DEEP survey has wiggles of amplitude ∼0.01 as a function of redshift. The μ-bias wiggles are at most at the 0.01 mag level, and are notably less apparent than those seen in the photo-z bias. In the MAIN survey, the bias is noticeably larger and redshift dependent; the photo-z bias reaches 0.02 and the μ-bias reaches 0.1 mag. Both the photo-z and μ biases are largest at z_gen ∼ 0.6.

When fitting with a host-galaxy photo-z prior (see "HOSTZ" panels in Figure 15), the precision is significantly improved for both the photo-z and μ. In the MAIN survey, however, a redshift-dependent bias remains for z_gen > 0.6.

Although the LSST photo-z precision looks promising, we urge some caution in the interpretation of these simulations. If we consider the LSST DEEP-field subset over the same redshift range as the SDSS-II sample (z < 0.4), the forecast LSST photo-z precision (rms) is about ×3 better than that for the SDSS-II data sample described in Section 5, reflecting the higher expected signal to noise and broader wavelength coverage. Although we are confident in extrapolating the rms precision based on the treatment of photon statistics and color smearing, we cannot rule out unknown systematic effects, primarily from the unknown source of intrinsic brightness variations, that could limit the photo-z precision and accuracy. Concerning the photo-z bias presented here, this should be considered a lower limit because the same light curve model has been used in both the simulation and in the lcfit+z fit. The current mlcs2k2 and saltii models have been shown to differ significantly in the ultraviolet region (K09), but such modeling errors have not been considered here. In future studies, fitting saltii simulations with mlcs2k2 (and vice-versa) would likely give an upper limit on the photo-z bias, and may lead to additional clues about problems in the light curve models.

7. COMPARISONS WITH COLOR-BASED REDSHIFT ESTIMATES

Here we compare our lcfit+z results with color-based SN redshift estimates for the SNLS sample and for an mlcs2k2-based simulation.

7.1. Comparison with the SNLS Sample

Wang (2007) determined color-based photometric redshifts for 40 SNe Ia from the SNLS A06 sample. For the 20 SNe used in the training, the author finds ${\rm RMS}_{\Delta _{z}}= 0.03$ ; for the remaining 20 SNe, ${\rm RMS}_{\Delta _{z}}= 0.05$ . In our mlcs2k2+z analysis, 37 SNe satisfy the selection criteria, and ${\rm RMS}_{\Delta _{z}}= 0.045$ . For the saltii+z method, 37 SNe satisfy the selection requirements, and ${\rm RMS}_{\Delta _{z}}= 0.040$ . By this metric, our lcfit+z method works equally well compared to the color-based method. Since a list of SNe used by Wang (2007) is not available, we cannot make a more detailed comparison with the same subset.

7.2. Comparison with mlcs2k2-based Simulations

We compare photo-z results of the two methods on simulations as described in Wang et al. (2007, hereafter WNW07). Following the procedure in WNW07, we simulate observer-frame filters riz using the mlcs2k2 model for redshifts z < 0.95 (so that r-band data are always well defined within the model), fix the shape–luminosity parameter Δ = 0, generate a flat redshift distribution, and ignore intrinsic magnitude variations that introduce anomalous Hubble scatter. For each SN and each filter, the exposure time is adjusted so that S/N = 25 at the epoch of peak brightness. In WNW07, only the peak fluxes are used and therefore the light curve sampling does not matter; to investigate the comparative effectiveness of our mlcs2k2+z method, simulated light curves are sampled every 7 days in the observer frame.

In the ideal case of no host-galaxy extinction (A_V = 0), WNW07 find ${\rm RMS}_{\Delta _{z}}= 0.004$ with a mean bias of 5.4 × 10⁻⁴; using mlcs2k2+z under similar assumptions, we find ${\rm RMS}_{\Delta _{z}}= 0.006$ and a mean bias of (−1.7 ± 2.1) × 10⁻⁴. Including host-galaxy extinction in the simulation with a pdf P(A_V) = exp(−A_V/0.46) and reddening parameter R_V = 3.1, WNW07 find ${\rm RMS}_{\Delta _{z}}= 0.044$ with a mean bias of 0.008; using mlcs2k2+z, under the same conditions we find ${\rm RMS}_{\Delta _{z}}= 0.009$ and a mean bias of (0.5 ± 2.9) × 10⁻⁴. The significantly improved precision with our mlcs2k2+z method in these more realistic conditions is in part due to the increase in effective signal to noise that comes from using the entire light curve. In addition, shape and color information contained in the light curve enables the mlcs2k2+z method to partially untangle color variations from extinction versus those produced by redshift.

Since the color-based method in WNV07 cannot untangle reddening from extinction and redshift, we have attempted a more fair comparison to the mlcs2k2+z method in which A_V is fixed to the mean generated extinction of 0.46 mag; in this case ${\rm RMS}_{\Delta _{z}}$ nearly doubles to 0.015 for the mlcs2k2+z method, yet is nearly a factor of 3 smaller than the dispersion in WNV07. This difference is either due to the enhanced photon statistics from using the entire light curve in the mlcs2k2+z method or from non-optimal training in the color-based method. To increase ${\rm RMS}_{\Delta _{z}}$ to the WNV07 value of 0.044, the peak S/N must be reduced from 25 to less than 10.

In the mlcs2k2+z fits we have used R_V = 3.1, the same value used in generating the simulation. Using the correct value for R_V is an apparently unfair advantage over the WNW07 treatment, in which no assumptions are made about color variations. However, in practice the assumption about the value of R_V makes little difference to the results: fitting with R_V = 2.2 produces the same precision for Δ_z, although the resulting distance moduli are biased by about 0.1 mag.

Even though the effects of host-galaxy extinction are included the simulation described in WNW07, this simulation is not realistic because the shape–luminosity parameter Δ is fixed to zero, and there are no intrinsic magnitude variations. Simulating our best estimate for these effects (Section 3), the bias and scatter in the fitted photo-z and distance modulus are shown as a function of z_spec in Figure 16. For redshifts below 0.8 there is no significant bias in either the photo-z or distance modulus. For z > 0.8 the bias and scatter increase significantly. The nature of this high-redshift bias is illustrated in Figure 17, which shows the correlation between the fitted versus simulated photo-z difference and the fitted versus simulated difference for time of peak (t₀), extinction, and shape–luminosity parameter. The photo-z outliers are clearly correlated with t₀ outliers. Since the SN Ia color becomes redder with epoch, a misestimate of t₀ changes the apparent SN color, which is translated into an error in the redshift. The bias in t₀ is an artifact of the discrete mlcs2k2 passbands used to characterize the rest-frame light curves. For SNe simulated at redshifts z > 0.8, the three observer-frame filters map into only two rest-frame filters: riz → UBB. In a small fraction of the light curve fits, however, the wrong filter-mapping (riz → UBV) results in a smaller χ². Since the B- and V-band templates have different shapes, as well as a two-day shift in the time of peak brightness, the fitted t₀ is biased.

**Figure 16.** For mlcs2k2 simulations with peak S/N = 25, fitted photo-z bias and ${\rm RMS}_{\Delta _{z}}$ vs. z_spec (left), and distance modulus bias and RMS_μ vs. z_spec (right). The simulations are generated according to the prescription of WNW07, but also including non-zero Δ and intrinsic color variations.
Download figure:
Standard image High-resolution image

**Figure 17.** Fit-simulation photo-z residual (z_phot − z_gen) vs. fit-simulation residual for time of peak brightness (t₀), distance modulus (μ), host-galaxy extinction (*A_V*), and shape–luminosity parameter (Δ).
Download figure:
Standard image High-resolution image

8. CONCLUSIONS

We have developed and tested a photometric redshift estimation method using SN Ia light curves within the framework of the mlcs2k2 and saltii models and find that they result in similar photo-z precision. We used an iterative fitting procedure to determine the valid observer-frame filters to use in the fits. Applying this method to SDSS-II and SNLS data, we obtained an average photo-z precision of ${\rm RMS}_{\Delta _{z}}\sim 0.04$ . To reproduce a comparable level of precision in simulations, intrinsic color smearing is needed (Section 3) at the level of about 0.1 mag per passband or 0.14 mag per color. This empirical estimate of color smearing is consistent with but does not necessarily imply that there are random color variations in SN Ia light curves. However, this effect does indicate that there are additional sources of color variation that are not captured by the mlcs2k2 and saltii light curve models.

We applied the lcfit+z method to simulated LSST samples (Section 6). For the DEEP fields, the rms scatter of z_phot − z_gen varies from 0.01 to 0.04 without using a host-galaxy photo-z prior. For the MAIN survey, the photo-z precision is about ×2 worse. Using a host-galaxy photo-z prior significantly reduces outliers and improves the overall precision. The next critical step is to apply this method to simulations that include non-Ia type SNe and estimate the resulting contamination of photometric SN Ia samples by core-collapse SNe.

We gratefully acknowledge support from the Kavli Institute of Cosmological Physics at the University of Chicago, the National Science Foundation at Wayne State, and the Department of Energy at Fermilab, the University of Chicago, and Rutgers University. S.J. is grateful for the support of DOE grant DE-FG02-08ER41562. Funding for the creation and distribution of the SDSS and SDSS-II has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, the U.S. Department of Energy, the National Aeronautics and Space Administration, the Japanese Monbukagakusho, the Max Planck Society, and the Higher Education Funding Council for England. The SDSS Web site is http://www.sdss.org/.

The SDSS is managed by the Astrophysical Research Consortium for the Participating Institutions. The Participating Institutions are the American Museum of Natural History, Astrophysical Institute Potsdam, University of Basel, Cambridge University, Case Western Reserve University, University of Chicago, Drexel University, Fermilab, the Institute for Advanced Study, the Japan Participation Group, Johns Hopkins University, the Joint Institute for Nuclear Astrophysics, the Kavli Institute for Particle Astrophysics and Cosmology, the Korean Scientist Group, the Chinese Academy of Sciences (LAMOST), Los Alamos National Laboratory, the Max-Planck-Institute for Astronomy (MPA), the Max-Planck-Institute for Astrophysics (MPiA), New Mexico State University, Ohio State University, University of Pittsburgh, University of Portsmouth, Princeton University, the United States Naval Observatory, and the University of Washington.

This work is based in part on observations made at the following telescopes. The Hobby-Eberly Telescope (HET) is a joint project of the University of Texas at Austin, the Pennsylvania State University, Stanford University, Ludwig-Maximillians-Universität München, and Georg-August-Universität Göttingen. The HET is named in honor of its principal benefactors, William P. Hobby and Robert E. Eberly. The Marcario Low-Resolution Spectrograph is named for Mike Marcario of High Lonesome Optics, who fabricated several optical elements for the instrument but died before its completion; it is a joint project of the Hobby-Eberly Telescope partnership and the Instituto de Astronomía de la Universidad Nacional Autónoma de México. The Apache Point Observatory 3.5 m telescope is owned and operated by the Astrophysical Research Consortium. We thank the observatory director, Suzanne Hawley, and site manager, Bruce Gillespie, for their support of this project. The Subaru Telescope is operated by the National Astronomical Observatory of Japan. The William Herschel Telescope is operated by the Isaac Newton Group on the island of La Palma in the Spanish Observatorio del Roque de los Muchachos of the Instituto de Astrofisica de Canarias. The W. M. Keck Observatory is operated as a scientific partnership among the California Institute of Technology, the University of California, and the National Aeronautics and Space Administration. The Observatory was made possible by the generous financial support of the W. M. Keck Foundation.

APPENDIX A: FITTING ISSUES WITH UNKNOWN REDSHIFT

Here we discuss our implementation of the five pre-fit issues mentioned in Section 4: (1) SN selection criteria that depend on knowing T_rest = T_obs/(1 + z) such as requiring measurements with a minimum and maximum T_rest value; (2) determining which observer-frame filters (with mean wavelength $\bar{\lambda }_{\rm f}$ ) have $\bar{\lambda }_{\rm f}/(1+z_{\rm phot})$ within the valid wavelength range of the fitting model; (3) determining the valid rest-frame epoch range for the fitting model; (4) as $\bar{\lambda }_{\rm f}/(1+z_{\rm phot})$ maps into a different rest-frame filter (for mlcs2k2) there is a discontinuous change in the model error, and therefore the χ² is not a continuous function of z_phot; and (5) determining robust initial fit parameter values.

The simplest way to handle SN selection criteria (issue 1) is to postpone such requirements until the fit has finished and then use the fitted photo-z to determine the T_rest values. This solution is not practical, however, because we often wish to remove poorly sampled light curves before fitting, thereby avoiding pathological fits that are of no interest. A safe way to apply T_rest-dependent requirements before fitting is to relax such cuts by a factor of 1 + Z_max, where Z_max is a safe upper bound on all redshifts. For example, consider the requirements of a measurement with T_rest < −6 days and 21days < T_rest < 60 days. Using Z_max = 0.5 for the SDSS-II, the pre-fitted requirements are T_rest < −4 days and 14days < T_rest < 90 days. Any initial redshift value can be used to determine T_rest as long as it is less than Z_max. Although these T_rest-related requirements are relaxed, they are still useful for rejecting poorly sampled light curves; the nominal cut is applied after the fit using the fitted photo-z value.

To determine the valid observer-frame filters (issue 2), the first-iteration photo-z fit uses all filters, with a possible exception for those covering the ultraviolet region with a mean wavelength below about 4000 Å. The ultraviolet filter should be left out of the first iteration if it is rarely used, noting that it can be added back for the second fit iteration. The fundamental assumption about the fitting model is that extrapolating beyond the defined wavelength range gives reasonable magnitude estimates so that the fit converges and that the photo-z bias is not too large. The (biased) photo-z estimate is then used to determine which observer-frame filters to retain and to reject for the second fit iteration. Note that if an ultraviolet filter is excluded initially, then a low photo-z value will result in the inclusion of this filter for the second iteration. If a filter is excluded, the T_rest-related requirements are re-tested; the fit stops if the light curve no longer has adequate sampling.

Since the first-iteration photo-z (z¹_phot) is biased and has some uncertainty, a safety margin is used to determine which observer-frame filters to keep. For each observer-frame filter (f) with mean wavelength $\bar{\lambda }_{\rm f}$ , the valid redshift range for this filter is defined by

$\begin{eqnarray} z_{{\rm min,f}}& = & \bar{\lambda }_{\rm f}\big/\lambda _{\rm max}^{\rm model} - 1, \end{eqnarray} \tag{ A1 }$

$\begin{eqnarray} z_{{\rm max,f}}& = & \bar{\lambda }_{\rm f}\big/\lambda _{\rm min}^{\rm model} - 1, \end{eqnarray} \tag{ A2 }$

where λ^model_min,max are the minimum and maximum rest-frame wavelengths defined by the model. For example, the saltii model is defined for rest-frame wavelengths 2900–7000 Å; the observer-frame i band $(\bar{\lambda }_i = 7500 {\;\AA})$ is therefore valid for redshifts above z_min,i = 0.071, and g band $(\bar{\lambda }_g = 4720 \;\AA)$ is valid for redshifts below z_max,g = 0.63. For this analysis, a filter is kept in the second iteration if the first-iteration fitted photo-z satisfies

$\begin{equation} z_{{\rm min,f}}+ \Delta {z}_{\rm phot}^{\rm cut}< z_{\rm phot}^1< z_{{\rm max,f}}- \Delta {z}_{\rm phot}^{\rm cut}. \end{equation} \tag{ A3 }$

We have set Δz^cut_phot = 0.04 and 0.03 for the SNLS and SDSS surveys, respectively. Δz^cut_phot is smaller for the SDSS-II because the S/N for SNe at z_min,i = 0.07 is larger than the S/N for SNLS SNe at z_max,g = 0.063. The Δz^cut_phot filter-selection cut has not been optimized; the optimal cut is likely to depend on the uncertainty of the host-galaxy photo-z and may also depend on redshift.

This filter selection algorithm is illustrated in Figure 18. The solid histograms in the top two plots (SDSS data and simulation) show the true redshift (z_spec) distributions near z_min,i when the i band is kept in the saltii+z fit. When the i band is discarded (dashed), the entire light curve is discarded because of the requirement of having three filters. The gap between z_min,i = 0.071 and the bulk distribution is due to the Δz^cut_phot cut. The bottom two plots in Figure 18 illustrate cases for LSST when a dropped filter does not result in rejecting the entire light curve. When the LSST g band is included in the mlcs2k2+z fit (solid curve, lower left), the true redshift almost always satisfies the model-validity requirement (z_spec < 0.5). The dashed curve shows that the g band has been correctly excluded when the true redshift is above 0.5, but we have also excluded this passband in some cases where it is valid (i.e., z_spec < 0.5). The situation is similar for the LSST Y band (lower right plot). For both passbands, a small number of light curve fits include these bands when they should have been excluded; potential biases from these invalid passbands should be accounted for in the assessment of systematic uncertainties on cosmological parameters.

The selection of rest-frame epochs (issue 3), which depends on T_obs/(1 + z_phot), is made in the same way as the selection of filters (issue 2). All measurements are included in the first fit iteration, and the initial photo-z estimate is then used to select which epochs satisfy the epoch range of the model.

The χ² continuity (issue 4) is an issue because the SNANA minimization is based on minuit, and this program relies on computing local derivatives with respect to the fitting parameters. This means that discontinuities in the χ² function and its derivative can lead to fitting failures or pathological results. The problem can be solved by marginalizing, but the necessary five-dimensional integration requires ∼100 times more CPU processing. In addition to the utility of using the much faster minimization program, it is useful to identify χ² discontinuities because such functional pathologies usually correspond to unphysical behavior in the light curve model, and the model should be improved accordingly.

There are two main sources of χ² discontinuity: (1) interpolating lookup tables and (2) filter-dependent model parameters. For class (1), appropriate numerical methods must be used to ensure continuity in both the function and its derivative. For class (2), the problem occurs for rest-frame models such as mlcs2k2, in which an infinitesimal change in z_phot results in a different rest-frame filter for the model or a different color to use for spectral warping in the K-corrections. This step-function change in the model parameters results in a discontinuity in the model magnitude (or its error) as a function of z_phot. To prevent such sharp discontinuities, SNANA uses a smooth transition function (A + Btan⁻¹(λ/τ_λ)) to smoothly vary model parameters between neighboring filters.

Since the redshift and SN color are somewhat degenerate, initial parameter estimation (issue 5) is important so that minuit will find a global, rather than local, minimum. The initial color and photo-z are determined from a crude χ² minimization on a coarse grid: the color is varied in bins of 0.2 and the photo-z is varied in bins of 0.04. For each photo-z value, an initial distance modulus estimate (μ_ini) is needed to ensure robust convergence of minuit. Although μ_ini is calculated from a standard ΛCDM cosmology using a specific set of cosmological parameters, the fitted distance modulus is unconstrained and is therefore not biased by the μ_ini calculation. For saltii, the initial x₀ value is estimated by inverting Equation (3). The shape–luminosity parameter is initialized to an average value: Δ = −0.1 for mlcs2k2 and x₁ = 0.0 for saltii. After the first fit iteration, the fitted parameters are used as initial values for the second fit iteration. When a filter is dropped and the first fit iteration is repeated, the fitted parameters are used as initial values, even though an invalid filter was used in the fit. If we do not use a coarse grid to initialize the color and photo-z and simply start with average values, the photo-z precision for the SDSS-II sample is only slightly degraded. For the SNLS sample, however, which has a much larger redshift range compared to the SDSS-II, the photo-z precision is degraded by a factor of 2 due to a significant number of catastrophic outliers.

APPENDIX B: MARGINALIZATION

Here we describe some of the details related to the marginalization. The minimized values and uncertainties are used to estimate the integration ranges: ±4σ around the minimized value for each parameter. The integration grid consists of N_grid points per fit parameter or a total of N_grid⁵ integration cells. We find that N_grid = 11 is a good compromise between precision and computing time. To improve calculation speed per integration cell, the χ² calculation stops when the probability falls below 10⁻⁵.

The marginalization is repeated for either of the following cases: (1) the probability at any integration boundary is greater than 0.03 or (2) more than three one-dimensional bins (i.e., marginalized over the other four parameters) have a probability less than 10⁻⁴. In the first case the integration range is extended, while in the second case the integration range is reduced.

PHOTOMETRIC ESTIMATES OF REDSHIFTS AND DISTANCE MODULI FOR TYPE Ia SUPERNOVAE

Article metrics

Permissions

Author e-mails

Author affiliations

Dates

ABSTRACT

1. INTRODUCTION

2. THE SDSS-II AND SNLS DATA SAMPLES

3. SIMULATIONS

4. THE lcfit+z METHOD

5. RESULTS FOR SDSS-II AND SNLS