QUANTIFYING OBSERVATIONAL PROJECTION EFFECTS USING MOLECULAR CLOUD SIMULATIONS

Christopher N. Beaumont; Stella S. R. Offner; Rahul Shetty; Simon C. O. Glover; Alyssa A. Goodman

doi:10.1088/0004-637X/777/2/173

1. INTRODUCTION

All Galactic star formation occurs within molecular clouds (McKee & Ostriker 2007). Since the processes that form and sculpt molecular clouds set the initial conditions for star formation, the spatial and kinematic structure of molecular clouds provides clues about the star formation process.

CO is the most-utilized tracer of molecular cloud structure. Though molecular hydrogen and atomic helium are 10³–10⁴ times more abundant than CO, neither radiates efficiently in molecular clouds. The rotational transitions of CO, on the other hand, are easily excited at typical molecular cloud temperatures (10–20 K) and densities (∼100 cm⁻³), and are readily observed in the sub-mm and far infrared. ¹²CO is easily observed at low densities (n ∼ 100 cm⁻³), but is often optically thick; ¹³CO is ∼70 times less abundant than ¹²CO, and remains optically thin to higher volume density substructures (Davis et al. 2010; Wilson 1999; Frerking et al. 1982). ¹³CO emission is associated observationally with gas at n ≳ 10³ cm⁻³.

Ideally, the full six-dimensional (6D) spatial-kinematic information would be available for studying molecular cloud structure. Unfortunately, observations can only provide either 2D information of the intensity in the plane of the sky or 3D intensity information as a function of 2D space and line-of-sight velocity. For accurately interpreting observations, therefore, it is necessary to thoroughly understand the translation of physical properties in 6D space to the observed emission in position–position–velocity (PPV) space.

In most analyses, researchers assume (implicitly or explicitly) that intensity features in PPV datasets correspond more or less cleanly to 3D (positition–position–position or PPP) density structures in a cloud (see Table 1 for terminology). A typical molecular cloud analysis decomposes clouds into one or more structures based solely on the morphology of emission in PPV space and measures the properties of these structures. For example, this is the analysis strategy used to measure the size dependence of velocity dispersion, mass, and virial parameter (Larson 1981; Solomon et al. 1987; Bolatto et al. 2008).

Molecular cloud motions are dominated by turbulence at scales above ∼0.1 pc, and have complex velocity fields. Likewise, the temperature, excitation, and abundance conditions vary throughout clouds by factors of several (Pineda et al. 2008; Schneider et al. 2013). All of these factors affect the morphology of emission in PPV space and present a substantial obstacle to further data analysis. This raises the question: how well do features in observational data relate to intrinsic structures in the 3D cloud?

The aim of this paper is to measure how well intensity structures extracted from PPV cubes correspond to density structures in PPP space. Since we can never measure how well PPV and PPP structures match up in the observed Universe, we need to make these measurements using simulations, where complete information is available in both "spaces" (Goodman 2011).

We begin with a discussion of how observational effects can distort measurements of cloud properties in Section 1.1. In Section 2, we describe a technique to quantify how well an observed intensity feature corresponds to a real PPP structure, by measuring the partial overlap of density structures and observational features in PPV space. This technique leverages the dendrogram algorithm to decompose hierarchical cloud structure, by tracking how iso-intensity contour surfaces nest inside one another (Rosolowsky et al. 2008). By performing radiative transfer calculations on two numerical hydrodynamic models, we construct synthetic ¹²CO and ¹³CO observations of molecular clouds (Section 2.1). We use the COMPLETE observations of Perseus as a point of comparison throughout this work (Ridge et al. 2006) and compare these simulations to Perseus in detail in Section 2.3. We apply our analysis in Section 3 to study how well measurements of intrinsic cloud properties—mass, size, velocity dispersion, and the virial parameter—can be recovered from observations.

1.1. Overview of Observational Effects

We begin with a broad overview of how cloud information can be distorted during the observation process. A spectral line observation of a molecular cloud can be thought of as a transformation from a set of intrinsic quantities—density, velocity, temperature, chemical abundance—to a map of intensity in PPV. Information about the original cloud structure is lost during several steps of this transformation. The first problem is the projection from PPP space to the PPV space of the observation. This step is described by the following equation:

$\begin{equation} \rho _{\rm PPV}(x, y, v) = \sum _{v_z(x, y, z) = v} \rho _{\rm PPP} (x, y, z) \left| \frac{\partial z}{\partial v_z(x, y, z)} \right|, \end{equation} \tag{ 1 }$

where ρ_PPV is the density of material in PPV space (g cm⁻² km⁻¹ s), ρ_PPP is the density in PPP (g cm⁻³), and the derivative is the standard Jacobian used when transforming densities between coordinate systems.

From the perspective of feature identification, two aspects of this transformation break the correspondence between PPP structures and PPV features. First, distinct positions along the same line of sight that move at similar velocities will project to the same region in PPV. Thus, a feature in PPV may sample two or more density structures. This is the problem of superposition, and is illustrated in Figure 1(a). Second, spatial variations in v_z affect the gradient term in Equation (1), and can modulate the ρ_PPV field independently of ρ_PPP. In other words, a single density structure can map to multiple velocity-induced PPV features. This is shown schematically in Figure 1(b).

**Figure 1.** Schematic representation of superposition and velocity-induced structures. Colors indicate velocity. Left: three PPP structures (top) merge into 2 PPV structures (bottom), due to the similar velocity of the front and back structures. Right: a single density structure with internal velocity gradients (top) splits into two PPV structures (bottom).
Download figure:
Standard image High-resolution image

In addition to projection, observations are also subject to chemical and radiative transfer effects, which further distort the intensity field from the density field via spatially variable excitation, abundance, ionization, and opacity conditions (Bell et al. 2006; Lee et al. 2013; Pineda et al. 2008). The ρ_PPV field determines the column density and, along with the temperature, the collision rate of the gas. Both of these affect the intensity field—the column density sets the opacity and can obscure background features, while the collision rate affects the excitation state and emissivity of the gas. Finally, observations are subject to noise and spatial filtering, which further degrade the data, increasing opportunities for confusion.

The net effect of these phenomena is too complicated to characterize analytically. Instead, we turn to numerical simulations, where cloud properties can be compared before and after synthetic "observations." With simulations we also have the freedom to disable individual aspects of the observation process, to better isolate the influence of each factor.

1.2. Previous Work

Several authors have investigated the relationship between PPP structures and PPV structures, using different simulations and analysis techniques. An early study by Adler et al. (1992) investigated structures identified in longitude–velocity diagrams of a synthetic model of the Galaxy. They found that many of these identifications were superpositions of separate PPP regions. On a smaller scale, Ballesteros-Paredes & Mac Low (2002) simulated observations of molecular cloud clumps in local thermodynamic equilibrium. They measured a number of canonical relationships in both PPP and PPV, including the clump mass spectrum, size–linewidth relationship, and size-density relationship. They found that the amount of confusion due to superposition depends on both the strength and spatial scale of turbulent driving—if turbulence induces stronger or smaller-scale density perturbations, confusion worsens. Despite confusion problems, both of these papers recovered mass–size and size–linewidth scaling relationships similar to real clouds. Issa et al. (1990) also considered how cloud superposition affects the size–linewidth relationship in Galactic CO surveys, and demonstrated that the slope of the relationship is quite robust to crowding.

Gammie et al. (2003) analyzed the aspect ratios of molecular cloud clumps projected onto the sky. The observed distribution of cloud aspect ratios is affected both by projection into 2D, as well as superposition of distinct cloud features. Offner & Krumholz (2009) also studied the intrinsic and projected distribution of simulated core shapes, noting that the intrinsic triaxial shape of cores is lost during projection. Using a similar analysis, Jones & Basu (2002) attempted to invert the distribution of apparent axis ratios to recover the intrinsic shape distribution of clouds. This inversion assumed that observed cloud shapes are unaffected by superposition, however.

Pichardo et al. (2000) correlated PPV structures in MHD simulations with the original density field and velocity field. The features in their PPV maps resemble the patterns in the velocity field more than the density field. There were also more small-scale PPV structures than there were PPP structures. The authors attributed these small PPV structures to the velocity-induced structures shown in Figure 1(b).

Ostriker et al. (2001) cataloged "observed" structures in their 3D MHD simulations by identifying regions of contrast in 2D projections. They, too, noted that features identified in this way often consist of several superposed density structures. Even though their feature extraction process ignored line-of-sight velocity information, they reported that velocity information is often unable to disambiguate superpositions in 2D.

In a series of papers (Lazarian & Pogosyan 2000, 2004, 2006), Lazarian et al. developed a mathematical formalism to describe how to recover statistical properties of turbulence (namely, the turbulent velocity and density power spectrum) from observations. This approach differs from the previous references in that it does not focus on the reality of observed structures. Instead, it considers the structure functions of spectra and slices or slabs of PPV cubes. These papers derive the expected shape of observable spatial and spectral structure functions, for idealized turbulence. The advantage of this analysis is that it explicitly treats PPV superposition, though other factors like spatially varying excitation and abundance conditions are not treated.

An approach based on Principal Components Analysis has similarly been used to measure cloud statistics without identifying specific structures with clear boundaries (Heyer & Schloerb 1997; Brunt & Heyer 2002, 2013). This method decomposes PPV datacubes into linear superpositions of "eigenimages" with different spatial and spectral extents. These extents, derived from the spatial and spectral autocorrelation of the decomposed data, are used to reconstruct scaling relationships like the velocity power spectrum.

Recently, Shetty et al. (2010) carried out an analysis similar to the work by Ballesteros-Paredes & Mac Low (2002), and measured how projection affects the measurement of size–linewidth relationships in molecular cloud substructures. They identified structures using the dendrogram algorithm, which is explicitly designed to characterize hierarchical structures like molecular clouds. They concluded that superposition can change the power law scaling coefficient for the mass–size and size–virial relationships by ∼ ± 0.5, and the linewidth-size relationship by ∼ ± 0.05 (see Table 1 of that paper).

In summary, a broad range of work using a variety of numerical simulations has found that projection effects impact the study of cloud structures. Even though we can't explicitly measure this effect in real clouds, this work suggests it is important to quantify projection effects in the context of typical observed quantities. The analysis presented in this paper builds upon these previous studies in a few key aspects. First, we develop a method to systematically cross-match observed and real cloud structures, for all structures in a cloud simulation. This provides a more detailed view into how structure analysis is affected by factors like superposition. We also carry out a more detailed, non-LTE radiative transfer to better model the important effects of excitation and opacity.

2. METHODOLOGY

While the net effects of projection, chemistry, radiative transfer, noise, and resolution are very difficult to study analytically, the effects can be measured empirically in simulations. Here we describe our approach.

Consider a particular density structure, denoted by R_i. The structures in this work have clearly-defined boundaries (Table 1), so R_i is described by a collection of voxels (3D pixels). Likewise, let O_j denote the set of PPV voxels describing a particular observed feature. Using Equation (1), we can compute ρ_PPV(R_i), the distribution of R_i in PPV ignoring the rest of the cloud. We can also measure I(O_j), the intensity distribution of O_j in PPV. We then define the similarity between PPP structure i and PPV feature j as

$\begin{equation} S_{ij} = \frac{\sum \rho _{\rm PPV}(R_i) \times I(O_j)} {\left[\sum \rho _{\rm PPV}(R_i)^2 \times \sum I(O_j)^2 \right]^{1/2}}. \end{equation} \tag{ 2 }$

The summation is over all voxels in PPV.⁶ Conceptually, S_ij measures how much R_i and O_j overlap in PPV. The metric varies from 0 to 1 (0 indicating no overlap, and 1 indicating complete overlap). In other words, large values of S_ij suggest that O_j is the observational counterpart of R_i. Thus, we can match an observed feature to its likely counterpart in the density field via

$\begin{eqnarray} M_j &=& {\rm arg}\max_{i} S_{ij}, \end{eqnarray} \tag{ 3 }$

$\begin{eqnarray} q_j &=& \max _{i} S_{ij}, \end{eqnarray} \tag{ 4 }$

Table 1. Terminology

Term	Description
Density or PPP structure/feature	A contiguous volume in real, PPP space. Defined by a 3D density contour.
Intensity or PPV structure/feature	A contiguous region in PPV space. Defined by intensity contours in a spectral line observation.
Density field	The density at each PPP location in a simulation.
Velocity field	The (line-of-sight) velocity at each PPP location in a simulation.
Intensity field	The intensity of a spectral line at each PPV location in a simulation.
Confusion	General term for the imperfect correspondence between PPP structures and PPV structures.

Download table as: ASCII Typeset image

where M_j is the best-matching density counterpart for PPV structure O_j; it is the PPP structure i which maximizes S_ij. The quality factor q_j characterizes the quality of the match. When q_j is small, O_j has no correspondence to any density structure, and is an artifact.

Figure 2 depicts this process schematically, for a region with 3 PPP structures. These structures (panel a) superpose onto two PPV structures (panel b); the projection of each individual PPP structure into PPV is shown as a dotted line in panel b. The chart on the right shows the similarity matrix, as well as the match and quality for each observed structure. Structure O₂ matches to R₃ with high quality q = 0.9. Structure O₁, on the other hand, is a superposition of R₁ and R₂. It matches R₂ slightly better than R₁, but the corresponding quality is low: q = 0.5.

Equation (1) describes the projection of density from PPP to PPV. However, there are two subtleties that must be addressed when carrying out this projection. The first is that the simulations in this paper are discretely sampled on a grid. In general, two neighboring voxels along a line of sight can have velocity differences greater than the velocity sampling in PPV. If each PPP cell is assigned to the single nearest velocity bin, this leads to discretization artifacts where emission "skips over" some velocity channels. This is described in detail in Appendix B of Shetty et al. (2011b). We circumvent this by interpolating the density field as needed, so that the velocity jump between interpolated points is always one velocity channel.

Second, the simulations assume that the velocity is constant within a cell, when in fact there should be a range of velocities at that size scale. This stems both from the thermal motion of atoms, as well as microturbulence (the turbulence at spatial scales smaller than those resolved by the simulation). Thus, each PPP location in the simulation contains material at a variety of velocities. We account for this by convolving the ρ_PPV along the velocity dimension with a Gaussian of variance $\sigma ^2 = \sigma _{\rm thermal}^2 + \sigma _{\rm micro}^2$ , where σ_thermal is the thermal linewidth and σ_micro is the microturbulence listed in Table 2.

Table 2. Summary of each Simulation

	S11	O1
Box size	20 pc	25 pc
Simulation code	Zeus-MP	ORION
Gridding	256³	256³ + 4 levels of AMR refinement
Driven turbulence	Yes	Yes
Driving power spectrum	Uniform 1 < k < 2	Uniform 1 < k < 2
Gravity	No	Yes
B field	5.85 μG	0
Gas temperature	Variable (10–200K)	15 K
Chemistry	H, O, C	None
Background UV	2.7e-3 erg cm⁻² s⁻¹	No
Constant CO/H₂ abundance	No	1.75 e−4
¹²CO/¹³CO abundance	70	70
Radiative transfer code	RADMC 3D	RADMC 3D
Microturbulence	0.2 km s⁻¹	0.2 km s⁻¹
Metallicity	Solar	N/A
Mean number density (n_H)	100 cm⁻³	58 cm⁻³
Mach number	∼6	22
Isothermal	No	Yes
Output time(s)	5.7 Myr	2.5 Myr (with gravity)
Mass in stars	N/A	722 M_☉ (2.4%)

Download table as: ASCII Typeset image

Equations (3) and (4) suggest a strategy for investigating projection and other observational effects in detail. Given a simulation and a hypothetical observation (described in Section 2.1), we catalog both the PPP and PPV structures (Section 2.2). Then, we find M and compute q for all PPV structures in the simulation. These quantities allow us to investigate how well structures (and measurements of their properties) are recovered in these synthetic observations (Section 3). To the extent that any simulation resembles a real cloud (Section 2.3 and Appendix A), this analysis offers a way to quantify otherwise un-measurable observational effects in real data. In other words, for physical conditions represented by a simulation, we can use this machinery to quantify how well mapping out any particular set of spectral line in PPV lets us estimate basic cloud properties like mass, size, line width, and virial parameter.

2.1. Data Preparation

We have applied the similarity analysis described above to two cloud simulations. Each of these simulations is meant to broadly represent the conditions in a molecular cloud like Perseus (Ridge et al. 2006; Bally et al. 2008). However, the mean simulation temperature, density, and line-of-sight dimension may differ from the true values in Perseus by factors of two.

The first simulation, henceforth O1, is performed with the orion adaptive mesh refinement (AMR) code (Truelove et al. 1998; Klein 1999). The simulation assumes a simple isothermal equation of state, which means that it is scale-free for density and temperature (e.g., Offner et al. 2008). The simulation is produced following the same procedure in Offner et al. (2013), which we briefly summarize below.

The simulation domain begins with a uniform density, which we perturb with a random velocity field for two crossing times. The input field has a flat power spectrum for large wavenumbers, 1 < k < 2, and we normalize the perturbations to maintain a constant 3D Mach number, $\mathcal {M}=22$ . This Mach number was chosen to reproduce the observed velocity dispersion in Perseus. After the gas achieves a well-mixed turbulent state, we turn on self-gravity and allow collapse to proceed. The simulation has a 256³ base grid, four levels of AMR refinement, and employs periodic boundary conditions. New grids are added automatically to satisfy the Jeans criterion for a Jeans number (the ratio of cell size to the local jeans length) of 0.25 (Truelove et al. 1997). When the Jeans criterion is violated on level four within a collapsing region, a sink particle is introduced (Krumholz et al. 2004). Our similarity analysis is performed at half a freefall time, when ∼700 M_☉ is contained in sink particles (2.3% of the gas).

The second simulation (hereafter S11) is an updated version of a model originally presented in Shetty et al. (2011b) (specifically, the n100 simulation in that paper). It was generated using a modified version of the zeus-mp MHD code (Stone & Norman 1992a, 1992b; Norman 2000; Hayes et al. 2006). S11 differs from O1 in that it ignores gravity, includes a 5.85 μG magnetic field and includes treatments of the non-equilibrium heating and cooling of the gas, the penetration of UV radiation into the cloud, and also a simplified treatment of the formation and destruction of H₂ and CO. The original Shetty et al. (2011b) simulation used the chemical model presented in Glover et al. (2010), but the updated version presented here uses instead a treatment based on Nelson & Langer (1999), as described in Glover & Clark (2012). However, as explored in some detail in Glover & Clark (2012), this change in chemical networks does not significantly affect the CO distribution in the gas. Our updated version of the Shetty et al. (2011b) simulation also includes a number of improvements in the way in which the thermal evolution of the gas is modeled, as described in Appendix A of Glover & Clark (2012).

The S11 simulation begins with uniform density which is perturbed with a random turbulent velocity field for three turbulent crossing times. The input field is similar to that in the O1 simulation, and is normalized to maintain a constant 3D rms velocity dispersion of 5 km s⁻¹. Converting this value to a Mach number is complicated by the fact that the gas in the S11 simulation is not isothermal and hence has a spatially varying sound speed. The volume-weighted mean Mach number is relatively low, ${\cal M} \simeq 6$ , because much of the cloud volume is filled by warm, CO-poor gas with T ∼ 60–70 K. If, however, we compute ${\cal M}$ only for gas with more than 10% of its carbon in CO, we find a much higher value, ${\cal M} \simeq 14$ , as this gas is much colder, with T ∼ 10–20 K.

Table 2 summarizes the properties of each simulation.

We used the radiative transfer program RADMC-3D (Dullemond 2012) to generate synthetic observations of each simulation in ¹²CO (J = 1–0), ¹²CO (J = 3–2), and ¹³CO (J = 1–0), using the large-velocity-gradient (LVG) approximation (Sobolev 1957; Shetty et al. 2011a). The observations were gridded to a spatial resolution of 0.1 pc pixel⁻¹, and velocity resolution of 0.05 km s⁻¹. Finally, we added noise to each cube (0.6K, 0.15 K and 0.25 K for the ¹²CO (J = 1–0), ¹²CO (J = 3–2), and ¹³CO (J = 1–0) transitions, respectively). These are representative of the noise values of present-day cloud surveys in these transitions (Ridge et al. 2006; Davis et al. 2010).

2.2. Structure Identification

We used the dendrogram algorithm to catalog intensity structures in each synthetic observation, as well as the density structures in the PPP density fields. The dendrogram algorithm is described in detail in Rosolowsky et al. (2008). Briefly, each structure in a dendrogram corresponds to a surface of constant intensity (in the observation) or density (in the PPP cube). The name dendrogram refers to the fact that these surfaces are hierarchically nested inside each other and representable via tree diagrams (Figure 3). Dendrograms capture the hierarchical structure of molecular clouds—a clear advantage over non-hierarchical clump-finding algorithms—and dendrogram decompositions are not sensitively dependent on tuning parameters of the algorithm (Pineda et al. 2009).

**Figure 3.** Schematic representation of a 2D cloud (left) and its dendrogram decomposition (right). Each dendrogram structure is a closed contour in the image. The extension to 3D data is straightforward, but each structure corresponds to an iso-surface instead of a contour line.
Download figure:
Standard image High-resolution image

When constructing a dendrogram, the main freedom one has is the degree to which structures are further decomposed into nested substructures. This process is called "pruning" the dendrogram, since it amounts to controlling how many "branches" (structures) are in the decomposition. The main purpose of pruning is to suppress the extraction of insignificant structures that are poorly resolved or are possible noise fluctuations. A dendrogram constructed with no pruning assigns every local intensity or density maximum (including every noise spike) to a unique structure. The effect that pruning has on the statistical properties of a dendrogram has been studied in detail by Burkhart et al. (2013).

Each dendrogram in this work is pruned such that every leaf contains a local intensity maximum that is brighter than the neighboring 7 voxels in any direction. Each leaf (the brightest, most-compact structure in a hierarchy) also contains at least 800 voxels (for spectral line cubes in PPV) or 100 voxels (for density cubes in PPP) and contains a voxel that is at least 7σ brighter than the contour at which the leaf merges with its neighboring structure. The PPP dendrogram is pruned less heavily than the PPV dendrograms, yielding a catalog with more structures. This prevents PPV structures from being poorly matched to PPP structures simply because the density structure decomposition is too coarsely grained. One of the convenient aspects of dendrograms is that the boundaries of non-pruned structures are independent of the pruning; in other words, while pruning can add or remove structures from a catalog, it does not affect how the included structures are defined. Thus our choice of pruning has little effect on subsequent analysis, other than to exclude from consideration the smallest cloud substructures. We explore how sensitive our analysis is to our pruning choice in Section 3.4.

2.3. Comparison to Perseus

We compare synthetic CO observations of the simulations to the COMPLETE Perseus data (Ridge et al. 2006) using several diagnostics: namely, the distribution of column density, velocity dispersion, and line intensity in various CO transitions. Both the O1 and S11 simulations represent the general physical properties of Perseus. Appendix A provides details about how each quantity was extracted from the data. Neither simulation agrees with Perseus when these diagnostics are examined in detail, but they are the closest available approximations. We discuss the limitations imposed by the suitability of the simulations in Section 3.10.

Figure 4 shows isosurface renderings for Perseus, O1, and S11, in the ¹²CO (J = 1–0) transition. Isosurfaces are drawn at 3, 8, and 15 K. The O1 simulation (panel b) stands out from the other two panels in this figure. Compared to Perseus and S11, it has more space-filling emission at 3K and a lack of emission at 15 K.

To make the differences between the simulations and Perseus more precise, Figure 5 shows three statistical comparisons between the S11 simulation and Perseus: the distribution of column density, ¹²CO (J = 1–0)integrated intensity, and line-of-sight velocity dispersion. The simulation has a higher average column density than Perseus and fainter CO lines. To rough approximation, integrated line intensity increases with gas density, temperature, and velocity dispersion. Since the S11 simulation is at a higher column density than Perseus (panel a), the stronger lines in Perseus are probably due to hotter gas, higher turbulence, and/or poor modeling of CO abundance.

The velocity dispersion of the spatially-averaged spectrum is shown as a vertical line in Figure 5(c). This number is larger than the typical line-of-sight velocity dispersion (the histograms in panel c), due to spatial velocity gradients across the region. In other words, the linewidth of the spatially-averaged spectrum is different from—and larger than—the mean of the line-of-sight linewidth distribution. While the spatially-averaged velocity dispersion in S11 is comparable to Perseus, the individual line-of-sight velocity dispersions in Perseus are skewed towards higher values. We speculate that this is related to the characteristic depth of each line of sight; lines of sight in S11 often intersect a single, ∼1 pc-thick filament of material, which moves more coherently—i.e., with a smaller velocity dispersion—than the cloud as a whole. It may be that typical lines of sight in Perseus pass through material spread out over a longer column, and hence have larger dispersions. This may explain the higher integrated intensities in Perseus, since W = ∫Tdv.

The O1 simulation exhibits similar discrepancies (Figure 6). Its mean density and Mach number were chosen to match Figures 6(a) and (c). It better reproduces the column density distribution in Perseus by construction, but like S11, the integrated emission is too faint. The mode of the line-of-sight velocity dispersion matches Perseus moderately well (panel 6c), but Perseus has a longer tail of high-velocity dispersion material. The velocity dispersion of the cloud-averaged spectrum (dashed lines in 6c) is 50% larger in Perseus.

Turbulence in both simulations is arbitrarily driven, and this non-physical prescription does not reproduce statistics of the 1D line-of-sight velocity field in Perseus. One possibility for this discrepancy is that the simulations are not fully resolving the velocities on the smallest scales. Resolution studies of grid based codes indicate that on small scales the amplitude of the velocity power spectrum is larger in simulations with higher resolution (e.g., Vestuto et al. 2003). For the intermediate resolutions considered here, therefore, the velocity dispersions on the smallest scales may be underestimated. Additionally, some of the high-dispersion emission from Perseus coincides with the parsec-scale, stellar-driven shells studied in Arce et al. 2011 (in particular, Arce's CPS 4 and CPS 5). Neither simulation considers the effect of stellar feedback.

Lee et al. (2013) have also recently compared a spatially-truncated version of the S11 simulation to Perseus. They too note that the S11 simulation is under-luminous. They report a more extreme under-luminosity in W(¹²CO (J = 1–0)) of eight times relative to Perseus, though their truncation acts to further diminish the line intensity. They posit that velocity crowding is responsible for this discrepancy–the lower velocity dispersion in S11 makes superposition more likely, and optically thick ¹²CO features are more likely to obscure each other. Indeed, we find that the peak of the W(¹³CO (J = 1–0)) distributions in both O1 and S11 better reproduce the values in Perseus (Appendix A). The optical depth in W(¹³CO (J = 1–0)) is lower and should be less affected by velocity crowding. Nevertheless, Perseus still shows an excess of large ¹³CO (J = 1–0) intensities that neither simulation reproduces.

We hope that these discrepancies will motivate further efforts to generate simulated clouds which better agree with the statistical properties of Perseus. To that end, we discuss a suite of diagnostics in Appendix A, to facilitate standardized comparisons in the future. For the purposes of this paper, it is important to bear in mind that the "observed" properties in both simulations are under-luminous and under-dispersed compared to Perseus.

3. RESULTS

For each simulation, we synthesize PPV observations of ¹²CO (J = 1–0), ¹²CO (J = 3–2), and ¹³CO (J = 1–0) line emission. Figure 7 shows the match quality q for features identified in each line observation of the O1 simulation. Each plot shows the q (Equation (4)) of a structure (color) as a function of area and mean brightness. Several trends merit discussion. Most obvious in Figure 7 is the variation amongst the three CO tracers. Structures identified in ¹³CO (J = 1–0) are much better representations of the underlying density field than ¹²CO (J = 1–0) or ¹²CO (J = 3–2); ¹³CO (J = 1–0) structures are dominated by structures with q ⩾ 0.5, which are rare in either of the more space-filling, optically thick transitions. We expect that in synthetic spectral-line maps of much higher-density-tracing species than CO, such as NH₃ or N₂H⁺, overlap and superposition would be less of a problem. Similar trends in the dependence of q on the CO transition can also be seen in Figures 8 and 10.

**Figure 8.** Emission at a single velocity in each transition for O1, with (left) and without (right) noise, color-coded by match quality.
Download figure:
Standard image High-resolution image

Second, there is a weaker trend between structure size and q—there is a left-to-right color gradient in Figures 7(a)–(c). Smaller structures tend to be more deeply embedded in cloud material, and are more susceptible to chance occlusion by or superposition with other structures. The large-scale features of the cloud, on the other hand, are less susceptible to superposition. Note that the simulations do not include other clouds along the line of sight; real clouds suffer confusion from other regions in the Galaxy. The dependence of q on size is evident to various extents throughout Figures 7–19.

**Figure 9.** Same as Figure 7, but comparing the ¹²CO (J = 1–0) transition in the S11 and O1 simulations.
Download figure:
Standard image High-resolution image

**Figure 10.** Same as Figure 8, but for the S11 simulation.
Download figure:
Standard image High-resolution image

**Figure 11.** Same as Figure 7, but for the O2 simulation where opacity was disabled during radiative transfer.
Download figure:
Standard image High-resolution image

**Figure 12.** Same as Figure 7, but for the O1 simulation with less pruning.
Download figure:
Standard image High-resolution image

**Figure 13.** Scatter matrix of mass, size, velocity dispersion, and virial parameter for the ¹³CO (J = 1–0) transition of simulation O1. Points are color-coded by match quality. The black line is the linear fit to all points, while the blue line is for q > 0.5.
Download figure:
Standard image High-resolution image

**Figure 14.** Comparison between mass, size, linewidth, and virial parameter measurements (panels a, b, c, d) derived from PPV and PPP data, for the ¹³CO (J = 1–0) transition of O1. Points are colored by match quality, and scaled by structure size. The dashed lines are a factor of two above/below the solid 1:1 line.
Download figure:
Standard image High-resolution image

**Figure 15.** Same as Figure 14, but for the O1 simulation with no gravity.
Download figure:
Standard image High-resolution image

**Figure 16.** H₂ column density distribution in O1 at (a) t = 2.5 Myr, and (b) t = 0 Myr (the instant gravity is turned on).
Download figure:
Standard image High-resolution image

**Figure 17.** Same as Figure 14, for the ¹³CO (J = 1–0) transition of the S11 simulation.
Download figure:
Standard image High-resolution image

**Figure 18.** H₂ column density map of S11 (a), and the integrated ¹³CO (J = 1–0) maps with and without chemistry (b, c).
Download figure:
Standard image High-resolution image

**Figure 19.** Same as Figure 17, but for the S11 simulation with no chemistry.
Download figure:
Standard image High-resolution image

Finally and most subtly, low quality points tend to cluster towards smaller brightnesses at a given scale—the lowest-quality points in Figures 7(b)–(c) are skewed towards the lower envelope of points. As discussed above, low-quality structures correspond to superposition artifacts, or pseudo- (i.e., non-density) structures created by radiative transfer effects or spatial variation in the v_z field. Because of this, the size and intensity of artifacts depend on how "organized" these processes are; smaller and fainter artifacts are more probable since, in the case of superposition, they require only a partial overlap of two real structures or, in the case of velocity-induced structures, only a small scale organization in v_z.

Figure 8 shows a single PP slice in the O1 simulation. Again, the color scale gives the match quality for each structure.⁷ The interior box draws attention to one particularly crowded region. The ¹²CO (J = 1–0) simulation is most affected by confusion in this region, and has a lower average match quality. Line saturation tends to broaden features in ¹²CO (J = 1–0)such that the morphology in that transition is less representative of the true density field which, as the ¹³CO (J = 1–0) transition suggests, is more compact.

3.1. O1 versus S11

As a first comparison between the properties of the O1 and S11 simulations, Figure 9 compares the ¹²CO (J = 1–0) transitions for each simulation, and Figure 10 shows a color-coded PP slice of the S11 simulation. The O1 and S11 simulations show marked differences. The S11 simulation has overall better match quality. The S11 simulation also has a higher dynamic range of structure brightnesses; this is probably due to the fact that that simulation explicitly treated gas heating and CO dissociation, whereas the O1 simulation is isothermal and assumes a constant CO abundance. Heating and dissociation give the S11 simulation more freedom to affect line intensity (by raising the excitation temperature as well as the abundance of emitting material). CO is dissociated in low column density regions of S11, and the simulation has large regions devoid of emission.

3.2. Effect of Noise

Noise has been added to each synthetic observation, to match the noise levels in present-day cloud observations in these transitions (Section 2.1). This raises the following question: to what extent does noise make it more difficult to extract cloud features, and hence lower the match quality? To address this, we also show in Figure 8 the same PP slices without noise. Note that the presence or absence of noise has little bearing on the match quality of most structures. Instead, the lower match quality in the ¹²CO (J = 1–0) transition seems dominated by the high filling factor and opacity of emission, which crowds features in PPV and leads to superposition.

3.3. Disentangling Projection and Radiative Transfer Effects

The ability to recover structures varies with tracer; Figure 7 shows that structures in ¹²CO (J = 1–0) are most affected by projection problems, followed by ¹²CO (J = 3–2) and ¹³CO (J = 1–0). The latter two transitions trace higher densities and are optically thinner than ¹²CO (J = 1–0). It is unclear from Figure 7 alone whether the poor match quality in ¹²CO (J = 1–0) is the result of the higher filling factor or higher opacity in that line.

We can partially decouple the effects of filling factor and opacity by disabling absorption in the radiative transfer. This doesn't prevent crowding or superposition in PPV, but it does prevent structures from blocking radiation. If opacity is the primary problem with ¹²CO (J = 1–0) observations, then disabling absorption should increase the overall match quality.

To test the effects of filling factor alone, we perform a modified radiative transfer calculation on the O1 simulation, where opacity is disabled. The full equation of radiative transfer is given by

$\begin{equation} I_\nu = \int e^{-\tau (z)} B_\nu \left(T_{\rm ex}\left(z\right)\right) \alpha (z)\, dz, \end{equation} \tag{ 5 }$

where τ(z) is the optical depth to depth z, B_ν is the Planck function, T_ex(z) is the excitation temperature of the gas, and α the absorption coefficient. T_ex and α are functions of the gas density at each energy level.

For our modified radiative transfer calculation, we run RADMC-3D as normal to compute the level populations and hence T_ex and α throughout the simulation volume. However, we then integrate a modified equation of radiative transfer with no absorption:

$\begin{equation} \tilde{I}_\nu = \int B_\nu \left(T_{\rm ex}\left(z\right)\right) \alpha (z)\, dz. \end{equation} \tag{ 6 }$

This produces a modified version of the O1 synthetic observations, which we label as O2. The only difference between O1 and O2 is that O2 includes no absorption. Figure 11 compares the match quality for the ¹²CO (J = 1–0) transition in simulations O1 and O2. The O2 structures are markedly higher quality. Figure 11 indicts the e^−τ(z) term in Equation (5) as a primary reason for low match qualities in the ¹²CO (J = 1–0) transition.

Note that this experiment does not fully disable the effects of opacity. In addition to the e^−τ(z) term, opacity acts to increase the excitation temperature of the gas by absorbing radiation emitted by other parts of the cloud. Disabling this absorption could lead to de-excitation of parts of the cloud, and this spatially-varying excitation would partially decouple the intensity field from the density field and decrease the filling factor of emission.

3.4. Effect of Pruning

In Section 2.2, we described our pruning strategy—namely, we require that each structure contains a voxel 7σ above the ambient intensity and contains at least 800 voxels altogether. Figure 12 shows a less aggressive pruning strategy, where we relax the N > 800 voxel criterion to N > 400. There are more structures in this dendrogram (283 structures in the ¹³CO (J = 1–0) transition, compared to 191 in the original pruning). Compared to Figure 7(c), these additional points are concentrated at Areas <0.5 pc². We reiterate that the points in Figure 12 are a superset of Figure 7(c), and the extra points are substructures nested inside the structures from the original pruning.

On average, the new structures have modestly lower match qualities: 38% of the new structures have q < 0.5, compared to 27% of structures in the original pruning. However, we interpret Figure 12 as evidence that the reality or quality of dendrogram-identified structures is fairly insensitive to the details of pruning, provided that statistically-insignificant noise-spikes are not identified as structures.

3.5. Impact on Scaling Relations

What impact does confusion caused by projection and radiative transfer have on subsequent analyses? This is a problem-dependent question, and we focus here on the fairly common virial analysis. The virial parameter, often defined as $\alpha = 5 \sigma _v^2 R / {G M}$ , gives the approximate ratio of kinetic to gravitational potential energy (McKee & Zweibel 1992). The value α ∼ 2 denotes the approximate equipartition between these two energy terms and is often used to assess the boundedness of a given structure. However, the true virial state of an object is affected by several additional unobservable terms (for example, surface terms and magnetic energy; Ballesteros-Paredes 2006; Dib et al. 2007; Bertoldi & McKee 1992), and we emphasize that the α < 2 threshold is a crude proxy for boundedness. Furthermore, the virial analysis implicitly assumes that structures are roughly spherically-symmetric, which does not well-describe the larger features in a dendrogram of a filamentary cloud.

Figure 13 shows, for the ¹³CO (J = 1–0) transition of simulation O1, the relationship between size, velocity dispersion, mass, and virial parameter for each structure in the dendrogram. Each point is color-coded by match quality as before. Appendix B describes how each quantity is measured.

We also show a simple power law fit to the size–linewidth, mass–size, and virial–size relationship, which are frequently measured in molecular cloud studies. The black line shows the fit to all points, while the blue line shows structures with q > 0.5. The slopes of these lines essentially reproduce Larson's classical relationships (M ∼ R², V_rms ∼ R^0.5, α ∼ R⁰; Larson 1981). Ignoring the low-quality match structures affects the slope by ≲ 0.05.

Assessing the uncertainty in these scaling relationships is subtle. A naive least-squares error analysis suggests a small uncertainty for the scaling exponents (∼.025). However, these data points correspond to nested structures and are not independent of each other. Consequently, the least-squares error estimate is overly optimistic. We have experimented with different sensible strategies for pruning the dendrogram, as well as different definitions for the size of an irregular structure (see Appendix B). Varying these options can change the slope of the scaling relationships by ∼ ± 0.2 (see, for example, Table 3), and we feel this is a more appropriate estimate for how precisely the scaling relationships are constrained.

Table 3. Sensitivity of Mass–Size Relationship to Size Definition

Size Definition	Scaling Coefficient M ∼ r^a
Second moment^a,b	0.48
r = V^1/3	0.34
r = A^1/2 ^c	0.43

Notes. ^aValue obtained via least squares fit to the O1 data. ^bThe size definition used throughout this paper and described in Appendix B. ^cA is the area of the structure projected onto the sky.

Download table as: ASCII Typeset image

For the O1 simulation, then, filtering based on q does not significantly affect the scaling relationships one obtains from PPV data.

3.6. Parameters Compared as Measured in PPP and PPV

PPV-derived properties are most often used as approximations for properties of the (partially un-measurable) 6D spatial-kinematic state of the cloud. How accurate are these approximations?

Figure 14 compares, for the ¹³CO (J = 1–0) transition of O1, quantities measured in PPV with the equivalent measurement of each PPV structure's nearest PPP match (measured in PPP). The point sizes indicate structure size, and the dashed lines are a factor of two above and below the 1:1 line. In the lower right corner of each panel, we also report a few summary statistics: the geometric mean of the ratio of PPV/PPP measurements and the geometric standard deviation of this ratio. The geometric mean is defined as μ_g = (∏x_i)^1/N and the geometric standard deviation as $\sigma _g = \exp {(\sqrt{\vphantom{A^A}\smash{{({{1}/{N}) \sum {\ln (x_i / \mu _g)}}}}})}$ . The base-10 log of σ_g is the scatter about the ratio μ_g in dex. These numbers measure the fractional bias from and scatter about the 1:1 line, respectively. We report the geometric mean and standard deviation for all points, as well as the subset of points with q > 0.5. In this analysis, we exclude structures which touch the edge of the cube, since their full extent is not measured.

There are several features of Figure 14 worth commenting on. First, the strongest outliers are the reddest, lowest-quality PPV structures, shown as the red points in the upper-left corner of panels (a)–(c). These structures have no correspondence to any PPP structure. Because these artificial features cannot be sensibly matched to anything in the PPP cube, they are arbitrarily matched to the largest density structures and occupy the upper left corners of panels (a)–(c).

Second, structures with q > 0.5 are dispersed about the 1:1 line by σ_g = 1.4–1.7 (∼0.1–0.2 dex) in panels a–c. The act of filtering on q reduces scatter by 15–30% for mass and velocity. The reduction is larger for masses in panel a, but the dispersion in these points is dominated by the handful of outliers in the upper left corner.

Finally, the virial parameter plot (panel d) shows higher scatter for q > 0.5 structures—0.34 dex, or a factor of 2.2—than do the mass, size, or linewidth plots. The individual errors in the mass, size, and velocity dispersion measurements compound when measuring the virial parameter, and this property is the least-faithfully recovered.

To summarize Figure 14, radiative transfer and projection effects produce a factor of 1.4–2 uncertainty on kinematic properties derived from the O1 simulation using ¹³CO (J = 1–0) emission.

3.7. Effect of Gravity

The O1 simulation includes gravity. Gravity acts to gather and collapse gas, which may create locally crowded regions of high confusion. On the other hand, gravitational collapse should also gather diffuse material on large scales. This may decrease the filling factor on large scales, in mitigate confusion. To probe how important each of these effects are, Figure 15 shows the equivalent set of comparisons of O1 at an earlier epoch, at the instant gravity is enabled. This timestamp captures the steady-state turbulent structure of the O1 simulation, before gravity has had an influence. The column density of the simulation at the original and earlier timestamp is shown in Figure 16.

Gravity causes structure collapse at small scales, producing more dense, small regions (Figure 16(a)). Before gravity is enabled, structures on average are larger, more diffuse, and overlap more. This effects the kinematic properties in Figure 15 in the following ways: without gravity, there are fewer structures overall (147 versus 191 at the original simulation time). Because structures overlap more without gravity, there is a greater fraction of q < 0.5 structures (37% versus 28%). Finally, the virial parameter averaged over all structures is 5.8 without gravity, compared to 3.0 for the original O1 simulation.

Despite the moderately worse confusion, the scatter and bias in Figures 14 and 15 are remarkably similar. In other words, the influence of gravity in the O1 simulation does not greatly impact the ability to recover physical properties.

3.8. Effect of Chemistry

A main difference between the O1 and S11 simulations is the inclusion of limited CO chemistry in S11. Spatial abundance variations in CO can decouple the CO density from the H₂ density. This, in turn, can break the correspondence between CO intensity and PPP density.

Figure 17 shows the same comparisons for the S11 simulation. Panels b and c have a comparable amount of bias and scatter as the previous figure for O1, albeit with fewer structures overall. The most dramatic difference between this plot and Figure 14 is the mass comparison in panel a. The points in the S11 simulation are shallower than the 1:1 line—while PPV-structures recover sensible masses below ∼100 M_☉, M_PPV overestimates M_PPP for larger structures. This directly affects the virial plot in panel d, which exhibits a bias towards PPV-underestimates of α.

One may also wonder if the approximation to estimate mass from intensity (given by Equation (B6)) contributes to the mass discrepancy. This is unlikely. Masses are estimated by measuring the mass-to-light ratio of the brightest 5% of the pixels and using this as a conversion factor. This conversion factor underestimates the mass-to-light ratio for low-column density lines of sight, where gas is sub-thermally excited and emits inefficiently. Adopting a more appropriate mass-to-light ratio for these lines of sight would actually lead to even larger M_PPV measurements, which exacerbates the discrepancy.

The mass discrepancy for large structures is caused by CO dissociation at low column densities, which S11 includes but O1 does not. Figure 18(b) shows that the S11 simulation contains large regions devoid of CO. There is H₂ in these regions (panel a), but little CO due to dissociation. In other words, the topologies of CO and H₂ gas partially diverge on large scales. It is thus more difficult to find exact PPP-equivalents of large-scale PPV structures in S11. This leads to discrepancies which, evidently, are most pronounced for mass measurements. This doesn't explain why the bias is towards PPV mass overestimates, as opposed to underestimates. We do not have a simple explanation for the direction of the bias. For whatever reason, PPV structures are better matched (as quantified by Equation (3)) to slightly too-small PPP structures than they are to slightly too-large PPP structures. We speculate this has to do with the detailed topology of gas in PPP versus PPV.

We can verify that chemistry in S11 causes the mass discrepancy by repeating the analysis on a version of S11 without chemistry. To do this, we re-compute the radiative transfer on S11, assuming a constant temperature of 15 K, and a constant CO/H₂ abundance of 10⁻⁴. The resulting integrated intensity map is shown in Figure 18(c), and the kinematic comparisons in Figure 19. There are more structures in this simulation due to extra CO emission. Furthermore, the mass-mass plot follows the 1:1 line much better.

At the largest scales, the S11 simulation (which does not include gravity) has a virial parameter of α_PPV ∼ 9. This is higher than the O1 simulation, for which the largest-scale structures have a virial parameter of α_PPV ∼ 1. The S11 structures with the smallest values of α_PPV ∼ 1 are in fact cause for concern, since they indicate that gravitational and kinetic energies are comparable in magnitude. Because S11 does not include the effects of gravity, the dynamical nature of these structures is less faithfully modeled.

3.9. Assessing Boundedness

The virial parameter is most often used to estimate the gravitational boundedness of a structure, with values of α < 2 interpreted to indicate that a structure is bound. This interpretation is problematic, as it ignores other forces and oversimplifies the role of turbulence as a support against collapse (Bertoldi & McKee 1992; Ballesteros-Paredes 2006). Our kinematic analysis of S11 and O1 add another cause for concern: measurements of the virial parameter based on PPP and PPV data differ by a factor of two to three, and cluster around α_PPV = 1–5. This implies that, using CO data alone, it is unclear on which side of the α_PPP = 2 boundary many structures fall.

To illustrate this, we repeat an analysis similar to Goodman et al. (2009), who measured the fraction of low-virial parameter structures for the L1448 subregion of Perseus as a function of size.

For each simulation, we assign a virial parameter to each voxel according to the smallest structure to which that voxel belongs. Next, we bin the structures by size, and in each bin, make a mask of all the pixels associated with these structures; we refer to this set as $\mathcal {S}$ . Finally, we compute the fraction of emission in these pixels with α_PPV < 2:

$\begin{equation} f = \frac{\sum {\lbrace L(\boldsymbol {r}) | \boldsymbol {r} \in \mathcal {S}, \alpha (\boldsymbol {r}) < 2}\rbrace }{\sum {\lbrace L(\boldsymbol {r}) | \boldsymbol {r} \in \mathcal {S}}\rbrace } \end{equation} \tag{ 7 }$

We plot this fraction as a function of size for O1, S11, Perseus, and L1448 in Figure 20. Note that the L1448 plot is slightly different from Figure 4 in Goodman et al. (2009) because we use a different scheme for measuring the fraction of α_PPV < 2 emission.⁸ Also, remember that α_PPV underestimates α_PPP in the S11 simulation, pushing the line higher than it would otherwise be.

For the O1 and S11 simulations in Figures 20(a) and (b), we plot the relationship for all structures (black), as well as those with high match qualities (q > 0.5, blue). Because estimates of α in PPV are scattered by a factor of ∼2 about the corresponding PPP measurements, the grey bands show the range of possible values the black line can take if each value of α is mis-estimated by a factor of two. Because so many structures fall within a factor of two of α_PPV = 2, the grey band covers a large swath of the plot. Thus, in addition to the conceptual problems associated with inferring boundedness from the value of α_PPV, there is an intrinsic observational ambiguity associated with reliably determining structures to be above or below α = 2.

3.10. Generalizing to Real Data

As discussed in Section 2.3, the O1 and S11 simulations do not reproduce several statistical properties of Perseus. In particular, both simulations tend to have lower line-of-sight velocity dispersions than Perseus. This may act to increase superposition effects in the simulations since the material is more crowded in PPV space. Similarly, the lack of external radiation fields in O1 (with no chemistry) suppresses dissociation of low-density CO, producing an artificially high filling factor of emission and greater likelihood for superposition. Clouds with significant excitation variation due to external heating may have less PPV superposition of features.

As the field of molecular cloud simulation and synthetic observation advances, we should be able to make new quantitative estimates of the degree to which various tracers are confused under different physical and observing conditions. The suite of diagnostics presented in Appendix A enables a standardized comparison between a simulated and observed dataset. These comparisons address the main statistical cloud properties most relevant for superposition analysis—column density, intensity, and linewidth—and can help to assess how well a given simulation acts as a surrogate for studying unobservable projection effects in a real dataset.

4. CONCLUSION

Intensity features in molecular cloud observations do not always correspond neatly to real density structures. The degree of correspondence is difficult to assess observationally or characterize analytically, but it can be measured in simulations. Such a comparison helps develop intuition about problems when interpreting real datasets.

We have conducted such a comparative analysis in this paper, presenting a new technique to cross-match PPP density structures with PPV intensity structures in synthetic molecular cloud observations. This gives a structure-by-structure assessment of how well cloud properties are recovered in observations. In particular, we find that:

1.
Structures traced in CO are more distorted in observations of more space-filling emission, so that the ¹²CO (J = 1–0) transition shows the most severe effects of overlap, while ¹²CO (J = 3–2) is less affected, and ¹³CO (J = 1–0) gives the most faithful representation of PPP structures in PPV space. This is primarily due to the opacity of the lines, which obscures density structures in the back of the cloud.
2.
Comparing size, mass, velocity dispersion, and virial parameter as measured in PPP (real) and PPV (observed) space, we find that size, mass, and velocity dispersion can usually be recovered to within 40%. Measurements of the virial parameter have a larger scatter of 0.3 dex (a factor of two).
3.
The uncertainty in recovering the virial parameter from CO measurements imposes an unavoidable ambiguity about the energy balance of many cloud structures. In particular, it is often ambiguous to which side of the α = 2 threshold most cloud substructures fall. Thus, assessing the relative dominance of gravitational versus kinetic energy is difficult, as is assessing boundedness.
4.
In the simulations studied here, most molecular cloud structures have PPV-measured virial parameters within a factor of two of α_PPV = 2. Thus, if projection effects induce a factor of two uncertainty on α, there is a large ambiguity regarding which substructures in a cloud are "bound" in the sense that α < 2.
5.
Gravity can act to modestly reduce confusion, by gathering material into more compact, less-overlapping structures. However, this does not have a significant impact on the precision to which intensity structures can be recovered from CO measurements.
6.
The primary impact of chemistry is lower the abundance of CO at low column densities and create excitation temperature variations. This reduces the optical depth and, hence, reduces the amount of confusion, but it may also decouples the topology of CO emission from the underlying H₂ density. This most heavily affects structures with M ≳ 100 M_☉, leading to factors of 2–3 discrepancies between PPV-derived and PPP-derived masses.

Simulations can be powerful probes of otherwise-unobservable phenomena that affect real data. However, conclusions drawn from such analyses are limited by how well simulations approximate the observed properties of real clouds. The simulations in this work do not reproduce several of the details of the emission properties of Perseus (in particular the characteristic brightness and velocity dispersion of CO lines). We conclude that simulations which initially appear to be qualitatively similar to an observed cloud can be surprisingly different in detail. The direct comparison of simulations and observations is fraught with subtleties, and much care must be taken to obtain true quantitative agreement. We advocate for future studies to examine PPP-PPV issues in more detail, including the production of simulations that are more representative of well-characterized molecular clouds like Perseus.

We thank Jens Kauffmann, Lukas Konstandin, Ralf Klessen, Eve Ostriker, Erik Rosolowsky, and Mark Heyer (the referee), whose comments improved this manuscript. Support for this work was provided by NASA through Hubble Fellowship grant #HF-51311.01 awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, INC., for NASA, under contract NAS 5-26555 (SSRO). RS and SG acknowledge financial support from the Deutsche Forschungsgemeinschaft (DFG) via SFB 881 "The Milky Way System" (sub-projects B1 and B2). This material is based in part upon work supported by the National Science Foundation under Grant Number AST-0908159.

APPENDIX A: DIAGNOSTICS

This paper uses simulations as surrogates for molecular clouds like Perseus, to better understand how unobservable effects like superposition affect subsequent analyses. These results generalize to real data only to the extent that the simulations share the same observable properties as real clouds. These simulations are approximate analogs to molecular clouds like Perseus. However, they still show several discrepancies with Perseus when examined in detail. We present several diagnostics in this appendix, in part to define a standard set of criteria that can be used to evaluate the observational applicability of cloud simulations. We also propose a way to reduce each diagnostic to a single score from 0–1, to more easily communicate how well a given simulation reproduces a particular property of a real cloud observation.

Figures 21 and 22 summarize the diagnostic comparisons for the O1 and S11 simulations.

**Figure 21.** Grid of diagnostic comparisons for the O1 simulation.
Download figure:
Standard image High-resolution image

**Figure 22.** Grid of diagnostic comparisons for the S11 simulation.
Download figure:
Standard image High-resolution image

A.1. Data Filtering

The metrics that follow measure properties along lines-of-sight. As such, we try to focus only on lines-of sight with substantial cloud emission and mask out regions with little cloud material. We base our masking on the process described in Pineda et al. (2008) and require each line of sight to satisfy the following inequalities:

1.
The peak ¹²CO (J = 1–0) line intensity is at least 10σ above the T = 0.
2.
The peak ¹³CO (J = 1–0) line intensity is at least 5σ above T = 0.
3.
The velocity dispersion of ¹²CO (J = 1–0) is at least 0.8 × the velocity dispersion of ¹³CO (J = 1–0).

These cuts are applied both to the Perseus data and to each simulation. The first two cuts are self explanatory. Pineda et al. proposed the last cut to further filter noisy or pathological lines of sight; the rationale is that, since ¹²CO is more abundant and opaque than ¹³CO, it should always have a larger spatial and kinematic extent.

A.2. Column Density Distribution

The distribution of column densities is reasonably well-characterized for nearby clouds: near-infrared extinction measurements trace column densities across the range $10^{20} \lesssim N_{\rm H_2} [{\rm cm}^{-2}] \lesssim 10^{23}$ , and far infrared dust emission probes higher column densities (Goodman et al. 2009; Kelly et al. 2012; Lombardi 2009; Kainulainen et al. 2009).

Most cloud column density distributions are approximately log-normal, with mean column densities of $N_{\rm H_2} \sim 10^{21}$ cm⁻² and width parameters σ ∼ 0.3–0.5. In addition, some clouds (especially those currently undergoing star formation) display excess power-law tails at column densities above ∼3 × 10²¹ cm⁻² (Goodman et al. 2009; Kainulainen et al. 2009).

The column density distribution is shown in Figure 21(a) for O1, and 22(a) for S11.

A.3. Integrated Intensity Distribution

The distribution of line-of-sight integrated intensity W = ∫I(x, y, v) dv is also straightforward to compute from spectral line observations. Its distribution is shown in Figures 21(b) and 22(b) for ¹²CO (J = 1–0), and Figures 21(c) and 22(c) for ¹³CO (J = 1–0).

A.4. Distribution of Peak Intensity

The distribution of peak line-of-sight intensity is a crude measure of the excitation state of the gas; it breaks the degeneracy between excitation state and column density in the integrated line intensity. Its distribution is shown in Figures 21(d) and 22(d) for ¹²CO (J = 1–0), and Figures 21(e) and 22(e) for ¹³CO (J = 1–0).

A.5. Velocity Dispersion Distribution

Likewise, we can compute the distribution of line-of-sight velocity dispersions. We compute the velocity dispersion by computing the intensity-weighted second moment of velocity along each line of sight. Since the second moment is sensitive to faint emission at large velocity offsets, we only consider pixels 3σ above the background. Its distribution is shown in Figures 21(f) and 22(f) for ¹²CO (J = 1–0), and Figures 21(g) and 22(g) for ¹³CO (J = 1–0).

A.6. Joint Distribution of Column Density and Line Intensity

The previous diagnostics are 1D distributions, and say nothing about the correlation among different quantities. The joint distribution of line intensity and column density is particularly interesting, since the ratio of these quantities defines the much-studied X-factor. Higher line intensities at a given column density indicate higher excitation levels, lower opacity, greater abundance of the exciting molecule, and/or greater linewidth (if the line is opaque). The joint distributions are shown in Figures 21(h) and 22(h) for ¹²CO (J = 1–0), and Figures 21(i) and 22(i) for ¹³CO (J = 1–0).

A.7. Scoring Diagnostics

Each of the above diagnostics can be converted into a numerical score, to quickly summarize how well a given simulation reproduces a given diagnostic. We base our score on the Kuiper statistic for two cumulative distribution functions:

$\begin{equation} K = \max {\rm \left(CDF_A - CDF_B \right) + \max \left(CDF_B - CDF_A \right). } \end{equation} \tag{ A1 }$

The Kuiper statistic is a modification of the well-known Kolmogorov–Smirnov statistic, and is more sensitive to discrepancies in the tails of distributions (see the discussion in Section 14.3.4 of Press et al. 2007).

For every comparison of 1D distributions, we define the score as 1 − K, where smaller scores indicate less similarity between the simulation and Perseus.

There are a few ways to generalize the Kuiper statistic to the 2D joint distribution of column density and line intensity (see Section 14.8 of Press et al. 2007). For each of these 2D distributions, we compute 4 cumulative distribution functions

$\begin{eqnarray} {\rm CDF}_1(X, Y) &=& P(x < X, y < Y), \end{eqnarray} \tag{ A2 }$

$\begin{eqnarray} {\rm CDF}_2(X, Y) &=& P(x > X, y < Y), \end{eqnarray} \tag{ A3 }$

$\begin{eqnarray} {\rm CDF}_3(X, Y) &= &P(x < X, y > Y), \end{eqnarray} \tag{ A4 }$

$\begin{eqnarray} {\rm CDF}_4(X, Y) &=& P(x > X, y > Y), \end{eqnarray} \tag{ A5 }$

compute the K statistic for each CDF and save the largest statistic. Our final score is defined as 1 − K_max/2. The factor of 2 correction is included because, in 2D, the Kuiper statistic varies between 0–2 whereas, in 1D, it varies between 0–1.

Table 4 summarizes these scores for the O1 and S11 simulation, using Perseus as the benchmark. We encourage other researchers to generate molecular cloud simulations that better reproduce these observational diagnostics.

Table 4. Diagnostic Scores for the O1 and S11 Simulations

Category	O1 Score	S11 Score
Column density	0.87	0.52
W(¹²CO 1-0)	0.36	0.47
W(¹³CO 1-0)	0.68	0.53
¹²CO 1-0 velocity dispersion	0.84	0.47
¹³CO 1-0 velocity dispersion	0.91	0.41
Peak ¹²CO 1-0 intensity	0.68	0.77
Peak ¹³CO 1-0 intensity	0.75	0.84
N_col vs. W(¹²CO 1-0)	0.60	0.50
N_col vs. W(¹³CO 1-0)	0.78	0.53

Download table as: ASCII Typeset image

APPENDIX B: EXTRACTING CLOUD PROPERTIES FROM DENDROGRAMS

The dendrogram algorithm defines a structure as a specific set of connected voxels; we denote this set as Ω. The intensity value at a given location $\boldsymbol {r}$ is denoted as $I(\boldsymbol {r})$ (this corresponds to the density for structures in a PPP cube). Here we describe how we measure properties from such a structure.

In all measurements, we define structures by contour surfaces and assume the structure ends at this boundary; that is, we adopt the "bijection paradigm" discussed in Rosolowsky et al. (2008). This assumption "clips" the low-intensity wings of structures embedded in ambient emission.

Location. We compute the intensity-weighted first moment of each structure to define its center.

$\begin{equation} \boldsymbol {\mu } = \frac{\sum _\Omega {I(\boldsymbol {r}) \cdot \boldsymbol {r}}}{\sum _\Omega {I(\boldsymbol {r})}}. \end{equation} \tag{ B1 }$

Orientation. We compute the three moments of inertia of I; these vectors give the direction of greatest and smallest elongation. We project the direction of greatest elongation onto the PP plane, which defines the structure's major axis $\hat{r}_{{\rm maj}}$ . The minor axis $\hat{r}_{{\rm min}}$ is perpendicular to this.

Size Scale. We define the extent of each structure along the major and minor axes to be the intensity-weighted second moment:

$\begin{eqnarray} \ell _{{\rm maj}}^2 &=& \frac{\sum _{\Omega }{I(\boldsymbol {r}) \cdot [(\boldsymbol {r} - \boldsymbol {\mu }) \cdot \hat{r}_{{\rm maj}}]^2}}{ \sum _\Omega {I(\boldsymbol {r})}}, \end{eqnarray} \tag{ B2 }$

$\begin{eqnarray} \ell _{{\rm min}}^2 &=& \frac{\sum _{\Omega }{I (\boldsymbol {r}) \cdot [(\boldsymbol {r} - \boldsymbol {\mu }) \cdot \hat{r}_{{\rm min}}]^2}}{ \sum _\Omega {I(\boldsymbol {r})}}, \end{eqnarray} \tag{ B3 }$

$\begin{eqnarray} \ell _r &=& \sqrt{\ell _{{\rm maj}} \times \ell _{{\rm min}}} \end{eqnarray} \tag{ B4 }$

$\begin{eqnarray} A &=& \ell _r^2. \end{eqnarray} \tag{ B5 }$

Another common method for measuring the size scale of irregular PPV structures is to measure the area by counting the number of distinct (X, Y) pixels that a structure occupies, and defining $\ell _r^{\prime } = \sqrt{A/\pi }$ . $\ell _r^{\prime }$ tends to be about 50% larger than ℓ_r, since the latter measure is intensity-weighted, and structures are usually centrally-condensed.

When measuring the virial parameter, we multiply ℓ_r by 1.91 to correct for this central concentration. This is the same factor applied and discussed in Rosolowsky et al. (2008).

Velocity Dispersion. We define the velocity dispersion v_rms as the second moment of the intensity distribution along the velocity direction.

Mean Intensity. We simply compute the mean of the intensity for all voxels belonging to a structure Ω.

Mass. For PPP structures, mass can be computed directly by integrating the density field. For PPV structures, we assume that a structure's mass is linearly proportional to its integrated intensity (that is, we adopt the "X-factor" assumption; Pineda et al. 2008). Such a proportionality exists if emission is optically thin and the emitting molecule has a constant abundance and excitation temperature. None of these assumptions holds for the simulations in this paper, and the X-factor varies as a function of position. Furthermore, since these simulations are under-luminous compared to real clouds, it would be unwise to use standard X-factors quoted in the literature. Instead, we set the conversion factor individually for each simulation, to best recover the input mass from synthetically-observed bright emission. We look at the brightest 5% of the lines-of sight, and define X^syn as the mean of the ratio of surface density/integrated CO intensity in these pixels. We then estimate mass as

$\begin{equation} M_{\rm PPV} = \sum _{\Omega } I(\boldsymbol {r}) \delta v \delta x^2 X^{\rm syn}, \end{equation} \tag{ B6 }$

where δv is the velocity width of a pixel, and δx is the length of a pixel.

The brightest 5% of pixels represent the densest regions of each simulation, where the opacity is highest. The X factor derived from these pixels tends to over-estimate masses from less-opaque but equally-excited lines of sight, with lower mass-to-light ratios. Likewise, it underestimates the mass for the faintest lines of sight, where CO is sub-thermally excited and the mass-to-light ratio is large.

QUANTIFYING OBSERVATIONAL PROJECTION EFFECTS USING MOLECULAR CLOUD SIMULATIONS

Article metrics

Permissions

Author e-mails

Author affiliations

Author notes

Dates

ABSTRACT

1. INTRODUCTION

1.1. Overview of Observational Effects

1.2. Previous Work

2. METHODOLOGY

2.1. Data Preparation

2.2. Structure Identification

2.3. Comparison to Perseus

3. RESULTS

3.1. O1 versus S11

3.2. Effect of Noise

3.3. Disentangling Projection and Radiative Transfer Effects

3.4. Effect of Pruning

3.5. Impact on Scaling Relations

3.6. Parameters Compared as Measured in PPP and PPV

3.7. Effect of Gravity

3.8. Effect of Chemistry

3.9. Assessing Boundedness

3.10. Generalizing to Real Data

4. CONCLUSION

APPENDIX A: DIAGNOSTICS

A.1. Data Filtering

A.2. Column Density Distribution

A.3. Integrated Intensity Distribution

A.4. Distribution of Peak Intensity

A.5. Velocity Dispersion Distribution

A.6. Joint Distribution of Column Density and Line Intensity

A.7. Scoring Diagnostics

APPENDIX B: EXTRACTING CLOUD PROPERTIES FROM DENDROGRAMS

Footnotes

QUANTIFYING OBSERVATIONAL PROJECTION EFFECTS USING MOLECULAR CLOUD SIMULATIONS

Article metrics

Permissions

Share this article

Author e-mails

Author affiliations

Author notes

Dates

ABSTRACT

1. INTRODUCTION

1.1. Overview of Observational Effects

1.2. Previous Work

2. METHODOLOGY

2.1. Data Preparation

2.2. Structure Identification

2.3. Comparison to Perseus

3. RESULTS

3.1. O1 versus S11

3.2. Effect of Noise

3.3. Disentangling Projection and Radiative Transfer Effects

3.4. Effect of Pruning

3.5. Impact on Scaling Relations

3.6. Parameters Compared as Measured in PPP and PPV

3.7. Effect of Gravity

3.8. Effect of Chemistry

3.9. Assessing Boundedness

3.10. Generalizing to Real Data

4. CONCLUSION

APPENDIX A: DIAGNOSTICS

A.1. Data Filtering

A.2. Column Density Distribution

A.3. Integrated Intensity Distribution

A.4. Distribution of Peak Intensity

A.5. Velocity Dispersion Distribution

A.6. Joint Distribution of Column Density and Line Intensity

A.7. Scoring Diagnostics

APPENDIX B: EXTRACTING CLOUD PROPERTIES FROM DENDROGRAMS

Footnotes