INCREASING EVIDENCE FOR HEMISPHERICAL POWER ASYMMETRY IN THE FIVE-YEAR WMAP DATA

, , , , , and

Published 2009 June 17 © 2009. The American Astronomical Society. All rights reserved.
, , Citation J. Hoftuft et al 2009 ApJ 699 985 DOI 10.1088/0004-637X/699/2/985

0004-637X/699/2/985

ABSTRACT

Motivated by the recent results of Hansen et al. concerning a noticeable hemispherical power asymmetry in the Wilkinson Microwave Anisotropy Probe (WMAP) data on small angular scales, we revisit the dipole-modulated signal model introduced by Gordon et al.. This model assumes that the true cosmic microwave background signal consists of a Gaussian isotropic random field modulated by a dipole, and is characterized by an overall modulation amplitude, A, and a preferred direction, $\hat{p}$. Previous analyses of this model have been restricted to very low resolution (i.e., 3fdg6 pixels, a smoothing scale of 9° FWHM, and ℓ ≲ 40) due to computational cost. In this paper, we double the angular resolution (i.e., 1fdg8 pixels and 4fdg5 FWHM smoothing scale), and compute the full corresponding posterior distribution for the five-year WMAP data. The results from our analysis are the following: the best-fit modulation amplitude for ℓ ⩽ 64 and the ILC data with the WMAP KQ85 sky cut is A = 0.072 ± 0.022, nonzero at 3.3σ, and the preferred direction points toward Galactic coordinates (l, b) = (224°, − 22°) ± 24°. The corresponding results for ℓ ≲ 40 from earlier analyses were A = 0.11 ± 0.04 and (l, b) = (225°, − 27°). The statistical significance of a nonzero amplitude thus increases from 2.8σ to 3.3σ when increasing ℓmax from 40 to 64, and all results are consistent to within 1σ. Similarly, the Bayesian log-evidence difference with respect to the isotropic model increases from Δln E = 1.8 to Δln E = 2.6, ranking as "strong evidence" on the Jeffreys' scale. The raw best-fit log-likelihood difference increases from $\Delta \ln \mathcal {L} = 6.1$ to $\Delta \ln \mathcal {L} = 7.3$. Similar, and often slightly stronger, results are found for other data combinations. Thus, we find that the evidence for a dipole power distribution in the WMAP data increases with ℓ in the five-year WMAP data set, in agreement with the reports of Hansen et al.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The question of statistical isotropy in the cosmic microwave background (CMB) has received much attention within the cosmological community ever since the release of the first-year Wilkinson Microwave Anisotropy Probe (WMAP; Bennett et al. 2003a) in 2003. The reasons for this are twofold. On the one hand, the current cosmological concordance model is based on the concept of inflation (Starobinsky 1980; Guth 1981; Linde et al. 1982; Mukhanov et al. 1981; Starobinsky 1982; Linde et al. 1983, 1994; Smoot 1992; Ruhl 2003; Rynyan 2003; Scott 2003), which predicts a statistically homogeneous and isotropic universe. Since inflation has proved highly successful in describing a host of cosmological probes, most importantly the CMB and large-scale power spectra, this undeniably imposes a strong theoretical prior toward isotropy and homogeneity.

On the other hand, many detailed studies of the WMAP sky maps, employing higher-order statistics, have revealed strong hints of both violation of statistical isotropy and non-Gaussianity. Some early notable examples include unexpected low-ℓ correlations (de Oliveira-Costa et al. 2004), a peculiar large cold spot in the southern Galactic hemisphere (Vielva et al. 2004), and a dipolar distribution of large-scale power (Eriksen et al. 2004b). Today, the literature on non-Gaussianity and violation of statistical isotropy in the WMAP data has grown very large, indeed (e.g., Bernui et al. 2006; Bielewicz et al. 2005; Copi et al. 2006; Cruz et al. 2005, 2006; Eriksen et al. 2004a, 2004c, 2005; Jaffe et al. 2005, 2006; Martínez-González et al. 2006; McEwen et al. 2008; Räth et al. 2007; Yadav & Wandelt 2008), and it would be unwise not to consider these issues very seriously.

Of particular interest to us is the question of hemispherical distribution of power in the WMAP data, first reported by Eriksen et al. (2004b) and later confirmed by, e.g., Hansen et al. (2004) and Eriksen et al. (2005). The most recent works on this topic include those presented by Hansen et al. (2008), who found that the power asymmetry extends to much smaller scales than previously thought, and by Eriksen et al. (2007a), who quantified the large-scale power asymmetry in the three-year WMAP data using an optimal Bayesian framework.

A separate, but possibly physically related, line of work was recently presented by Groeneboom & Eriksen (2009), who considered the specific model for violation of Lorenz invariance in the early universe, proposed by Ackerman et al. (2007). This model involves CMB correlations with a quadrupolar distribution on the sky, and is thus orthogonal to the current dipolar model. Surprisingly, when analyzing the five-year WMAP data, Groeneboom & Eriksen (2009) found supportive evidence for this model at the 3.8σ significance level, when considering angular scales up to ℓ ⩽ 400. Thus, assuming that the WMAP observations are free of unknown systematics, there appears to be increasing evidence for both dipolar and quadrupolar structure in the CMB power distribution, at all angular scales.

In this paper, we repeat the Bayesian analysis of Eriksen et al. (2007a), but double the angular resolution of the data. Nevertheless, we are still limited to relatively low angular resolutions, since the method inherently relies on brute-force evaluation of a pixel-based likelihood, and therefore scales as $\mathcal {O}(N_{{\rm pix}}^3)$. Yet, simply by spending more computer resources we are able to increase the pixel resolution from Nside = 16 to 32 and decrease the degradation smoothing scale from 9° to 4fdg5 FWHM. This provides additional support for multipoles between ℓ ≈ 40 and 80. While not sufficient to provide a full and direct comparison with the results of Hansen et al. (2008), it is a significant improvement over the results presented by Eriksen et al. (2007a).

2. OVERVIEW OF MODEL AND ALGORITHMS

The Bayesian analysis framework used in this paper is very similar to that employed by Eriksen et al. (2007a). We therefore only give a brief overview of its main features here, and refer the reader interested in the full details to the original paper and references therein.

2.1. Data Model and Likelihood

The starting point for our analysis is the phenomenological CMB signal model first proposed by Gordon et al. (2005),

Equation (1)

Here $\mathbf {d}(\hat{n})$ denotes the observed data in direction $\hat{n}$, $\mathbf {s}(\hat{n})$ is an intrinsically isotropic and Gaussian random field with power spectrum C, $f(\hat{n})$ is an auxiliary modulating field, and $\mathbf {n}(\hat{n})$ denotes instrumental noise.

Obviously, if f = 0, one recovers the standard isotropic model. However, we are interested in a possible hemispherical asymmetry, and we therefore parameterize the modulation field in terms of a dipole with a free amplitude A and a preferred direction $\hat{p}$,

Equation (2)

The modulated signal component is thus an anisotropic, but still Gaussian, random field, with covariance matrix

Equation (3)

where

Equation (4)

We now introduce one new feature compared to the analysis of Eriksen et al. (2007a), for two reasons. First, we are interested in studying the behavior of the modulation field as a function of ℓ-range, and therefore want a mechanism to restrict the impact of the modulation parameters in harmonic space. Second, we also want to minimize the impact of the arbitrary regularization noise (see Section 3) on the modulation parameters at high ℓ's. Therefore, we split the signal covariance matrix into two parts, one modulated low-ℓ part and other isotropic high-ℓ part,

Equation (5)

where only multipoles between 2 ⩽ ℓ < ℓmod are included in Smod, and only multipoles at ℓ ⩾ ℓmod are included in Siso. (Note that we are not proposing a physical mechanism for generating the modulation field in this paper, but only attempt to characterize its properties. This split may or may not be physically well motivated, but it does serve a useful purpose in the present paper as it allows us to study the scale dependence of the modulation field in a controlled manner.)

Including instrumental noise and possible foreground contamination, the full data covariance matrix reads

Equation (6)

The noise and foreground covariance matrices depend on the data processing, and will be described in greater detail in Section 3.

We also have to parameterize the power spectrum for the underlying isotropic component, C. Following Eriksen et al. (2007a), we choose a simple two-parameter model with a free amplitude q and tilt n for this purpose,

Equation (7)

Here ℓ0 is a pivot multipole and Cfid is a fiducial model, in the following chosen to be the best-fit ΛCDM power-law spectrum of Komatsu et al. (2009).

Since both the signal and noise are assumed to be Gaussian, the log-likelihood now reads

Equation (8)

up to an irrelevant constant, with $\mathbf {C}=\mathbf {C}(A, \hat{p}, q, n)$.

2.2. The Posterior Distribution and Bayesian Evidence

The posterior distribution for our model is given by Bayes' theorem,

Equation (9)

Here $P(q, n, A, \hat{p}|H)$ is a prior, and P(d|H) is a normalization factor often called the "Bayesian evidence." Note that we now have included an explicit reference to the hypothesis (or model), H, in all factors, as we will in the following compare two different hypotheses, namely "H1: The universe is isotropic (A = 0)" versus "H2: The universe is anisotropic (A ≠ 0)."

We adopt uniform priors for all priors in the following. Specifically, we adopt P(q) = Uniform[0.5, 1.5] and P(n) = Uniform[ − 0.5, 0.5] for the power spectrum, and a uniform prior over the sphere for the preferred axis, $\hat{p}$. The modulation amplitude prior is chosen uniformly over [0, Amax], where Amax = 0.20 is sufficiently large to fully encompass the nonzero parts of the likelihood. If more liberal priors are desired, the interested reader can easily calculate the corresponding evidence by subtracting the logarithm of the volume expansion factor from the results quoted in this paper.

With these definitions and priors, the posterior distribution, $P(q, n, A, \hat{\mathbf {p}} | \mathbf {d}, H)$, is mapped out with a standard MCMC sampler. The Bayesian evidence, E = P(d|H), is computed with the "nested sampling" algorithm (Skilling 2004; Mukherjee et al. 2006). For further details on both procedures, we refer the interested reader to Eriksen et al. (2007a).

For easy reference, we recall Jeffreys' interpretational scale for the Bayesian evidence (Jeffreys 1961): a value of Δln E < 1 indicates a result "not worth more than a bare mentioning;" a value of 1 < Δln E < 2.5 is considered as "significant" evidence; a value of 2.5 < Δln E < 5 is considered "strong to very strong;" and Δln E > 5 ranks as "decisive."

3. DATA

In this paper, we analyze several downgraded versions of the five-year WMAP temperature sky maps, namely the template-corrected Q-, V-, and W-band maps, as well as the "foreground cleaned" Internal Linear Combination (ILC) map (Gold et al. 2009). Each map is downgraded to low resolution as follows (Eriksen et al. 2007b): first, each map is downgraded to a HEALPix8 resolution of Nside = 32, by smoothing to an effective resolution of 4fdg5 FWHM and properly taking into account the respective pixel windows. We then add uniform Gaussian noise of σn = 1 μK rms to each pixel, in order to regularize the pixel–pixel covariance matrix at small angular scales. The resulting maps have a signal-to-noise ratio of unity at ℓ = 80, and are strongly noise dominated at ℓmax = 95.

Two different sky cuts are used in the analyses, both of which are based on the WMAP KQ85 mask (Gold et al. 2009). In the first case, we directly downgrade the KQ85 cut to the appropriate Nside, by excluding any HEALPix pixel for which more than half of the corresponding sub-pixels are missing. This mask is simply denoted by KQ85. In the second case, we smooth the mask image (consisting of 0's and 1's) with a beam of 4fdg5 FWHM, and reject all pixels with a value less than 0.99. We call this expanded mask KQ85e. The two masks remove 16.3% and 26.9% of the pixels, respectively.

The instrumental signal-to-noise ratio of the WMAP data is very high at large angular scales, at about 150 for the V band at ℓ = 100. The only important noise contribution in the downgraded sky maps is therefore the uniform regularization noise, which is not subject to the additional beam smoothing. We therefore approximate the noise covariance matrix by Nij = σ2nδij. Note that this approximation was explicitly validated by Eriksen et al. (2007a) for the three-year WMAP data, which have higher instrumental noise than the five-year data.

We also marginalize over a fixed set of "foreground templates," ti, by adding an additional term to the data covariance matrix of the form Fi = αititTi, with αi ≳ 103, for each template. In addition to one monopole and three dipole templates,9 we use the V–ILC difference map as a template for both the V band and ILC maps, the Q–ILC difference for the Q band, and the W–ILC difference for the W band. However, these foreground templates do not affect the results noticeably in either case, due to the sky cuts used.

4. RESULTS

The main results from the analysis outlined above are summarized in Table 1. We consider nine different data combinations (i.e., frequency bands, masks, and multipole range), and show (1) the best-fit modulation axis and amplitude, both with 68% confidence regions, (2) the statistical significance of the corresponding amplitude (i.e., AA), and (3) the raw improvement in χ2 and Bayesian log-evidence for the modulated model over the isotropic model. The last items are shown for the ILC with the KQ85 sky cut only. For reference, we also quote the ILC result for the Kp2 mask (Bennett et al. 2003b) reported by Eriksen et al. (2007a) when analyzing the Nside = 16° and 9° FWHM data.

Table 1. Summary Statistics for Modulated CMB Model Posteriors

Data Mask mod (lbf, bbf) Abf Significance (σ) $\Delta \log \mathcal {L}$ Δlog E
ILC KQ85 64 (224°, − 22°) ± 24° 0.072 ± 0.022 3.3 7.3 2.6
V band KQ85 64 (232°, − 22°) ± 23° 0.080 ± 0.021 3.8 ... ...
V band KQ85 40 (224°, − 22°) ± 24° 0.119 ± 0.034 3.5 ... ...
V band KQ85 80 (235°, − 17°) ± 22° 0.070 ± 0.019 3.7 ... ...
W band KQ85 64 (232°, − 22°) ± 24° 0.074 ± 0.021 3.5 ... ...
ILC KQ85e 64 (215°, − 19°) ± 28° 0.066 ± 0.025 2.6 ... ...
Q band KQ85e 64 (245°, − 21°) ± 23° 0.088 ± 0.022 3.9 ... ...
V band KQ85e 64 (228°, − 18°) ± 28° 0.067 ± 0.025 2.7 ... ...
W band KQ85e 64 (226°, − 19°) ± 31° 0.061 ± 0.025 2.5 ... ...
ILCa Kp2 ∼40 (225°, − 27°)       0.11 ± 0.04 2.8 6.1 1.8

Notes. Listed quantities are data set (first column); mask (second column); maximum multipole used for modulation covariance matrix, ℓmod (third column); marginal best-fit dipole axis (fourth column) and amplitude (fifth column) with 68% confidence regions indicated; statistical significance of nonzero detection of A (sixth column); the change in maximum likelihood between modulated and isotropic models, $\Delta \log \mathcal {L} = \log \mathcal {L}_{{\rm mod}} -\log \mathcal {L}_{{\rm iso}}$ (seventh column); and the Bayesian evidence difference, Δlog E = log Emod − log Eiso (eighth column). The latter two were only computed for one data set, due to a high computational cost. However, other values can be estimated by comparing the significances indicated in the sixth column. aResults computed from Nside = 16 and 9° FWHM data, as presented by Eriksen et al. (2007a).

Download table as:  ASCIITypeset image

The reason for providing the full evidence for only one data set is solely computational. The total CPU cost for the full set of computations presented here was ∼50, 000 CPU hr, and the evidence calculation constitutes a significant fraction of this. On the other hand, the evidence is closely related to the significance level AA, and one can therefore estimate the evidence level for other cases in Table 1 given the two explicit evidence values and significances. We have therefore chosen to spend our available CPU time on more MCMC posterior analyses, rather than on more evidence computations.

We first consider the results for the ILC map with the KQ85 mask and ℓmod = 64. In this case, the best-fit amplitude is A = 0.072 ± 0.022, nonzero at the 3.3σ confidence level. The best-fit axis points toward Galactic coordinates (l, b) = (224°, − 22°), with a 68% uncertainty of 24°. These results are consistent with the results presented by Eriksen et al. (2007a), who found an amplitude of A = 0.11 ± 0.04 and a best-fit axis of (l, b) = (225°, − 27°) for ℓ ≲ 40.

Second, we see that these results are only weakly dependent on frequency, as both the V band and W band for the same mask and ℓ-range have amplitudes within 0.5σ of the ILC map, with A = 0.080 and A = 0.074, and nonzero at 3.8σ and 3.5σ, respectively. (We have not included the Q-band analysis for the KQ85 mask, as there were clearly visible foreground residuals outside the mask for this case.) The corresponding marginal posteriors are shown in Figure 1, clearly demonstrating the consistency between data sets. Figure 2 compares the best-fit axes of the three data sets, and also indicates the axes reported by Eriksen et al. (2004b) and Eriksen et al. (2007a).

Figure 1.

Figure 1. Posterior distributions for the dipole modulation amplitude, marginalized over direction and CMB power spectrum, computed for the KQ85 sky cut and ℓmod = 64.

Standard image High-resolution image
Figure 2.

Figure 2. Posterior distribution for the dipole modulation axis, shown for the V-band map and KQ85 sky cut, marginalized over power spectrum and amplitude parameters. Gray sky pixels indicate pixels outside the 2σ confidence region. The dots indicate the axis (1) reported by Eriksen et al. (2004b) in white; (2) for both the ILC and V-band maps (these have the same best-fit axis) with the KQ85 sky cut in black; (3) for the W bands in blue, and the axis reported by Eriksen et al. (2007a) in green. Note that the background distribution has been smoothed for plotting purposes to reduce visual Monte Carlo noise.

Standard image High-resolution image

Next, we also see that the results are not strongly dependent on the choice of mask, as the amplitudes for the extended KQ85e mask are consistent with the KQ85 results, even though it removes an additional 10% of the sky. However, we do see, as expected, that the error bars increase somewhat by removing the additional part of the sky, and this reduces the absolute significances somewhat.

Finally, the best-fit modulation amplitudes for the V-band data and KQ85 mask are A = 0.12 for ℓmod = 40, A = 0.080 for ℓmod = 64, and A = 0.070 for ℓmod = 80 at 3.5σ, 3.8σ, and 3.7σ, respectively. This is an interesting observation for theoreticians who are interested in constructing a fundamental model for the effect: taken at face value, these amplitudes could indicate a non-scale-invariant behavior of A, as also noted by Hansen et al. (2008). On the other hand, the statistical significance of this statement is so far quite low, as a single common value A ∼ 0.07 is also consistent with all measurements. Better measurements at higher ℓ's are required to unambiguously settle this question.

5. CONCLUSIONS

Shortly, following the release of the first-year WMAP data in 2003, Eriksen et al. (2004a) presented the early evidence for a dipolar distribution of power in the CMB temperature anisotropy sky, considering only the large angular scales of the WMAP data. Next, Groeneboom & Eriksen (2009) presented the evidence for a quadrupolar distribution of CMB power, and found that this feature extended over all ℓ's under consideration. Finally, Hansen et al. (2008) found that the dipolar CMB power distribution is also present at high ℓ's. The evidence for violation of statistical isotropy in the CMB field is currently increasing rapidly, and the significance of these detections are approaching 4σ.

In this paper, we revisit the high-ℓ claims of Hansen et al. (2008), by applying an optimal Bayesian framework based on a parametric modulated CMB model to the WMAP data at higher multipoles than previously considered with this method, albeit lower than those considered by Hansen et al. (2008). In doing so, we find results very consistent with those presented by Hansen et al. (2008): the evidence for a dipolar distribution of power in the WMAP data increases with ℓ. For example, when considering the V-band data and KQ85 sky cut, the statistical significance of the modulated model increases from 3.2σ at ℓmod = 40 to 3.8σ at ℓmod = 64, and 3.7σ at ℓmod = 80.

The Bayesian evidence now also ranking within the "strong to very strong" category on Jeffreys' scale. However, it should be noted that the Bayesian evidence is by nature strongly prior dependent, and if we had chosen a prior twice as large as the one actually used, the corresponding log-evidence for the ILC map would have fallen from Δln E = 2.6 to 1.7, ranking only as "substantial" evidence. For this reason, it is in many respects easier to attach a firm statistical interpretation to the posterior distribution than the Bayesian evidence.

It is interesting to note that the absolute amplitude A may show hints of decreasing with ℓ. It is premature to say whether this is due simply to a statistical fluctuation, or whether it might point toward a non-scale-invariant underlying physical effect, in which case the amplitude A should be replaced with a function A(ℓ). Either case is currently allowed by the data.

To answer this question, and further constrain the overall model, better algorithms are required. The current approach relies on brute-force inversion of an Npix × Npix covariance matrix, and therefore scales as $\mathcal {O}(N_{{\rm pix}}^3)$ or $\mathcal {O}(N_{{\rm side}}^6)$. However, already the present analysis, performed at Nside = 32, required ∼50,000 CPU hr, and increasing Nside by an additional factor of 2 would require ∼3 million CPU hr. More efficient algorithms are clearly needed.

To summarize, there is currently substantial evidence for both dipolar (Hansen et al. 2008 and this work) and quadrupolar power distribution (Groeneboom & Eriksen 2009) in the WMAP data, and this is seen at all probed scales. The magnitude of the dipolar mode is considerably stronger than the quadrupolar mode, as a ∼3.5σ significance level is reached already at ℓ ∼ 64 for the dipole, while the same significance was obtained at ℓ ∼ 400 for the quadrupole.

These observations may prove useful for theorists attempting to construct alternative models for these features, either phenomenological or fundamental. Considerable efforts have gone toward this goal already (e.g., Ackerman et al. 2007; Böhmer & Mota 2008; Carroll et al. 2008, 2009; Chang et al. 2009; Erickcek et al. 2008a, 2008b; Gordon et al. 2005; Emir Gümrükçüoglu et al. 2007; Himmetoglu et al. 2009a, 2009b; Kahniashvili et al. 2008; Kanno et al. 2008; Koivisto & Mota 2008a, 2008b; Pereira et al. 2007; Pitrou et al. 2008; Pullen & Kamionkowski 2007; Watanabe et al. 2009; Yokoyama & Soda 2008), but so far no fully convincing model has been established. Clearly, more work is needed on both the theoretical and observational side of this issue. Fortunately, it is now only a few years until Planck will open up a whole new window on these issues by producing high-sensitivity maps of the CMB polarization, as well as measuring the temperature fluctuations to arcminute scales. We will then be able to measure the properties of the dipole, quadrupole, and, possibly, higher-order modes of the modulation field to unprecedented accuracy.

H.K.E. acknowledges financial support from the Research Council of Norway. The computations presented in this paper were carried out on Titan, a cluster owned and maintained by the University of Oslo and NOTUR. Some of the results in this paper have been derived using the HEALPix (Górski et al. 2005) software and analysis package. We acknowledge use of the Legacy Archive for Microwave Background Data Analysis (LAMBDA). Support for LAMBDA is provided by the NASA Office of Space Science.

Footnotes

  • For an explicit demonstration of the importance of monopole and dipole marginalization on this specific problem, see Gordon (2007).

Please wait… references are loading.
10.1088/0004-637X/699/2/985