INCREASING EVIDENCE FOR HEMISPHERICAL POWER ASYMMETRY IN THE FIVE-YEAR WMAP DATA

J. Hoftuft; H. K. Eriksen; A. J. Banday; K. M. Górski; F. K. Hansen; P. B. Lilje

doi:10.1088/0004-637X/699/2/985

1. INTRODUCTION

The question of statistical isotropy in the cosmic microwave background (CMB) has received much attention within the cosmological community ever since the release of the first-year Wilkinson Microwave Anisotropy Probe (WMAP; Bennett et al. 2003a) in 2003. The reasons for this are twofold. On the one hand, the current cosmological concordance model is based on the concept of inflation (Starobinsky 1980; Guth 1981; Linde et al. 1982; Mukhanov et al. 1981; Starobinsky 1982; Linde et al. 1983, 1994; Smoot 1992; Ruhl 2003; Rynyan 2003; Scott 2003), which predicts a statistically homogeneous and isotropic universe. Since inflation has proved highly successful in describing a host of cosmological probes, most importantly the CMB and large-scale power spectra, this undeniably imposes a strong theoretical prior toward isotropy and homogeneity.

On the other hand, many detailed studies of the WMAP sky maps, employing higher-order statistics, have revealed strong hints of both violation of statistical isotropy and non-Gaussianity. Some early notable examples include unexpected low-ℓ correlations (de Oliveira-Costa et al. 2004), a peculiar large cold spot in the southern Galactic hemisphere (Vielva et al. 2004), and a dipolar distribution of large-scale power (Eriksen et al. 2004b). Today, the literature on non-Gaussianity and violation of statistical isotropy in the WMAP data has grown very large, indeed (e.g., Bernui et al. 2006; Bielewicz et al. 2005; Copi et al. 2006; Cruz et al. 2005, 2006; Eriksen et al. 2004a, 2004c, 2005; Jaffe et al. 2005, 2006; Martínez-González et al. 2006; McEwen et al. 2008; Räth et al. 2007; Yadav & Wandelt 2008), and it would be unwise not to consider these issues very seriously.

Of particular interest to us is the question of hemispherical distribution of power in the WMAP data, first reported by Eriksen et al. (2004b) and later confirmed by, e.g., Hansen et al. (2004) and Eriksen et al. (2005). The most recent works on this topic include those presented by Hansen et al. (2008), who found that the power asymmetry extends to much smaller scales than previously thought, and by Eriksen et al. (2007a), who quantified the large-scale power asymmetry in the three-year WMAP data using an optimal Bayesian framework.

A separate, but possibly physically related, line of work was recently presented by Groeneboom & Eriksen (2009), who considered the specific model for violation of Lorenz invariance in the early universe, proposed by Ackerman et al. (2007). This model involves CMB correlations with a quadrupolar distribution on the sky, and is thus orthogonal to the current dipolar model. Surprisingly, when analyzing the five-year WMAP data, Groeneboom & Eriksen (2009) found supportive evidence for this model at the 3.8σ significance level, when considering angular scales up to ℓ ⩽ 400. Thus, assuming that the WMAP observations are free of unknown systematics, there appears to be increasing evidence for both dipolar and quadrupolar structure in the CMB power distribution, at all angular scales.

In this paper, we repeat the Bayesian analysis of Eriksen et al. (2007a), but double the angular resolution of the data. Nevertheless, we are still limited to relatively low angular resolutions, since the method inherently relies on brute-force evaluation of a pixel-based likelihood, and therefore scales as $\mathcal {O}(N_{{\rm pix}}^3)$ . Yet, simply by spending more computer resources we are able to increase the pixel resolution from N_side = 16 to 32 and decrease the degradation smoothing scale from 9° to 4 fdg 5 FWHM. This provides additional support for multipoles between ℓ ≈ 40 and 80. While not sufficient to provide a full and direct comparison with the results of Hansen et al. (2008), it is a significant improvement over the results presented by Eriksen et al. (2007a).

2. OVERVIEW OF MODEL AND ALGORITHMS

The Bayesian analysis framework used in this paper is very similar to that employed by Eriksen et al. (2007a). We therefore only give a brief overview of its main features here, and refer the reader interested in the full details to the original paper and references therein.

2.1. Data Model and Likelihood

The starting point for our analysis is the phenomenological CMB signal model first proposed by Gordon et al. (2005),

$\begin{equation} \mathbf {d}(\hat{n}) = [1 + f(\hat{n})] \mathbf {s}(\hat{n}) + \mathbf {n}(\hat{n}). \end{equation} \tag{ 1 }$

Here $\mathbf {d}(\hat{n})$ denotes the observed data in direction $\hat{n}$ , $\mathbf {s}(\hat{n})$ is an intrinsically isotropic and Gaussian random field with power spectrum C_ℓ, $f(\hat{n})$ is an auxiliary modulating field, and $\mathbf {n}(\hat{n})$ denotes instrumental noise.

Obviously, if f = 0, one recovers the standard isotropic model. However, we are interested in a possible hemispherical asymmetry, and we therefore parameterize the modulation field in terms of a dipole with a free amplitude A and a preferred direction $\hat{p}$ ,

$\begin{equation} f(\hat{n}) = A\,(\hat{n}\cdot \hat{p}). \end{equation} \tag{ 2 }$

The modulated signal component is thus an anisotropic, but still Gaussian, random field, with covariance matrix

$\begin{equation} \mathbf {S}_{{\rm mod}}(\hat{n}, \hat{m}) = [1+A\,(\hat{n}\cdot \hat{p})] \mathbf {S}_{{\rm iso}}(\hat{n}, \hat{m})[1+A\,(\hat{m}\cdot \hat{p})], \end{equation} \tag{ 3 }$

where

$\begin{equation} \mathbf {S}_{{\rm iso}}(\hat{n}, \hat{m}) = \frac{1}{4\pi }\sum _{\ell } (2\ell +1) C_{\ell } P_{\ell }(\hat{n}\cdot \hat{m}). \end{equation} \tag{ 4 }$

We now introduce one new feature compared to the analysis of Eriksen et al. (2007a), for two reasons. First, we are interested in studying the behavior of the modulation field as a function of ℓ-range, and therefore want a mechanism to restrict the impact of the modulation parameters in harmonic space. Second, we also want to minimize the impact of the arbitrary regularization noise (see Section 3) on the modulation parameters at high ℓ's. Therefore, we split the signal covariance matrix into two parts, one modulated low-ℓ part and other isotropic high-ℓ part,

$\begin{equation} \mathbf {S}_{{\rm total}}= \mathbf {S}_{{\rm mod}}+\mathbf {S}_{{\rm iso}}, \end{equation} \tag{ 5 }$

where only multipoles between 2 ⩽ ℓ < ℓ_mod are included in S_mod, and only multipoles at ℓ ⩾ ℓ_mod are included in S_iso. (Note that we are not proposing a physical mechanism for generating the modulation field in this paper, but only attempt to characterize its properties. This split may or may not be physically well motivated, but it does serve a useful purpose in the present paper as it allows us to study the scale dependence of the modulation field in a controlled manner.)

Including instrumental noise and possible foreground contamination, the full data covariance matrix reads

$\begin{equation} \mathbf {C} = \mathbf {S}_{{\rm mod}}(A,\hat{p}) + \mathbf {S}_{{\rm iso}} + \mathbf {N} + \mathbf {F}. \end{equation} \tag{ 6 }$

The noise and foreground covariance matrices depend on the data processing, and will be described in greater detail in Section 3.

We also have to parameterize the power spectrum for the underlying isotropic component, C_ℓ. Following Eriksen et al. (2007a), we choose a simple two-parameter model with a free amplitude q and tilt n for this purpose,

$\begin{equation} C_{\ell } = q \left(\frac{\ell }{\ell _0}\right)^{n} C_{\ell }^{{\rm fid}}. \end{equation} \tag{ 7 }$

Here ℓ₀ is a pivot multipole and C^fid_ℓ is a fiducial model, in the following chosen to be the best-fit ΛCDM power-law spectrum of Komatsu et al. (2009).

Since both the signal and noise are assumed to be Gaussian, the log-likelihood now reads

$\begin{equation} -2\log \mathcal {L}(A, \hat{p}, q, n) = \mathbf {d}^T \mathbf {C}^{-1} \mathbf {d} + \log |\mathbf {C}|, \end{equation} \tag{ 8 }$

up to an irrelevant constant, with $\mathbf {C}=\mathbf {C}(A, \hat{p}, q, n)$ .

2.2. The Posterior Distribution and Bayesian Evidence

The posterior distribution for our model is given by Bayes' theorem,

$\begin{equation} P(q, n, A, \hat{p} | \mathbf {d}, H) = \frac{\mathcal {L}(q, n, A, \hat{p}) P(q, n, A, \hat{p}|H)}{P(\mathbf {d}|H)}. \end{equation} \tag{ 9 }$

Here $P(q, n, A, \hat{p}|H)$ is a prior, and P(d|H) is a normalization factor often called the "Bayesian evidence." Note that we now have included an explicit reference to the hypothesis (or model), H, in all factors, as we will in the following compare two different hypotheses, namely "H1: The universe is isotropic (A = 0)" versus "H2: The universe is anisotropic (A ≠ 0)."

We adopt uniform priors for all priors in the following. Specifically, we adopt P(q) = Uniform[0.5, 1.5] and P(n) = Uniform[ − 0.5, 0.5] for the power spectrum, and a uniform prior over the sphere for the preferred axis, $\hat{p}$ . The modulation amplitude prior is chosen uniformly over [0, A_max], where A_max = 0.20 is sufficiently large to fully encompass the nonzero parts of the likelihood. If more liberal priors are desired, the interested reader can easily calculate the corresponding evidence by subtracting the logarithm of the volume expansion factor from the results quoted in this paper.

With these definitions and priors, the posterior distribution, $P(q, n, A, \hat{\mathbf {p}} | \mathbf {d}, H)$ , is mapped out with a standard MCMC sampler. The Bayesian evidence, E = P(d|H), is computed with the "nested sampling" algorithm (Skilling 2004; Mukherjee et al. 2006). For further details on both procedures, we refer the interested reader to Eriksen et al. (2007a).

For easy reference, we recall Jeffreys' interpretational scale for the Bayesian evidence (Jeffreys 1961): a value of Δln E < 1 indicates a result "not worth more than a bare mentioning;" a value of 1 < Δln E < 2.5 is considered as "significant" evidence; a value of 2.5 < Δln E < 5 is considered "strong to very strong;" and Δln E > 5 ranks as "decisive."

3. DATA

In this paper, we analyze several downgraded versions of the five-year WMAP temperature sky maps, namely the template-corrected Q-, V-, and W-band maps, as well as the "foreground cleaned" Internal Linear Combination (ILC) map (Gold et al. 2009). Each map is downgraded to low resolution as follows (Eriksen et al. 2007b): first, each map is downgraded to a HEALPix⁸ resolution of N_side = 32, by smoothing to an effective resolution of 4 fdg 5 FWHM and properly taking into account the respective pixel windows. We then add uniform Gaussian noise of σ_n = 1 μK rms to each pixel, in order to regularize the pixel–pixel covariance matrix at small angular scales. The resulting maps have a signal-to-noise ratio of unity at ℓ = 80, and are strongly noise dominated at ℓ_max = 95.

Two different sky cuts are used in the analyses, both of which are based on the WMAP KQ85 mask (Gold et al. 2009). In the first case, we directly downgrade the KQ85 cut to the appropriate N_side, by excluding any HEALPix pixel for which more than half of the corresponding sub-pixels are missing. This mask is simply denoted by KQ85. In the second case, we smooth the mask image (consisting of 0's and 1's) with a beam of 4 fdg 5 FWHM, and reject all pixels with a value less than 0.99. We call this expanded mask KQ85e. The two masks remove 16.3% and 26.9% of the pixels, respectively.

The instrumental signal-to-noise ratio of the WMAP data is very high at large angular scales, at about 150 for the V band at ℓ = 100. The only important noise contribution in the downgraded sky maps is therefore the uniform regularization noise, which is not subject to the additional beam smoothing. We therefore approximate the noise covariance matrix by N_ij = σ²_nδ_ij. Note that this approximation was explicitly validated by Eriksen et al. (2007a) for the three-year WMAP data, which have higher instrumental noise than the five-year data.

We also marginalize over a fixed set of "foreground templates," t_i, by adding an additional term to the data covariance matrix of the form F_i = α_it_it^T_i, with α_i ≳ 10³, for each template. In addition to one monopole and three dipole templates,⁹ we use the V–ILC difference map as a template for both the V band and ILC maps, the Q–ILC difference for the Q band, and the W–ILC difference for the W band. However, these foreground templates do not affect the results noticeably in either case, due to the sky cuts used.

4. RESULTS

The main results from the analysis outlined above are summarized in Table 1. We consider nine different data combinations (i.e., frequency bands, masks, and multipole range), and show (1) the best-fit modulation axis and amplitude, both with 68% confidence regions, (2) the statistical significance of the corresponding amplitude (i.e., A/σ_A), and (3) the raw improvement in χ² and Bayesian log-evidence for the modulated model over the isotropic model. The last items are shown for the ILC with the KQ85 sky cut only. For reference, we also quote the ILC result for the Kp2 mask (Bennett et al. 2003b) reported by Eriksen et al. (2007a) when analyzing the N_side = 16° and 9° FWHM data.

Table 1. Summary Statistics for Modulated CMB Model Posteriors

Data	Mask	ℓ_mod	(l_bf, b_bf)	A_bf	Significance (σ)	$\Delta \log \mathcal {L}$	Δlog E
ILC	KQ85	64	(224°, − 22°) ± 24°	0.072 ± 0.022	3.3	7.3	2.6
V band	KQ85	64	(232°, − 22°) ± 23°	0.080 ± 0.021	3.8	...	...
V band	KQ85	40	(224°, − 22°) ± 24°	0.119 ± 0.034	3.5	...	...
V band	KQ85	80	(235°, − 17°) ± 22°	0.070 ± 0.019	3.7	...	...
W band	KQ85	64	(232°, − 22°) ± 24°	0.074 ± 0.021	3.5	...	...
ILC	KQ85e	64	(215°, − 19°) ± 28°	0.066 ± 0.025	2.6	...	...
Q band	KQ85e	64	(245°, − 21°) ± 23°	0.088 ± 0.022	3.9	...	...
V band	KQ85e	64	(228°, − 18°) ± 28°	0.067 ± 0.025	2.7	...	...
W band	KQ85e	64	(226°, − 19°) ± 31°	0.061 ± 0.025	2.5	...	...
ILC^a	Kp2	∼40	(225°, − 27°)	0.11 ± 0.04	2.8	6.1	1.8

Notes. Listed quantities are data set (first column); mask (second column); maximum multipole used for modulation covariance matrix, ℓ_mod (third column); marginal best-fit dipole axis (fourth column) and amplitude (fifth column) with 68% confidence regions indicated; statistical significance of nonzero detection of A (sixth column); the change in maximum likelihood between modulated and isotropic models, $\Delta \log \mathcal {L} = \log \mathcal {L}_{{\rm mod}} -\log \mathcal {L}_{{\rm iso}}$ (seventh column); and the Bayesian evidence difference, Δlog E = log E_mod − log E_iso (eighth column). The latter two were only computed for one data set, due to a high computational cost. However, other values can be estimated by comparing the significances indicated in the sixth column. ^aResults computed from N_side = 16 and 9° FWHM data, as presented by Eriksen et al. (2007a).

Download table as: ASCII Typeset image

The reason for providing the full evidence for only one data set is solely computational. The total CPU cost for the full set of computations presented here was ∼50, 000 CPU hr, and the evidence calculation constitutes a significant fraction of this. On the other hand, the evidence is closely related to the significance level A/σ_A, and one can therefore estimate the evidence level for other cases in Table 1 given the two explicit evidence values and significances. We have therefore chosen to spend our available CPU time on more MCMC posterior analyses, rather than on more evidence computations.

We first consider the results for the ILC map with the KQ85 mask and ℓ_mod = 64. In this case, the best-fit amplitude is A = 0.072 ± 0.022, nonzero at the 3.3σ confidence level. The best-fit axis points toward Galactic coordinates (l, b) = (224°, − 22°), with a 68% uncertainty of 24°. These results are consistent with the results presented by Eriksen et al. (2007a), who found an amplitude of A = 0.11 ± 0.04 and a best-fit axis of (l, b) = (225°, − 27°) for ℓ ≲ 40.

Second, we see that these results are only weakly dependent on frequency, as both the V band and W band for the same mask and ℓ-range have amplitudes within 0.5σ of the ILC map, with A = 0.080 and A = 0.074, and nonzero at 3.8σ and 3.5σ, respectively. (We have not included the Q-band analysis for the KQ85 mask, as there were clearly visible foreground residuals outside the mask for this case.) The corresponding marginal posteriors are shown in Figure 1, clearly demonstrating the consistency between data sets. Figure 2 compares the best-fit axes of the three data sets, and also indicates the axes reported by Eriksen et al. (2004b) and Eriksen et al. (2007a).

**Figure 1.** Posterior distributions for the dipole modulation amplitude, marginalized over direction and CMB power spectrum, computed for the KQ85 sky cut and ℓ_mod = 64.
Download figure:
Standard image High-resolution image

**Figure 2.** Posterior distribution for the dipole modulation axis, shown for the V-band map and KQ85 sky cut, marginalized over power spectrum and amplitude parameters. Gray sky pixels indicate pixels outside the 2σ confidence region. The dots indicate the axis (1) reported by Eriksen et al. (2004b) in white; (2) for both the ILC and V-band maps (these have the same best-fit axis) with the KQ85 sky cut in black; (3) for the W bands in blue, and the axis reported by Eriksen et al. (2007a) in green. Note that the background distribution has been smoothed for plotting purposes to reduce visual Monte Carlo noise.
Download figure:
Standard image High-resolution image

Next, we also see that the results are not strongly dependent on the choice of mask, as the amplitudes for the extended KQ85e mask are consistent with the KQ85 results, even though it removes an additional 10% of the sky. However, we do see, as expected, that the error bars increase somewhat by removing the additional part of the sky, and this reduces the absolute significances somewhat.

Finally, the best-fit modulation amplitudes for the V-band data and KQ85 mask are A = 0.12 for ℓ_mod = 40, A = 0.080 for ℓ_mod = 64, and A = 0.070 for ℓ_mod = 80 at 3.5σ, 3.8σ, and 3.7σ, respectively. This is an interesting observation for theoreticians who are interested in constructing a fundamental model for the effect: taken at face value, these amplitudes could indicate a non-scale-invariant behavior of A, as also noted by Hansen et al. (2008). On the other hand, the statistical significance of this statement is so far quite low, as a single common value A ∼ 0.07 is also consistent with all measurements. Better measurements at higher ℓ's are required to unambiguously settle this question.

5. CONCLUSIONS

Shortly, following the release of the first-year WMAP data in 2003, Eriksen et al. (2004a) presented the early evidence for a dipolar distribution of power in the CMB temperature anisotropy sky, considering only the large angular scales of the WMAP data. Next, Groeneboom & Eriksen (2009) presented the evidence for a quadrupolar distribution of CMB power, and found that this feature extended over all ℓ's under consideration. Finally, Hansen et al. (2008) found that the dipolar CMB power distribution is also present at high ℓ's. The evidence for violation of statistical isotropy in the CMB field is currently increasing rapidly, and the significance of these detections are approaching 4σ.

In this paper, we revisit the high-ℓ claims of Hansen et al. (2008), by applying an optimal Bayesian framework based on a parametric modulated CMB model to the WMAP data at higher multipoles than previously considered with this method, albeit lower than those considered by Hansen et al. (2008). In doing so, we find results very consistent with those presented by Hansen et al. (2008): the evidence for a dipolar distribution of power in the WMAP data increases with ℓ. For example, when considering the V-band data and KQ85 sky cut, the statistical significance of the modulated model increases from 3.2σ at ℓ_mod = 40 to 3.8σ at ℓ_mod = 64, and 3.7σ at ℓ_mod = 80.

The Bayesian evidence now also ranking within the "strong to very strong" category on Jeffreys' scale. However, it should be noted that the Bayesian evidence is by nature strongly prior dependent, and if we had chosen a prior twice as large as the one actually used, the corresponding log-evidence for the ILC map would have fallen from Δln E = 2.6 to 1.7, ranking only as "substantial" evidence. For this reason, it is in many respects easier to attach a firm statistical interpretation to the posterior distribution than the Bayesian evidence.

It is interesting to note that the absolute amplitude A may show hints of decreasing with ℓ. It is premature to say whether this is due simply to a statistical fluctuation, or whether it might point toward a non-scale-invariant underlying physical effect, in which case the amplitude A should be replaced with a function A(ℓ). Either case is currently allowed by the data.

To answer this question, and further constrain the overall model, better algorithms are required. The current approach relies on brute-force inversion of an N_pix × N_pix covariance matrix, and therefore scales as $\mathcal {O}(N_{{\rm pix}}^3)$ or $\mathcal {O}(N_{{\rm side}}^6)$ . However, already the present analysis, performed at N_side = 32, required ∼50,000 CPU hr, and increasing N_side by an additional factor of 2 would require ∼3 million CPU hr. More efficient algorithms are clearly needed.

To summarize, there is currently substantial evidence for both dipolar (Hansen et al. 2008 and this work) and quadrupolar power distribution (Groeneboom & Eriksen 2009) in the WMAP data, and this is seen at all probed scales. The magnitude of the dipolar mode is considerably stronger than the quadrupolar mode, as a ∼3.5σ significance level is reached already at ℓ ∼ 64 for the dipole, while the same significance was obtained at ℓ ∼ 400 for the quadrupole.

These observations may prove useful for theorists attempting to construct alternative models for these features, either phenomenological or fundamental. Considerable efforts have gone toward this goal already (e.g., Ackerman et al. 2007; Böhmer & Mota 2008; Carroll et al. 2008, 2009; Chang et al. 2009; Erickcek et al. 2008a, 2008b; Gordon et al. 2005; Emir Gümrükçüoglu et al. 2007; Himmetoglu et al. 2009a, 2009b; Kahniashvili et al. 2008; Kanno et al. 2008; Koivisto & Mota 2008a, 2008b; Pereira et al. 2007; Pitrou et al. 2008; Pullen & Kamionkowski 2007; Watanabe et al. 2009; Yokoyama & Soda 2008), but so far no fully convincing model has been established. Clearly, more work is needed on both the theoretical and observational side of this issue. Fortunately, it is now only a few years until Planck will open up a whole new window on these issues by producing high-sensitivity maps of the CMB polarization, as well as measuring the temperature fluctuations to arcminute scales. We will then be able to measure the properties of the dipole, quadrupole, and, possibly, higher-order modes of the modulation field to unprecedented accuracy.

H.K.E. acknowledges financial support from the Research Council of Norway. The computations presented in this paper were carried out on Titan, a cluster owned and maintained by the University of Oslo and NOTUR. Some of the results in this paper have been derived using the HEALPix (Górski et al. 2005) software and analysis package. We acknowledge use of the Legacy Archive for Microwave Background Data Analysis (LAMBDA). Support for LAMBDA is provided by the NASA Office of Space Science.

INCREASING EVIDENCE FOR HEMISPHERICAL POWER ASYMMETRY IN THE FIVE-YEAR WMAP DATA

Article metrics

Permissions

Author e-mails

Author affiliations

Dates

ABSTRACT

1. INTRODUCTION

2. OVERVIEW OF MODEL AND ALGORITHMS

2.1. Data Model and Likelihood

2.2. The Posterior Distribution and Bayesian Evidence

3. DATA

4. RESULTS

5. CONCLUSIONS

Footnotes

INCREASING EVIDENCE FOR HEMISPHERICAL POWER ASYMMETRY IN THE FIVE-YEAR WMAP DATA

Article metrics

Permissions

Share this article

Author e-mails

Author affiliations

Dates

ABSTRACT

1. INTRODUCTION

2. OVERVIEW OF MODEL AND ALGORITHMS

2.1. Data Model and Likelihood

2.2. The Posterior Distribution and Bayesian Evidence

3. DATA

4. RESULTS

5. CONCLUSIONS

Footnotes