Formal Solutions for Polarized Radiative Transfer. I. The DELO Family

, , , and

Published 2017 May 12 © 2017. The American Astronomical Society. All rights reserved.
, , Citation Gioele Janett et al 2017 ApJ 840 107 DOI 10.3847/1538-4357/aa671d

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/840/2/107

Abstract

The discussion regarding the numerical integration of the polarized radiative transfer equation is still open and the comparison between the different numerical schemes proposed by different authors in the past is not fully clear. Aiming at facilitating the comprehension of the advantages and drawbacks of the different formal solvers, this work presents a reference paradigm for their characterization based on the concepts of order of accuracy, stability, and computational cost. Special attention is paid to understand the numerical methods belonging to the Diagonal Element Lambda Operator family, in an attempt to highlight their specificities.

Export citation and abstract BibTeX RIS

1. Introduction

The transfer of partially polarized light is described by a system of coupled first-order, inhomogeneous ordinary differential equations. Explicitly,

Equation (1)

where s is the spatial coordinate measured along the ray under consideration, ${\boldsymbol{I}}={({I}_{1},{I}_{2},{I}_{3},{I}_{4})}^{T}\equiv {(I,Q,U,V)}^{T}$ is the Stokes vector,

is the propagation matrix, and ${\boldsymbol{\epsilon }}={({\epsilon }_{I},{\epsilon }_{Q},{\epsilon }_{U},{\epsilon }_{V})}^{T}$ is the emission vector, which represents the source term. The different coefficients appearing in the propagation matrix and in the emission vector depend on the considered frequency, on the propagation direction, and on different atmospheric parameters (Landi Degl'Innocenti & Landi Degl'Innocenti 1985; Landi Degl'Innocenti 1987). For notational simplicity, the frequency dependence of the quantities is not explicitly indicated.

Analytical solutions of Equation (1) are available for a few simple model atmospheres only (López Ariste & Semel 1999a), which explains the necessity of numerical schemes able to solve it. The definition of formal solution was first introduced for the scalar problem: it is the evaluation of the radiation intensity, given knowledge of the boundary conditions and the spatial, angular, and frequency dependence of the opacity and the emissivity at a discrete set of points (Mihalas 1978; Auer 2003). The generalization to the polarized case consists in substituting radiation intensity, opacity, and emissivity by Stokes vector, propagation matrix, and emission vector, respectively.

The formal solution of the radiative transfer equation is a key step of iterative schemes for solving the nonlinear full radiative transfer problem, where the atomic system and the radiation field interact in non-local thermodynamic equilibrium conditions. From the computational point of view, the formal solution is, in most cases, the slowest part of the iterative scheme (e.g., Štěpán & Trujillo Bueno 2013). Moreover, large magnetohydrodynamic simulations of stellar atmospheres call for massive synthesis of Stokes profiles. Therefore, the requirement for the numerical method is to be as accurate and as fast as possible. The effort of the community has produced an extensive literature on the different formal solvers, the major contributions being summarized in Table 1. The quest of the "best formal solver" available is still open and the comparison between the different numerical schemes provided by the community is somehow confusing.

Table 1.  List of Formal Solvers Proposed by Different Authors

Year Method Proposed by
1974 Runge–Kutta–Merson Wittmann (1974)
1976 Runge–Kutta 4 Landi Degl'Innocenti (1976)
1985 Piecemeal Evolution Operator Landi Degl'Innocenti & Landi Degl'Innocenti (1985)
1989 Zeeman Feautrier Rees et al. (1989)
1989 DELO-linear Rees et al. (1989)
1998 (cubic) Hermitian Bellot Rubio et al. (1998)
1999 DIAGONAL López Ariste & Semel (1999b)
2003 DELOPAR Trujillo Bueno (2003)
2013 (quadratic and cubic) DELO-Bézier De la Cruz Rodríguez & Piskunov (2013)
2013 BESSER Štěpán & Trujillo Bueno (2013)
2016 Piecewise Continuous Steiner et al. (2016)

Download table as:  ASCIITypeset image

This paper aims to give a structured overview over formal solvers, in particular over those belonging to the Diagonal Element Lambda Operator (DELO) family, and to clarify some incoherences found in the literature. Section 2 presents a reference paradigm for the characterization of formal solvers. The concepts of order of accuracy, stability, and computational cost are briefly presented and used to characterize a reference numerical scheme, namely the trapezoidal method. Section 3 provides an introduction to exponential integrators, a class of numerical methods for the solution of differential equations. A simple description of this class is given, in an attempt to highlight its specific features and its paternity of the DELO methods. In Section 4, the well known DELO family is presented and characterized. Some new methods belonging to this family are introduced and compared to the already existing ones. Finally, Section 5 provides remarks and conclusions, with a view on future work.

2. Characterization of Formal Solvers

In order to be able to answer the question "which is the best formal solver?," one first has to fix some criteria for judging the different numerical schemes. Briefly discussing a method, claiming some strong points, and showing them with a few specific examples, is not the best way to proceed. The literature about the numerical approach to ordinary differential equations is particularly broad and great efforts have been directed toward the characterization of the different methods. This section aims to give an overview on this characterization, in order to facilitate the comprehension of the advantages, weaknesses, and possible incoherences of the already existing formal solvers and of those yet to come. To ease the appreciation, a very common and simple method is presented and analyzed, namely the trapezoidal method.

2.1. Exempli Gratia: The Trapezoidal Method

A spatial grid $\{{s}_{k}\}$ $(k=0,\ldots ,N)$ is introduced, discretizing the ray path. The spatial coordinate s and the index k increase along the propagation direction. For a given grid point sk, the points ${s}_{k-1}$ and ${s}_{k+1}$ represent the upwind and downwind points, respectively, guaranteeing ${s}_{k-1}\leqslant {s}_{k}\leqslant {s}_{k+1}$. Applying the fundamental theorem of calculus to Equation (1) in the interval $[{s}_{k},{s}_{k+1}]$, one obtains

Equation (2)

The integral can be approximated by different numerical quadratures and one possibility is to use the trapezoidal rule, i.e.,

Approximating the integral in Equation (2) in terms of the trapezoidal rule, one recovers

Equation (3)

where ${\rm{\Delta }}{s}_{k}={s}_{k+1}-{s}_{k}$ is the cell width. The numerical approximation of a certain quantity at node sk is indicated by substitution of the explicit dependence on s with the subscript k, for instance,

Inserting numerical quantities for ${\boldsymbol{I}}$, ${\bf{K}}$, and ${\boldsymbol{\epsilon }}$ in Equation (3) and applying some algebra, one recovers the implicit linear system

Equation (4)

where the matrices ${{\boldsymbol{\Phi }}}_{k}$ and ${{\boldsymbol{\Phi }}}_{k+1}$ and the vectors ${{\boldsymbol{\Psi }}}_{k}$ and ${{\boldsymbol{\Psi }}}_{k+1}$ are given by

Given the upwind Stokes vector ${{\boldsymbol{I}}}_{k}$ at node sk, one solves the implicit linear system given by Equation (4), finding the emergent Stokes vector ${{\boldsymbol{I}}}_{k+1}$ at node ${s}_{k+1}$.

The trapezoidal method is therefore classified as an implicit method and belongs to both the famous classes of the Runge–Kutta methods and the linear multistep methods.

2.2. Order of Accuracy

In order to recover a good approximation, one tries to maintain as small an error as possible. When discussing numerical schemes, one usually refers to two different kinds of errors: the local truncation error and the global error.

The local truncation error is the error introduced by the numerical scheme in a single step, assuming the exact solution at the precedent step. Considering a scalar initial value problem (IVP) of the general form

supplied by the discrete grid $\{{t}_{k}\}$ $(k=0,\ldots ,N)$, the local truncation error is defined by

where $y({t}_{k-1})$ and $y({t}_{k})$ are the exact solutions at nodes ${t}_{k-1}$ and tk, respectively, and yk represents the numerical approximation at node tk after performing the single step from ${t}_{k-1}$ to tk. The operator $\parallel \cdot \parallel $ represents any suitable vector norm. A numerical method for an ordinary differential equation has order of accuracy p if the local truncation error, which usually depends on the step size denoted by ${\rm{\Delta }}t$, satisfies

Equation (5)

Hence, the larger the order of accuracy, the faster the error is reduced as ${\rm{\Delta }}t$ decreases.

By contrast, the global error is defined as

and represents the accumulation of the local truncation error over all the N steps. If the same amount of error is produced at each step, i.e., without any amplification of errors over subsequent steps, then a local truncation error of $O({\rm{\Delta }}{t}^{p+1})$ implies a global error of $O({\rm{\Delta }}{t}^{p})$. Explicitly

Equation (6)

using Equation (5) and $N\propto 1/{\rm{\Delta }}t$. The constant C typically depends on the exact solution and on other parameters of the numerical scheme, but is strictly independent of ${\rm{\Delta }}t$. The concept of amplification of errors is related to the idea of stability, which will be shortly discussed. The intuitive connection between local truncation error and global error given by Equation (6) can be generalized to any suitable discrete grid (e.g., Deuflhard & Bornemann 2002), including logarithmically spaced grids. In the case of non-uniform discrete grids, the step size ${\rm{\Delta }}t$ must be replaced by the maximal step size given by

with ${\rm{\Delta }}{t}_{k}={t}_{k+1}-{t}_{k}$. The expression "suitable discrete grid" indicates that the maximal step size is inversely proportional to the total number of grid points, i.e., ${\rm{\Delta }}\hat{t}\propto {N}^{-1}$.

The power law presented in Equation (6) implies a linear relationship between the logarithms of the global error EN and the step size ${\rm{\Delta }}t$, i.e.,

where $\tilde{C}=\mathrm{log}(C)$. This relation can be appreciated in a log–log plot, where the resulting straight line, often called the signature or error curve, should show a slope equal to p. The trapezoidal method is well known to be second-order accurate (e.g., Frank 2012) and an explicit example, showing its signature, will be presented later in this paper.

Note, however, that definitions (5) and (6) assume sufficiently small ${\rm{\Delta }}t$ in order to avoid pre-asymptotic behavior, i.e., the fact that for large step sizes the data points are more scattered and do not necessarily follow the power law. For instance, a global error of the form ${E}_{N}\approx {C}_{1}\cdot {\rm{\Delta }}t+{C}_{2}\cdot {\rm{\Delta }}{t}^{2}+{C}_{3}\cdot {\rm{\Delta }}{t}^{3}$ is dominated by the first term for small enough ${\rm{\Delta }}t$, but for large step sizes, higher-order terms may tangibly contribute, depending on their constants C2 and C3.

2.3. Stability

For a numerical scheme, stability means that any numerical error introduced at some stage does not blow up in the subsequent steps of the method. The concept of stability is often related to the concept of stiffness. A differential equation is said to be stiff when some numerical methods have to take an extremely small step size to achieve convergence. Therefore, step-size control is also based on stability requirements, because instabilities lead to a deterioration of accuracy. More details about the concept of stability in the numerical treatment of ordinary differential equations can be found, for instance, in Hackbusch (2014).

There are different ways to determine the stability of a numerical scheme and the stability analysis often depends on the considered class of methods. In the following, the common class of the Runge–Kutta methods is considered. The stability of a numerical method is often deduced through the simple autonomous scalar IVP given by

Equation (7)

with $\lambda \in {\mathbb{C}}$. The solution $y(t)={y}_{0}{e}^{\lambda t}$ converges to zero as $t\to \infty $ for $\mathrm{Re}(\lambda )\lt 0$. A Runge–Kutta method applied to the IVP (7) can be recast into the form

where ϕ is called the stability function. The numerical method is said to be stable if, applied to the IVP (7), it converges to zero for $k\to \infty $ and this condition is equivalent to

Equation (8)

Intuitively, this guarantees that any perturbation in the solution is attenuated with the recursive numerical integration. The stability of a method is therefore related to both the step size ${\rm{\Delta }}t$ and the eigenvalue λ, more precisely to the term $\lambda {\rm{\Delta }}t$. The stability region of a Runge–Kutta method is defined as the set of complex values $\lambda {\rm{\Delta }}t$ for which Equation (8) is satisfied.

The stability analysis for the scalar problem given by Equation (7) can be easily generalized to the linear system of ordinary differential equations given by

Equation (9)

where the d × d matrix ${\bf{A}}$ has a basis of eigenvectors corresponding to the eigenvalues ${\lambda }^{(1)},\ldots ,{\lambda }^{(d)}$. Equation (9) formally corresponds to the homogeneous version of Equation (1). The emission term is deliberately omitted in the stability analysis because of its locality. In fact, the emission term does not affect the propagation of the information, preventing any contribution to the amplification of errors. As shown in Frank (2012), a Runge–Kutta scheme applied to Equation (9) is stable if and only if it guarantees stability once applied to Equation (7), with λ representing any eigenvalue of ${\bf{A}}$. It is important to mention the fact that the eigenvalues of the propagation operator $-{\bf{K}}$ in Equation (1) have always negative real parts. Therefore, A-stability (see below) is a sufficient condition to avoid any instability problem in the formal solution.

Applying the trapezoidal method to the IVP (7), one recovers the following stability function

The stability region for the trapezoidal method is then given by the condition (8) and it is usually displayed as presented in Figure 1(a). If the stability region of a numerical method contains the whole left-hand side of the complex plane, as in the case of Figure 1(a), then the numerical scheme is said to be A-stable. A broad and exhaustive literature is dedicated to the determination of the specific stability regions for the different numerical methods (Dahlquist 1963; Collatz 1966), but a deep digression would stray from the main aim of this paper.

Figure 1.

Figure 1. The stability region for the trapezoidal method for (a) $\lambda ={\lambda }_{k}={\lambda }_{k+1}$, (b) $\lambda ={\lambda }_{k}=\tfrac{1}{5}{\lambda }_{k+1}$, and (c) $\lambda =\tfrac{1}{5}{\lambda }_{k}={\lambda }_{k+1}$. Colors indicate the absolute values of the stability function $| {\phi }_{{\rm{T}}}| $ given by Equation (10). The region where the stability condition is not satisfied, i.e., $| {\phi }_{{\rm{T}}}| \gt 1$, is indicated in white.

Standard image High-resolution image

A strong limitation of this simplified stability analysis is the assumption of a constant eigenvalue λ in Equation (7). A less restrictive stability analysis shows that variations of λ along the integration path could affect the stability region of the numerical method. For example, the stability function of the trapezoidal method in the interval $[{t}_{k},{t}_{k+1}]$ would read

Equation (10)

where ${\lambda }_{k}$ and ${\lambda }_{k+1}$ are the eigenvalues at the positions tk and ${t}_{k+1}$, respectively. Thus, the stability region depends on both eigenvalues ${\lambda }_{k}$ and ${\lambda }_{k+1}$, as illustrated in Figure 1. However, the variation of the eigenvalue λ along the integration path strongly depends on the spatial scale. In particular, a conversion from geometrical height to optical depth (see Appendix A) mitigates fluctuations of the eigenvalues of the propagation operator $-{\bf{K}}$, supporting the assumption of a constant eigenvalue λ in the stability analysis.

2.4. Computational Cost

When designing or choosing a numerical scheme, an important point is the amount of computational time and data storage necessary to execute it (see for instance Goldreich 2008). A suitable parallelization strategy of the radiative transfer problem includes the use of multiple central processor unit (CPU) cores and parallelization via domain decomposition and/or in the frequency domain (Štěpán & Trujillo Bueno 2013). The computational cost may act as an important factor in the choice of the appropriate formal solver, especially for large scale applications in which the repetitive integration of the radiative transfer equation plays a leading role.

The computational cost analysis of numerical schemes must be based on the premise that it depends on the specific coding, programming language, compiler, and computer architecture. Therefore, the interpreted Octave language used in this paper is not suitable to reliably determine time costs. Nevertheless, one can make some objective considerations. First of all, common sense suggests keeping the algorithm as sleek as possible, avoiding any unnecessary superstructure. Second, basic floating-point operations are carried out directly on the CPU, whereas elementary functions are usually emulated on a higher level. Correspondingly, the evaluation of, e.g., an exponential is 10–40 times more expensive than a floating-point multiplication (Schörghofer 2015). A third remark is made on the difference between explicit and implicit schemes. An explicit one-step method calculates the updated numerical value ${y}_{k+1}$ directly from the precedent value yk, i.e.,

while implicit one-step methods find ${y}_{k+1}$ by solving an equation of the type

which results in the additional solution of a 4 × 4 implicit linear system, when considering Equation (1).

Concerning the data storage cost, a simple consideration can be made. In the short characteristic strategy the formal solver integrates step by step the radiative transfer equation along the ray path, using only local atmospheric quantities (Auer 2003). Therefore, the small amount of information retained avoids any data storage problem.

3. Exponential Integrators

Exponential integrators form a class of numerical methods for ordinary differential equations. This class is based on the exact integration of the linear part of the IVP, aiming at reducing the stiffness of the differential equation.

In order to present this class, one considers the following general IVP

Equation (11)

which is equivalent to Equation (1) given the initial value I0. One splits ${\boldsymbol{G}}$ into linear and nonlinear contributions, i.e.,

where the matrix ${\bf{L}}$ does not depend on the variable t and the nonlinear term is given by ${\boldsymbol{N}}={\boldsymbol{G}}-{\bf{L}}{\boldsymbol{y}}$. Equation (11) is then recast in the form

Equation (12)

and the exact integration of the linear part in the interval $[{t}_{0},t]$ yields the variation of constants formula

Equation (13)

The integral in Equation (13) has to be numerically approximated and a large variety of different options is available, e.g., Runge–Kutta discretizations as explained by Cox & Matthews (2002). Moreover, for a non-diagonal matrix ${\bf{L}}$, the evaluation of the matrix exponential often requires an approximation and one has to combine the integrator with well-chosen algorithms from numerical linear algebra.

In the following section, a specific strategy is applied, where only the nonlinear term ${\boldsymbol{N}}$ is approximated and the exponential operator is treated exactly.

4. The DELO Family

In this section, a particular family of methods belonging to the class of exponential integrators is presented. The first method of this family applied to the formal solution for polarized light was proposed by Rees et al. (1989) under the name of DELO. Thereafter, a second version by Trujillo Bueno (2003) took the appellative DELOPAR. Additional improvements, in terms of Bézier interpolations, were recently provided by De la Cruz Rodríguez & Piskunov (2013) and by Štěpán & Trujillo Bueno (2013).

As exhaustively explained by Guderley & Hsu (1972), this technique takes into account analytically the diagonal elements of the propagation matrix ${\bf{K}}$, aiming to remove stiffness from the problem. Therefore, the well-known radiative transfer equation for polarized light given by Equation (1) is brought in the form given by Equation (12), with the additional constraint of a diagonal matrix ${\bf{L}}$. This reformulation is facilitated by the fact that the diagonal elements of the propagation matrix are all identical. Replacing the coordinate s by the optical depth τ defined by

Equation (14)

one recasts Equation (1) into

Equation (15)

where $1$ represents the 4 × 4 identity matrix. The quantity ${\mathscr{S}}$ is the effective source function and it is defined by

with the modified propagation matrix ${\bf{K}}={\bf{K}}/{\eta }_{I}-{\bf{1}}$ (whose diagonal elements are all equal to zero) and the modified emission vector $\tilde{{\boldsymbol{\epsilon }}}={\boldsymbol{\epsilon }}/{\eta }_{I}$.

Observe that the optical depth scale is in fact defined so that the coordinate τ, defined in Equation (14), decreases along the ray path, thus $\tau \leqslant {\tau }_{0}$. Providing the upwind Stokes ${{\boldsymbol{I}}}_{0}={\boldsymbol{I}}({\tau }_{0})$, the operator on the left of Equation (15) is inverted leading to the formula

Equation (16)

which is analogous to Equation (13).

As anticipated, different numerical quadratures of the integral in Equation (16) lead to different numerical schemes. In particular, the DELO technique approximates the effective source function ${\bf{S}}$ by a polynomial ${{\boldsymbol{P}}}_{q}$ of degree q inside the integration interval, i.e.,

Equation (17)

Observing that ${\tau }_{k+1}\leqslant {\tau }_{k}$ and evaluating Equation (16) in the interval $[{\tau }_{k},{\tau }_{k+1}]$, one obtains

Equation (18)

where ${\rm{\Delta }}{\tau }_{k}={\tau }_{k}-{\tau }_{k+1}$. The integral can then be solved by parts, yielding an implicit or explicit linear system for the Stokes vector ${{\boldsymbol{I}}}_{k+1}$.

As explained by Guderley & Hsu (1972), the local truncation error is due to the fact that the effective source function is approximated by a polynomial of degree q. According to Henrici (1962), this approximation results in the following local truncation error

Equation (19)

and, from definition (5), a DELO method involving a polynomial ${{\boldsymbol{P}}}_{q}$ should show an order of accuracy equal to $q+1$. Equation (19) is not defined for q = 0, i.e., for a constant approximation of the effective source function. In this case, the local truncation error corresponds to the one for q = 1, following a behavior similar to the implicit midpoint method (Deuflhard & Bornemann 2002). Moreover, the evaluation of the quantity ${\rm{\Delta }}{\tau }_{k}$ plays a fundamental role in the accuracy of the numerical scheme, as explained in Appendix A.

As already mentioned, the DELO strategy is thought to remove stiffness from the problem. In fact, a method based on Equation (16) tends to A-stability for a vanishing modified propagation matrix. Therefore, a sufficiently small matrix ${\bf{K}}$ should imply a rather wide stability limit. However, the simple stability analysis presented in the previous section cannot be applied to the present family of methods, because of the two different contributions from the exponential terms and the modified propagation matrix. Nevertheless, some indications can be deduced if the simpler scalar case is analyzed. Guderley & Hsu (1972) show that the DELO strategy increases stability with respect to the corresponding Adams–Bashford methods (Deuflhard & Bornemann 2002).

On the other hand, the second-order accurate trapezoidal method already guarantees A-stability, which dispenses from a stability improvement for formal solvers having an order of accuracy $p\leqslant 2$.

4.1. DELO-constant, DELO-linear, and DELO-parabolic

In a very general way, the polynomial ${{\boldsymbol{P}}}_{q}$ in Equation (17) is obtained by the Lagrangian interpolation

Equation (20)

where ${{\bf{S}}}_{i}=-{{\bf{K}}}_{i}{{\boldsymbol{I}}}_{i}+{\tilde{{\boldsymbol{\epsilon }}}}_{i}$ and the Lagrange basis polynomials i are given by

Here k indicates an arbitrary node on the discretized ray path. The integral in Equation (18) can then be solved by parts, yielding an implicit linear system of the form

Equation (21)

where the different coefficients ${{\boldsymbol{\Phi }}}_{i}$ and ${{\boldsymbol{\Psi }}}_{i}$ depend on the chosen polynomial ${{\boldsymbol{P}}}_{q}$ and on the numerical values ${{\bf{K}}}_{{i}}$ and ${\tilde{{\boldsymbol{\epsilon }}}}_{i}$. Provided the previous Stokes ${{\boldsymbol{I}}}_{i}$ for $i=k-q+1,\ldots ,k$, the linear system (21) can be solved to obtain the numerical approximation ${{\boldsymbol{I}}}_{k+1}$ at ${\tau }_{k+1}$. Therefore, once given the boundary condition I0 at ${\tau }_{0}$, the recursive application of Equation (21) provides the emergent Stokes vector ${{\boldsymbol{I}}}_{N}$ at the end of the ray path.

As first choice, the effective source function is assumed constant inside the interval $[{\tau }_{k},{\tau }_{k+1}]$, i.e.,

and can be approximated by the midpoint rule

Replacing the polynomial ${{\boldsymbol{P}}}_{q}$ in Equation (18) by the constant approximation described above, one can calculate the integral and after some algebra obtain an implicit linear system formally identical to Equation (4), i.e.,

Equation (22)

The method described by Equation (22), which might be called DELO-constant, is second-order accurate, as shown in Figure 2. The explicit values of the coefficients ${{\boldsymbol{\Phi }}}_{k}$, ${{\boldsymbol{\Phi }}}_{k+1}$, ${{\boldsymbol{\Psi }}}_{k}$, and ${{\boldsymbol{\Psi }}}_{k+1}$ are provided in Appendix B.

Figure 2.

Figure 2. The log–log representation of the global error for the Stokes vector components $I,Q,U$, and V as a function of the number of points-per-decade of the continuum optical depth for the trapezoidal, DELO-constant, DELO-linear, and DELO-parabolic methods. The considered atmospheric model is described in Appendix C and the global error is computed as shown in Appendix D.

Standard image High-resolution image

The next step is to obtain a relation between ${{\boldsymbol{I}}}_{k}$ and ${{\boldsymbol{I}}}_{k+1}$ when approximating the effective source function by a linear interpolation, i.e., Equation (20) with q = 1,

for τ in the interval $[{\tau }_{k},{\tau }_{k+1}]$. One proceeds with an analytical integration and after some algebra obtains

Equation (23)

which is an implicit linear system formally identical to Equations (4) and (22). The numerical scheme described by Equation (23) was presented by Rees et al. (1989) under the name of DELO and subsequently re-baptized by De la Cruz Rodríguez & Piskunov (2013) as DELO-linear. It is a second-order accurate method, as shown in Figure 2. The explicit values of the coefficients ${{\boldsymbol{\Phi }}}_{k}$, ${{\boldsymbol{\Phi }}}_{k+1}$, ${{\boldsymbol{\Psi }}}_{k}$, and ${{\boldsymbol{\Psi }}}_{k+1}$ are provided in Appendix B.

The natural successive step is a parabolic Lagrangian interpolation of the effective source function (Murphy 1990), i.e., Equation (20) with q = 2. Considering three spatial points $\{{\tau }_{k-1},{\tau }_{k},{\tau }_{k+1}\}$ located along the optical depth grid, the effective source function ${\bf{S}}$ is approximated by the parabolic interpolation,

for τ in the interval $[{\tau }_{k},{\tau }_{k+1}]$. The necessity of a third interpolation point cannot be satisfied by numerical values at ${\tau }_{k+2}$, because no information is available for ${{\boldsymbol{I}}}_{k+2}$. After two integrations by parts of Equation (18) and some algebra, one obtains

Equation (24)

The method described by Equation (24), which might be called DELO-parabolic, is third-order accurate, as shown in Figure 2. The coefficients ${{\boldsymbol{\Phi }}}_{k-1}$, ${{\boldsymbol{\Phi }}}_{k}$, ${{\boldsymbol{\Phi }}}_{k+1}$, ${{\boldsymbol{\Psi }}}_{k-1}$, ${{\boldsymbol{\Psi }}}_{k}$, and ${{\boldsymbol{\Psi }}}_{k+1}$ are provided in Appendix B. The higher order of accuracy is essential to detect, for instance, the second-order behavior of the emission vector often present in realistic atmospheric models, avoiding its possible systematic overestimation.

DELO-constant and DELO-linear formal solvers compute the Stokes ${{\boldsymbol{I}}}_{k+1}$ solely on the basis of information about the preceding Stokes value ${{\boldsymbol{I}}}_{k}$ and are classified as one-step methods. In this sense, one-step methods have no memory, i.e., they forget all of the prior information that has been gained. In contrast, DELO-parabolic takes into account the two most recently found Stokes vectors ${{\boldsymbol{I}}}_{k}$ and ${{\boldsymbol{I}}}_{k-1}$, entering in the class of the multistep methods.

This family of formal solvers can be further expanded by just increasing the interpolation degree q of the effective source function. For instance, a third-order Lagrangian polynomial would generate DELO-cubic. However, the complexity of the numerical methods would increase as well and the adaptation to non-uniform grids would become gradually more cumbersome.

One should emphasize that the detailed behavior of the error curves plotted in Figures 2 and 3 depends on the considered atmospheric model. Indeed, the main scope of these figures is to highlight the overall order of accuracy of the various methods. It would certainly be wrong to try to reach conclusions on the performance of different methods on the basis of a qualitative comparison of small details of the error curves, obtained considering a single model atmosphere. The atmospheric model considered in this work is described in Appendix C.

Figure 3.

Figure 3. The log–log representation of the global error for the Stokes vector components $I,Q,U$, and V, as a function of the number of points-per-decade of the continuum optical depth for the DELOPAR, BESSER, quadratic, and cubic DELO-Bézier methods. The considered atmospheric model is described in Appendix C and the global error is computed as shown in Appendix D. The asymptotic behavior of the quadratic and cubic DELO-Bézier methods can only be appreciated in the interval between 3 and 11 points-per-decade.

Standard image High-resolution image

4.2. DELO-Bézier Methods

The choice of the Lagrangian form for the polynomial ${{\boldsymbol{P}}}_{q}$ in Equation (17) is certainly not univocal and the literature provides different interpolation methods. Mihalas et al. (1978) used the Hermitian interpolation for the integration of the scalar radiative transfer equation. An interesting set of suitable interpolants was proposed by Auer (2003) for the same problem: among them the monotonic Hermite interpolants recently used by Ibgui et al. (2013) in the IRIS code. In the same year, De la Cruz Rodríguez & Piskunov (2013) applied Bézier polynomials to the DELO strategy, generating the quadratic and cubic DELO-Bézier methods.

A detailed description of these methods, and a comparison with other methods, such as DELO-linear and DELOPAR, can be found in the above-mentioned publications, and will not be repeated here. As shown in Figure 3, both quadratic and cubic DELO-Bézier methods show fourth-order accuracy. It is interesting to observe that when treating smooth functions, the Bézier curves introduced by De la Cruz Rodríguez & Piskunov (2013) are forced to be identical to the Hermite polynomials of corresponding degree by adopting very specific control points. A detailed discussion of Hermitian methods will be presented in the second paper of this series, which will focus on high-order methods.

4.3. Particular Cases: DELOPAR and BESSER

Aiming to increase the order of accuracy with respect to the DELO-linear strategy, Trujillo Bueno (2003) opted for a semi-parabolic interpolation, namely a parabolic interpolation for the modified emission vector $\tilde{{\boldsymbol{\epsilon }}}$ and a linear interpolation for the ${\bf{K}}{\boldsymbol{I}}$ term. The parabolic interpolation of $\tilde{{\boldsymbol{\epsilon }}}$ is performed by considering the three spatial points $\{{\tau }_{k},{\tau }_{k+1},{\tau }_{k+2}\}$, which differs from the set used by DELO-parabolic. In the case of a diagonal dominant propagation matrix, this strategy can increase the order of accuracy in the Stokes parameter I, provided a high-order integration of the opacity (see Appendix A). However, the local truncation error in the Stokes Q, U, and V, is still dominated by the linear approximation of the term ${\bf{K}}{\boldsymbol{I}}$, resulting in a second-order method as shown in Figure 3. Therefore, the lack of improvement with respect to DELO-linear comes directly from the design of the method and not, as erroneously conjectured, from the so-called overshooting. This should also explain the unsatisfactory performance of DELOPAR found by De la Cruz Rodríguez & Piskunov (2013), as presented in their error curves. The DELOPAR strategy remains a honest method for the non-polarized case or for vanishing dichroism and anomalous dispersion coefficients. In fact, in both cases, the modified propagation matrix ${\bf{K}}$ disappears and a parabolic approximation of the source term $\tilde{{\boldsymbol{\epsilon }}}$, supported by a proper conversion to optical depth (see Appendix A), produces an effective third-order numerical scheme. This can be appreciated in Štěpán & Trujillo Bueno (2013), where the log–log error figure clearly shows the superiority of DELOPAR over DELO-linear for the scalar case. Štěpán & Trujillo Bueno (2013) applied a similar technique to create BESSER, the formal solver used in the PORTA code. The same argumentation can be applied to it, explaining its second-order accuracy confirmed by Figure 3.

5. Conclusions

This paper pays particular attention to the characterization of the different formal solvers for polarized radiative transfer. The paradigmatic analysis for numerical schemes proposed here is based on three different criteria: order of accuracy, stability, and computational cost.

The order of accuracy of a numerical scheme indicates how fast the error decreases, when reducing the step-size. Therefore, it can be only appreciated by considering the global (or local) error dependence on the step-size and not through single Stokes profiles.

When discussing numerical methods, the term stability relates to the attenuation of numerical errors with the recursive application of a scheme and not to the erratic behavior of high-order polynomial approximation (the so-called overshooting). One must always guarantee that any possible instability is avoided, when judging the order of accuracy of a method.

The computational cost is related to the complexity of the algorithm. Therefore, one suggests maintain as bony a method as possible, avoiding unnecessary superstructures.

In this perspective, some considerations about the DELO family can be exposed. Regarding the order of accuracy, one realizes that DELO methods do not converge better than more conventional methods: DELO-constant, DELO-linear, and DELOPAR are only second-order accurate, the two-step DELO-parabolic method effectively reaches third-order accuracy, and quadratic and cubic DELO-Bézier methods usually perform as fourth-order accurate methods, as summarized in Table 2. One must point out that second-order accuracy is already guaranteed by the simple trapezoidal method. The discussion about stability is somehow more involved. The DELO approach is thought to remove stiffness from the problem, but no stability improvement is necessary for formal solvers having an order of accuracy $p\leqslant 2$, because the trapezoidal method already guarantees A-stability as shown in Figure 1(a). However, one could still promote the DELO strategy for high-order methods ($p\geqslant 3$). Concerning the computational cost, DELO methods differ from standard implicit numerical methods for two reasons: the DELO strategy requires an additional conversion to optical depth (see Appendix A), which, however, is important to mitigate the variation of the eigenvalues of the propagation operator $-{\bf{K}}$ along the ray path, enforcing stability. Moreover, DELO coefficients require the evaluation of exponential terms (see Appendix B) and this results in extra computational cost, as explained in Section 2.4.

Table 2.  Order of Accuracy for DELO Methods

Formal Solver Order of Accuracy
DELO-constant 2
DELO-linear 2
DELO-parabolic 3
DELOPAR 2
BESSER 2
quadratic DELO-Bézier 4
cubic DELO-Bézier 4

Download table as:  ASCIITypeset image

In conclusion, the necessity of the DELO strategy for the numerical treatment of the polarized radiative transfer is not warranted for low-order methods and not well motivated for high-order methods. A second paper, focused on high-order formal solvers, will try to build a clear hierarchy with respect to order of accuracy, stability and computational cost. The effective performances when dealing with realistic atmospheric models remain to be explored.

The financial support by the Swiss National Science Foundation (SNSF) through grant ID 200021_159206/1 is gratefully acknowledged. Special thanks are extended to F. Calvo, A. Paganini, and F. Züger for particularly enriching discussions. The authors are grateful to the anonymous referee for providing valuable comments that helped improving the article.

Appendix A: Conversion to Optical Depth

As already pointed out by De la Cruz Rodríguez & Piskunov (2013), many radiative transfer applications require a conversion of the spatial scale, e.g., from geometrical height s to optical depth τ. From Equation (14) one obtains

Equation (25)

The numerical integration introduces an error, which could lead to a reduced order of accuracy of the formal solver. In practice, a trapezoidal integration of Equation (25) is inadequate to perform numerical schemes based on high-order interpolations of the effective source function (e.g., DELO-parabolic). Therefore, high-order DELO methods require a corresponding high-order numerical evaluation of the integral in Equation (25).

Appendix B: DELO Coefficients

In order to keep the notation as close as possible to Rees et al. (1989), the following definitions are introduced

The coefficients of the DELO-constant method, Equation (22), are given by

The coefficients of the DELO-linear method, Equation (23), are given by

The coefficients of the DELO-parabolic method, Equation (24), are given by

with

The DELO-parabolic coefficients ${{\rm{\Phi }}}_{k-1},{{\rm{\Phi }}}_{k},{{\rm{\Phi }}}_{k+1}$ could suffer of problematic division with vanishingly small quantities. Thus, in case of small ${\rm{\Delta }}\tau $, a Taylor expansion of the exponential term Ek to third-order is indicated.

Appendix C: Atmospheric Model

The atmosphere model and parametric description used for the calculations shown in this paper are very similar to the ones used by Steiner et al. (2016). The radiative transfer is computed at different frequencies ν in a spectral interval containing a hypothetical, magnetically sensitive spectral line. The problem is formulated in reduced frequencies

Equation (26)

with ${\nu }_{0}$ the line-center frequency and ${\rm{\Delta }}{\nu }_{D}$ the Doppler width. The spectral interval $v\in [-6,6]$ is considered. The atmosphere is assumed to be plane-parallel, and the chosen reference spatial coordinate is the continuum optical depth along the vertical direction, defined by

where kc is the continuum absorption coefficient at the line-center frequency, and z is the geometrical height (increasing in the outward direction). The atmosphere extends in the range $\mathrm{log}{\tau }_{c}\in [-5,2]$.

No scattering or atomic polarization is taken into account, so that polarization is only introduced by the Zeeman effect. The magnetic field vector and the absorption and anomalous dispersion profiles (which enter in the definition of the coefficients of the propagation matrix ${\bf{K}}$ and of the emission vector ${\boldsymbol{\epsilon }}$) are assumed to be depth-independent. Under these assumptions, the propagation matrix can be parametrized in the form (see, e.g., Landi Degl'Innocenti & Landolfi 2004; Steiner et al. 2016)

where k is the ratio between the frequency-integrated line absorption coefficient and the continuum absorption coefficient, and ${\bf{H}}$ is a constant 4 × 4 matrix. The emission vector, on the other hand, can be written as

where S is the usual intensity source function. The following analytical form of S and k (the only depth-dependent quantities of the problem) have been considered

Equation (27)

Equation (28)

The exponential term in Equation (27) is included in order to reproduce a possible emission rise at low optical depths (e.g., at chromospheric heights). The results of Figures 2 and 3 have been obtained on the basis of the smooth variation of S and k shown in Figure 4, with the following values of the parameters appearing in Equations (27) and (28): ${A}_{1}=20$, ${A}_{2}=10$, ${\tau }_{1}={10}^{-4}$, ${\tau }_{2}=0.183$, B = 25, and ${\tau }_{k}=0.123$.

Figure 4.

Figure 4. The source function S and the ratio k according to Equations (27)–(28). The green dots represent a sampling with 3 points-per-decade.

Standard image High-resolution image

The intensity of the magnetic field is specified through the dimensionless parameter ${v}_{B}={\nu }_{L}/{\rm{\Delta }}{\nu }_{D}$, ${\nu }_{L}$ being the Larmor frequency. The results shown in Figures 2 and 3 have been obtained for vB = 1.5, and assuming the magnetic field to have an inclination $\theta =60^\circ $ with respect to the vertical. A damping constant a = 0.05 has been chosen for calculating the Voigt and Faraday–Voigt profiles entering the definition of the radiative transfer coefficients. No macroscopic (bulk) velocity has been considered. The calculations have been performed for the radiation propagating outwards in the atmosphere, along the vertical direction. At the bottom of the atmosphere (boundary condition) an unpolarized radiation field ${{\boldsymbol{I}}}_{0}={({B}_{0},0,0,0)}^{T}$ has been introduced. The Stokes parameters of the emergent radiation have been calculated in units of the parameter B0, whose exact value is thus irrelevant for the calculations shown in this paper. The reference direction for positive Stokes Q has been taken in the plane defined by the vertical and by the magnetic field.

Appendix D: Error Calculation

Denoting with ${{\boldsymbol{I}}}^{\mathrm{ref}}(v)$ and ${{\boldsymbol{I}}}^{\mathrm{num}}(v)$ the reference and the numerically computed emergent Stokes vectors, respectively, at the reduced frequency v, the global error for the ith Stokes vector component is computed as

Equation (29)

The error is given by the maximal discrepancy between the reference and the simulated Stokes parameter over the entire profile, normalized by the maximal amplitude in the reference profile. Equation (29) is not defined for a constant profile, because of a vanishing denominator. In that case, one needs to introduce a different error definition. The reference emergent Stokes profile ${{\boldsymbol{I}}}^{\mathrm{ref}}(v)$ is the exact solution approximated by means of high-order numerical methods, using a hyperfine grid sampling with more than 103 points-per-decade of the continuum optical depth. Different high-order methods (e.g., DELO-parabolic and quadratic DELO-Bézier) are used to cross-check the reference emergent profile.

Please wait… references are loading.
10.3847/1538-4357/aa671d