The field of nanomaterial pharmacokinetics is in its infancy. Major advances are restricted by a lack of biologically relevant metrics, by fundamental differences between how particles and small organic molecules (chemicals and drugs) interact with the biological processes involved in disposition, by a scarcity of sufficiently rich and well-characterized in vivo data, and by a lack of computational approaches for linking nanomaterial properties to biological endpoints. A central concept connecting nanomaterial properties to biological disposition, in addition to their colloidal properties, is the tendency to form a biocorona, which modulates biological interactions including cellular uptake and biodistribution. The dynamics of corona formation are therefore crucial to the rate and extent of biodisposition, and pharmacokinetic models must take this process into account to accurately predict in vivo disposition, especially when extrapolating from laboratory animals to humans, since allometric principles may not apply. The challenge will be to develop a quantitative metric that characterizes a nanoparticle's surface adsorption forces, which are important for predicting biocorona dynamics. Integrative quantitative approaches of the kind discussed in this paper for the dynamics of corona formation must be developed before realistic risk assessment of engineered nanomaterials can be accomplished.
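To make the corona-dynamics argument concrete, the sketch below is a minimal illustration, not the authors' model: it integrates a generic Langmuir-type adsorption/desorption equation for the fractional protein coverage of a nanoparticle surface, with hypothetical rate constants and protein concentration, to show the kind of quantitative surface-adsorption description the abstract calls for.

# Minimal sketch (not the authors' model): Langmuir-type kinetics for the
# fractional protein coverage theta(t) of a nanoparticle surface.
# k_on, k_off and the free protein concentration C are assumed values.
import numpy as np
from scipy.integrate import solve_ivp

k_on = 1e4    # association rate constant (M^-1 s^-1), assumed
k_off = 1e-2  # dissociation rate constant (s^-1), assumed
C = 1e-6      # free protein concentration (M), assumed constant

def dtheta_dt(t, theta):
    # Adsorption onto free sites minus desorption from occupied sites.
    return k_on * C * (1.0 - theta) - k_off * theta

sol = solve_ivp(dtheta_dt, (0.0, 600.0), [0.0], dense_output=True)
t = np.linspace(0.0, 600.0, 5)
print(np.round(sol.sol(t)[0], 3))  # coverage approaching k_on*C/(k_on*C + k_off)

In this toy setting the steady-state coverage and the approach time scale follow directly from the two rate constants; a metric of surface adsorption forces would, in effect, supply such constants for real nanoparticle-protein pairs.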
ISSN: 1749-4699
Computational Science & Discovery is an international, multidisciplinary journal focused on scientific advances and discovery through computational science in physics, chemistry, biology and applied science.
Computational Science & Discovery will cease publication with the 2015 volume. For further details on this decision, please visit https://ioppublishing.org/newsDetails/computational-science-and-discovery-to-cease-publication.
Jim E Riviere et al 2013 Comput. Sci. Discov. 6 014005
James Bergstra et al 2015 Comput. Sci. Discov. 8 014008
Sequential model-based optimization (also known as Bayesian optimization) is one of the most efficient methods (per function evaluation) of function minimization. This efficiency makes it appropriate for optimizing the hyperparameters of machine learning algorithms that are slow to train. The Hyperopt library provides algorithms and parallelization infrastructure for performing hyperparameter optimization (model selection) in Python. This paper presents an introductory tutorial on the usage of the Hyperopt library, including the description of search spaces, minimization (in serial and parallel), and the analysis of the results collected in the course of minimization. This paper also gives an overview of Hyperopt-Sklearn, a software project that provides automatic algorithm configuration of the Scikit-learn machine learning library. Following Auto-Weka, we take the view that the choice of classifier and even the choice of preprocessing module can be taken together to represent a single large hyperparameter optimization problem. We use Hyperopt to define a search space that encompasses many standard components (e.g. SVM, RF, KNN, PCA, TFIDF) and common patterns of composing them together. We demonstrate, using search algorithms in Hyperopt and standard benchmarking data sets (MNIST, 20-newsgroups, convex shapes), that searching this space is practical and effective. In particular, we improve on best-known scores for the model space for both MNIST and convex shapes. The paper closes with some discussion of ongoing and future work.
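As a concrete illustration of the workflow described above, here is a minimal Hyperopt call that minimizes a toy objective over a one-dimensional search space with the TPE algorithm; the objective function and bounds are invented for the example.

# Minimal Hyperopt usage sketch: define a search space, minimize with TPE,
# and inspect the recorded trials. The objective here is a toy example.
from hyperopt import fmin, tpe, hp, Trials

def objective(x):
    return (x - 0.5) ** 2  # value to minimize

space = hp.uniform('x', -2.0, 2.0)  # one-dimensional search space

trials = Trials()
best = fmin(fn=objective, space=space, algo=tpe.suggest,
            max_evals=100, trials=trials)
print(best)                # e.g. {'x': 0.49...}
print(len(trials.trials))  # 100 evaluations recorded

The same pattern scales to the nested, conditional search spaces used by Hyperopt-Sklearn, where the choice of classifier and preprocessing module become additional hyperparameters.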
Suraj S Deshpande et al 2013 Comput. Sci. Discov. 5 014016
The performance of the open source multiphase flow solver, interFoam, is evaluated in this work. The solver is based on a modified volume of fluid (VoF) approach, which incorporates an interfacial compression flux term to mitigate the effects of numerical smearing of the interface. It forms a part of the C++ libraries and utilities of OpenFOAM and is gaining popularity in the multiphase flow research community. However, to the best of our knowledge, the evaluation of this solver is confined to the validation tests of specific interest to the users of the code and the extent of its applicability to a wide range of multiphase flow situations remains to be explored. In this work, we have performed a thorough investigation of the solver performance using a variety of verification and validation test cases, which include (i) verification tests for pure advection (kinematics), (ii) dynamics in the high Weber number limit and (iii) dynamics of surface tension-dominated flows. With respect to (i), the kinematics tests show that the performance of interFoam is generally comparable with the recent algebraic VoF algorithms; however, it is noticeably worse than the geometric reconstruction schemes. For (ii), the simulations of inertia-dominated flows with large density ratios yielded excellent agreement with analytical and experimental results. In regime (iii), where surface tension is important, consistency of pressure–surface tension formulation and accuracy of curvature are important, as established by Francois et al (2006 J. Comput. Phys. 213 141–73). Several verification tests were performed along these lines and the main findings are: (a) the algorithm of interFoam ensures a consistent formulation of pressure and surface tension; (b) the curvatures computed by the solver converge to a value slightly (10%) different from the analytical value and a scope for improvement exists in this respect. To reduce the disruptive effects of spurious currents, we followed the analysis of Galusinski and Vigneaux (2008 J. Comput. Phys. 227 6140–64) and arrived at a criterion for stable capillary simulations with interFoam, which bounds the admissible time step in terms of the viscous and capillary time scales of the mesh. Finally, some capillary flows relevant to atomization were simulated, resulting in good agreement with the results from the literature.
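For readers who want the shape of that constraint, the block below gives the generic form following Galusinski and Vigneaux; the constants C1 and C2 are left unspecified here (the calibrated values for interFoam are reported in the paper itself), so this is a sketch of the structure rather than the paper's exact result.

% Generic form of the capillary time-step constraint (after Galusinski and
% Vigneaux 2008); the calibrated constants for interFoam are given in the
% paper and are not reproduced here.
\[
\Delta t \;\le\; \max\!\left( C_2\,\tau_\mu,\; C_1\,\tau_\rho \right),
\qquad
\tau_\mu = \frac{\mu\,\Delta x}{\sigma},
\qquad
\tau_\rho = \sqrt{\frac{\rho\,\Delta x^{3}}{\sigma}} ,
\]

where \(\tau_\mu\) and \(\tau_\rho\) are the viscous and capillary time scales built from the mesh spacing \(\Delta x\), viscosity \(\mu\), density \(\rho\) and surface tension \(\sigma\).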
Lion Krischer et al 2015 Comput. Sci. Discov. 8 014003
The Python libraries NumPy and SciPy are extremely powerful tools for numerical processing and analysis well suited to a large variety of applications. We developed ObsPy (http://obspy.org), a Python library for seismology intended to facilitate the development of seismological software packages and workflows, to utilize these abilities and provide a bridge for seismology into the larger scientific Python ecosystem. Scientists in many domains who wish to convert their existing tools and applications to take advantage of a platform like the one Python provides are confronted with several hurdles such as special file formats, unknown terminology, and no suitable replacement for a non-trivial piece of software. We present an approach to implement a domain-specific time series library on top of the scientific NumPy stack. In so doing, we show a realization of an abstract internal representation of time series data permitting I/O support for a diverse collection of file formats. Then we detail the integration and repurposing of well established legacy codes, enabling them to be used in modern workflows composed in Python. Finally we present a case study on how to integrate research code into ObsPy, opening it to the broader community. While the implementations presented in this work are specific to seismology, many of the described concepts and abstractions are directly applicable to other sciences, especially to those with an emphasis on time series analysis.
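A minimal usage sketch of the kind of workflow ObsPy enables is given below; it reads the example waveform bundled with the library (read() with no arguments), applies a bandpass filter, and prints basic metadata. The filter corner frequencies are illustrative values.

# Minimal ObsPy sketch: read the bundled example waveform, filter a copy,
# and inspect its metadata. Corner frequencies are illustrative values.
from obspy import read

st = read()              # with no argument, loads ObsPy's example Stream
tr = st[0]               # first Trace in the Stream
print(tr.stats.station, tr.stats.sampling_rate)

tr_filtered = tr.copy()  # keep the original data untouched
tr_filtered.filter("bandpass", freqmin=1.0, freqmax=10.0)
tr_filtered.plot()       # quick-look plot of the filtered trace

The same Stream/Trace abstraction is what allows the diverse file formats mentioned above to be handled behind a single interface.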
J-L Vay et al 2013 Comput. Sci. Discov. 5 014019
The Particle-In-Cell (PIC) Code-Framework Warp is being developed by the Heavy Ion Fusion Science Virtual National Laboratory (HIFS-VNL) to guide the development of accelerators that can deliver beams suitable for high-energy density experiments and implosion of inertial fusion capsules. It is also applied in various areas outside the Heavy Ion Fusion program to the study and design of existing and next-generation high-energy accelerators, including, for example, the study of electron cloud effects and laser wakefield acceleration. This paper presents an overview of Warp's capabilities, summarizing recent original numerical methods that were developed by the HIFS-VNL (including PIC with adaptive mesh refinement, a large-timestep 'drift-Lorentz' mover for arbitrarily magnetized species, a relativistic Lorentz-invariant leapfrog particle pusher, simulations in Lorentz-boosted frames, an electromagnetic solver with tunable numerical dispersion and efficient stride-based digital filtering), with special emphasis on the description of the mesh refinement capability. Selected examples of the applications of the methods to the abovementioned fields are given.
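Warp itself is steered from Python, but its input decks are beyond the scope of this listing. As a generic illustration of the leapfrog particle push performed each step by PIC codes of this kind (not Warp's own pusher), the sketch below advances non-relativistic particles in prescribed fields using the standard Boris rotation.

# Generic (non-relativistic) Boris particle push -- an illustration of the
# leapfrog update used in PIC codes, not Warp's actual implementation.
import numpy as np

def boris_push(x, v, E, B, q, m, dt):
    """Advance positions x and velocities v by one step dt in fields E, B."""
    qmdt2 = q * dt / (2.0 * m)
    v_minus = v + qmdt2 * E                  # first half electric acceleration
    t = qmdt2 * B                            # rotation vector
    s = 2.0 * t / (1.0 + np.sum(t * t, axis=-1, keepdims=True))
    v_prime = v_minus + np.cross(v_minus, t)
    v_plus = v_minus + np.cross(v_prime, s)  # magnetic rotation
    v_new = v_plus + qmdt2 * E               # second half electric acceleration
    x_new = x + v_new * dt                   # leapfrog position update
    return x_new, v_new

# Example: one electron gyrating in a uniform magnetic field.
x = np.zeros((1, 3)); v = np.array([[1.0e5, 0.0, 0.0]])
E = np.zeros((1, 3)); B = np.array([[0.0, 0.0, 1.0e-3]])
x, v = boris_push(x, v, E, B, q=-1.602e-19, m=9.109e-31, dt=1.0e-9)
print(x, v)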
The SunPy Community et al 2015 Comput. Sci. Discov. 8 014009
This paper presents SunPy (version 0.5), a community-developed Python package for solar physics. Python, a free, cross-platform, general-purpose, high-level programming language, has seen widespread adoption among the scientific community, resulting in the availability of a large number of software packages, from numerical computation (NumPy, SciPy) and machine learning (scikit-learn) to visualization and plotting (matplotlib). SunPy is a data-analysis environment specializing in providing the software necessary to analyse solar and heliospheric data in Python. SunPy is open-source software (BSD licence) and has an open and transparent development workflow that anyone can contribute to. SunPy provides access to solar data through integration with the Virtual Solar Observatory (VSO), the Heliophysics Event Knowledgebase (HEK), and the HELiophysics Integrated Observatory (HELIO) webservices. It currently supports image data from major solar missions (e.g., SDO, SOHO, STEREO, and IRIS), time-series data from missions such as GOES, SDO/EVE, and PROBA2/LYRA, and radio spectra from e-Callisto and STEREO/SWAVES. We describe SunPy's functionality, provide examples of solar data analysis in SunPy, and show how Python-based solar data-analysis can leverage the many existing tools already available in Python. We discuss the future goals of the project and encourage interested users to become involved in the planning and development of SunPy.
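A minimal sketch of loading and displaying a solar image with SunPy is shown below; it uses the sample AIA image shipped with the package. The module paths reflect the 0.5-era API described here and may differ in later releases.

# Minimal SunPy sketch: load the bundled sample AIA 171 image into a Map and
# display it. Paths follow the 0.5-era API and may differ in later versions.
import sunpy.map
import sunpy.data.sample

aia = sunpy.map.Map(sunpy.data.sample.AIA_171_IMAGE)
print(aia.date, aia.instrument)  # basic metadata taken from the FITS header
aia.peek()                       # quick-look plot with colour map and title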
Serge Guelton et al 2015 Comput. Sci. Discov. 8 014001
Pythran is an open source static compiler that turns modules written in a subset of the Python language into native ones. Assuming that scientific modules do not rely much on the dynamic features of the language, it trades them for powerful, possibly inter-procedural, optimizations. These optimizations include detection of pure functions, temporary allocation removal, constant folding, Numpy ufunc fusion and parallelization, explicit thread-level parallelism through OpenMP annotations, false variable polymorphism pruning, and automatic vector instruction generation such as AVX or SSE. In addition to these compilation steps, Pythran provides a C++ runtime library that leverages the C++ STL to provide generic containers, and the Numeric Template Toolbox for Numpy support. It takes advantage of modern C++11 features such as variadic templates, type inference, move semantics and perfect forwarding, as well as classical idioms such as expression templates. Unlike the Cython approach, Pythran input code remains compatible with the Python interpreter. Output code is generally as efficient as the annotated Cython equivalent, if not more so, but without the loss of backward compatibility.
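A minimal sketch of the workflow follows: a NumPy-heavy function is annotated with a '#pythran export' comment and compiled ahead of time with the pythran command. The function body, name and array types are invented for the example.

# kernel.py -- a small NumPy kernel annotated for Pythran. The export comment
# declares the accepted argument type; the function itself stays plain Python
# and still runs unmodified under the standard interpreter.
#pythran export pairwise_dist(float64[:,:])
import numpy as np

def pairwise_dist(X):
    # Euclidean distance matrix between rows of X, written with NumPy
    # broadcasting so the compiler can fuse and vectorize the expression.
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

Compiling with 'pythran kernel.py' would produce a native extension importable as 'kernel', while the original file remains valid Python.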
Geoffrey M Poore 2015 Comput. Sci. Discov. 8 014010
PythonTeX is a LaTeX package that allows Python code in LaTeX documents to be executed and provides access to the output. This makes possible reproducible documents that combine results with the code required to generate them. Calculations and figures may be next to the code that created them. Since code is adjacent to its output in the document, editing may be more efficient. Since code output may be accessed programmatically in the document, copy-and-paste errors are avoided and output is always guaranteed to be in sync with the code that generated it. This paper provides an introduction to PythonTeX and an overview of major features, including performance optimizations, debugging tools, and dependency tracking. Several complete examples are presented. Finally, advanced features are summarized. Though PythonTeX was designed for Python, it may be extended to support additional languages; support for the Ruby and Julia languages is already included. PythonTeX contains a utility for converting documents into plain LaTeX, suitable for format conversion, sharing, and journal submission.
Kapil Arya and Gene Cooperman 2015 Comput. Sci. Discov. 8 014005
DMTCP (Distributed MultiThreaded CheckPointing) is a mature checkpoint–restart package. It operates in user space without kernel privilege, and adapts to application-specific requirements through plugins. While DMTCP has been able to checkpoint Python and IPython 'from the outside' for many years, a Python module has recently been created to support DMTCP. IPython support is included through a new DMTCP plugin. A checkpoint can be requested interactively within a Python session or under the control of a specific Python program. Further, the Python program can execute specific Python code prior to checkpoint, upon resuming (within the original process) and upon restarting (from a checkpoint image). Applications of DMTCP are demonstrated for: (i) Python-based graphics using virtual network client, (ii) a fast/slow technique to use multiple hosts or cores to check one Cython (Behnel S et al 2011 Comput. Sci. Eng. 13 31–39) computation in parallel, and (iii) a reversible debugger, FReD, with a novel reverse-expression watchpoint feature for locating the cause of a bug.
Michael Gittings et al 2008 Comput. Sci. Discov. 1 015005
We describe RAGE, the 'radiation adaptive grid Eulerian' radiation-hydrodynamics code, including its data structures, its parallelization strategy and performance, its hydrodynamic algorithm(s), its (gray) radiation diffusion algorithm, and some of the considerable amount of verification and validation efforts. The hydrodynamics is a basic Godunov solver, to which we have made significant improvements to increase the advection algorithm's robustness and to converge stiffnesses in the equation of state. Similarly, the radiation transport is a basic gray diffusion, but our treatment of the radiation–material coupling, wherein we converge nonlinearities in a novel manner to allow larger timesteps and more robust behavior, can be applied to any multi-group transport algorithm.
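For readers outside radiation hydrodynamics, the block below states the generic gray radiation-diffusion system with radiation–material coupling in its standard textbook form; this is background for the discussion above, not necessarily RAGE's exact discretized formulation. The nonlinear aT^4 coupling term is the quantity whose convergence treatment is highlighted in the abstract.

% Generic gray radiation-diffusion system with radiation--material coupling
% (textbook form; not necessarily RAGE's exact formulation). E is the
% radiation energy density, T the material temperature, kappa the opacity.
\[
\frac{\partial E}{\partial t}
  = \nabla \cdot \left( \frac{c}{3\kappa\rho}\,\nabla E \right)
  + c\,\kappa\rho \left( aT^{4} - E \right),
\qquad
\rho c_v \frac{\partial T}{\partial t}
  = -\,c\,\kappa\rho \left( aT^{4} - E \right).
\]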
James Bergstra et al 2015 Comput. Sci. Discov. 8 014007
Machine learning benchmark data sets come in all shapes and sizes, whereas classification algorithms assume sanitized input, such as (x, y) pairs with vector-valued input x and integer class label y. Researchers and practitioners know all too well how tedious it can be to get from the URL of a new data set to a NumPy ndarray suitable for e.g. pandas or sklearn. The SkData library handles that work for a growing number of benchmark data sets (small and large) so that one-off in-house scripts for downloading and parsing data sets can be replaced with library code that is reliable, community-tested, and documented. The SkData library also introduces an open-ended formalization of training and testing protocols that facilitates direct comparison with published research. This paper describes the usage and architecture of the SkData library.
Ursula Iturrarán-Viveros and Miguel Molero-Armenta 2015 Comput. Sci. Discov. 8 014006
Graphics processing units (GPUs) have become increasingly powerful in recent years. Programs exploring the advantages of this architecture could achieve large performance gains and this is the aim of new initiatives in high performance computing. The objective of this work is to develop an efficient tool to model 2D elastic wave propagation on parallel computing devices. To this end, we implement the elastodynamic finite integration technique, using the industry open standard open computing language (OpenCL) for cross-platform, parallel programming of modern processors, and an open-source toolkit called [Py]OpenCL. The code written with [Py]OpenCL can run on a wide variety of platforms; it can be used on AMD or NVIDIA GPUs as well as classical multicore CPUs, adapting to the underlying architecture. Our main contribution is its implementation with local and global memory and the performance analysis using five different computing devices (including Kepler, one of the fastest and most efficient high performance computing technologies) with various operating systems.
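The general PyOpenCL pattern the authors build on is illustrated below with a trivial element-wise kernel (not their elastodynamic finite integration code): a context and command queue are created, a small OpenCL C kernel is compiled at run time, and device buffers are filled from NumPy arrays.

# Minimal PyOpenCL sketch (not the authors' EFIT code): build a trivial
# element-wise kernel at run time and execute it on whatever device is found.
import numpy as np
import pyopencl as cl

a = np.arange(16, dtype=np.float32)
b = np.arange(16, dtype=np.float32)

ctx = cl.create_some_context()          # picks a GPU or CPU device
queue = cl.CommandQueue(ctx)
mf = cl.mem_flags

src = """
__kernel void add(__global const float *a, __global const float *b,
                  __global float *out) {
    int i = get_global_id(0);
    out[i] = a[i] + b[i];
}
"""
prg = cl.Program(ctx, src).build()

a_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=a)
b_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=b)
out_buf = cl.Buffer(ctx, mf.WRITE_ONLY, a.nbytes)

prg.add(queue, a.shape, None, a_buf, b_buf, out_buf)  # one work-item per element

result = np.empty_like(a)
cl.enqueue_copy(queue, result, out_buf)
print(result)  # 0, 2, 4, ...

Because the same source runs on AMD or NVIDIA GPUs and on multicore CPUs, this run-time compilation step is what gives the approach its cross-platform character.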
Jaydeep P Bardhan 2013 Comput. Sci. Discov. 5 013001
We review the mathematical and computational foundations for implicit-solvent models in theoretical chemistry and molecular biophysics. These models are valuable theoretical tools for studying the influence of a solvent, often water or an aqueous electrolyte, on a molecular solute such as a protein. Detailed chemical and physical aspects of implicit-solvent models have been addressed in numerous exhaustive reviews, as have numerical algorithms for simulating the most popular models. This work highlights several important conceptual developments, focusing on selected works that spotlight the need for research at the intersections between chemical, biological, mathematical, and computational physics. To introduce the field to computational scientists, we begin by describing the basic theoretical ideas of implicit-solvent models and numerical implementations. We then address practical and philosophical challenges in parameterization, and major advances that speed up calculations (covering continuum theories based on Poisson as well as faster approximate theories such as generalized Born). We briefly describe the main shortcomings of existing models, and survey promising developments that deliver improved realism in a computationally tractable way, i.e. without increasing simulation time significantly. The review concludes with a discussion of ongoing modeling challenges and relevant trends in high-performance computing and computational science.
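To give a flavor of the models mentioned, the block below states the continuum Poisson equation alongside the generalized Born estimate of the polarization energy in the form due to Still and co-workers; this is the standard textbook statement rather than any particular implementation reviewed in the paper.

% Standard statements (textbook form, not a specific implementation reviewed
% in the paper): the continuum Poisson equation and the generalized Born
% approximation to the electrostatic solvation (polarization) free energy.
\[
\nabla \cdot \left[ \epsilon(\mathbf{r})\, \nabla \phi(\mathbf{r}) \right]
   = -\rho(\mathbf{r}),
\qquad
\Delta G_{\mathrm{pol}} \approx
  -\frac{1}{2}\left( \frac{1}{\epsilon_{\mathrm{in}}} - \frac{1}{\epsilon_{\mathrm{out}}} \right)
  \sum_{i,j} \frac{q_i q_j}{f^{\mathrm{GB}}_{ij}},
\qquad
f^{\mathrm{GB}}_{ij} = \sqrt{ r_{ij}^{2} + R_i R_j \exp\!\left( -\frac{r_{ij}^{2}}{4 R_i R_j} \right) } ,
\]

where the R_i are effective Born radii; the cost of the pairwise sum compared with a full Poisson solve is what makes generalized Born one of the "faster approximate theories" referred to above.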
Tarek Echekki 2009 Comput. Sci. Discov. 2 013001
A principal challenge in modeling turbulent combustion flows is associated with their complex, multiscale nature. Traditional paradigms in the modeling of these flows have attempted to address this nature through different strategies, including exploiting the separation of turbulence and combustion scales and a reduced description of the composition space. The resulting moment-based methods often yield reasonable predictions of flow and reactive scalars' statistics under certain conditions. However, these methods must constantly evolve to address combustion in different regimes and modes, or with different dominant chemistries. In recent years, alternative multiscale strategies have emerged, which, although in part inspired by the traditional approaches, also draw upon basic tools from computational science, applied mathematics and the increasing availability of powerful computational resources. This review presents a general overview of different strategies adopted for multiscale solutions of turbulent combustion flows. Within these strategies, some specific models are discussed or outlined to illustrate their capabilities and underlying assumptions. These strategies may be classified under four different classes, including (i) closure models for atomistic processes, (ii) multigrid and multiresolution strategies, (iii) flame-embedding strategies and (iv) hybrid large-eddy simulation–low-dimensional strategies. A combination of these strategies and models can potentially represent a robust alternative strategy to moment-based models, but a significant challenge remains in the development of computational frameworks for these approaches as well as their underlying theories.