PhD project: Workflows for reproducible computational science and data science
We are inviting applications for a PhD position in the Hamburg metropolitan area (Germany) to research reproducible workflows for computational science and data science.
Analysing the data obtained during experiments is a main activity in photon science, and is essential to convert that data into understanding and, eventually, publications. A topic that receives growing attention is that of reproducibility and re-usability: given a publication, it should be possible for its readers to reproduce the published results, particularly if those results are based on computational processes. This forms the basis for re-use of the work, for example to extend the analysis software to carry out a related but new study. In practice, this is often impossible.
In this project, we will investigate the process of data analysis towards publication and then work to improve this workflow. Typically, data analysis involves processing huge amounts of data (GB to PB) using a range of specialist software tools. Challenges include preserving all these processing steps, the specialist software, and its computational environment so that the computation can be reproduced and re-used in the future. The objectives are to make the process reproducible, convenient and effective. Important tools for the technical part of this work are likely to include the Jupyter Notebook and an ecosystem of tools, including Python, package managers such as Spack, and containers.
We are looking for a computational scientist with a background in physics, chemistry, biology, mathematics, engineering or similar and a strong interest in programming and computational science, or for a computer scientist with an interest in supporting computational science. Relevant tools are likely to include Python, Jupyter, and containers. The application deadline is the end of March; the start date, later in 2019, is probably somewhat flexible. The working language is English.
The project is listed as number 6 on https://www.dashh.org/application/phd_topics/index_eng.html
More details are available at https://fangohr.github.io/vacancies/dashh-reproducible-workflows.html
If you have any questions, please get in touch (email@example.com).