They will also learn how to make visualizations in python using industry standard plotting libraries.
They will learn about strategies to make optimal use of HPC clusters, and how they might use HPC in their own research project.
6. Research data management
Students will learn about the best practices of research data management. This will include an approach to making research data FAIR (findable, accessible, interoperable, reusable). They will also learn about GDPR compliance and data life cycle management.
7. Version control & collaboration
Students will learn about git version control, which is a decentralized version control system widely used for source code and other plain-text files. Additionally, they will learn about version control in a collaborative setting where multiple researchers work on the same files.
8. Writing readable code
Here we discuss how students can ensure that the code they write is easily readable, with a clear and easy-to-follow logic. This will help in getting consistent results from data analyses.
9. Using Python Notebooks for communication
Notebooks are a great tool to communicate science and science results. In this module we will teach students how to use the interactive capabilities of Jupyter notebook as an effective means of communication.
10. Digital project management
In this module the students will learn how to effectively work on digital projects, specifically tailored towards scientific data analysis. We will discuss the use of software versioning, effective collaboration, and ensuring that the project can be taken over by colleagues.
11. Introduction to machine learning & AI
We discuss the basics of machine learning and AI, and what different kinds of problems can be solved using these techniques. We will also discuss how these methods could be used in their own fields.
12. Ethics of AI
The use of AI, and especially generative AI, comes with a host of ethical dilemmas. We will make students aware of these dilemmas, and discuss how they apply to their own work.