Recap

H4Sci 2025: Block 1 - Course Debriefing

Matt Bannert
#Data Science#Git#DevOps

Maneuvering the Open Source Ecosystem (Choose Your Rabbit Holes)

To start diving into a new ecosystem can be daunting and overwhelming, particularly when system borders are fuzzy and the landscape seems to evolve at a break-neck pace. In the first block of our course, we figured splitting the open source landscape into categories of tools provides a reasonable guideline: programming, languages, interaction environments, version control systems, data management, infrastructure, automation, communication and publishing tools. Obviously, not every project needs tools of every category. Still though, the ability to classify newly spotted software helps a great deal to figure out how deep you want to dive into learning a new technology.

RSE Book: Stack - A Developer’s Toolkit

The one no-brainer tool choice that should be used in every software project or analysis is git version control. Hence everyone using programming to tackle data quests should have carpentry level git skills.

How Long Does It Take to Learn Git (And Where to Start)?

First of all, get two ecosystem related hurdles out of the way: SSH Key Pair authentication and Quitting Vim. Both hurdles do not even have anything to do with git itself. Nevertheless newcomers will inevitably bump into them and potentially loose valuable motivation and endurance before they even started.

Once your public key is with your remote git provider of choice and you know how to quit a hacker’s favorite text editor you’re good to go. If you’re new to git, i.e., you don’t use git every week, here are a few steps to give you a bit of guidance getting started:

Additional Git Ressources

There is so much information on git available that it may be impossible to compose a comprehensive list. Certainly, such a list is beyond the scope of this blog post. Still, here a few git resources:

← Back to Blog