Data Engineering by Example
I’m creating a GitHub repository with examples of common procedures in data engineering. This is because I recently decided to get more practical and hands-o...
This is the first public weekly review I’ll do, and will set up the template for all future ones. The basic idea comes from Ali Abdaal (with his coming from ...
I recently had to overhaul an analysis after I realised a common issue had snuck in. As you can probably guess by the title, that issue was immortal time bia...
This post is a holding place for code that I find helpful for quickly profiling CPU or RAM usage but that isn’t comprehensive enough to go in a GitHub repo.
This is a first look at a book that many recommend as essential for a good foundation in data engineering (DE). So while I go through the hoop...