This blog post is taken from a chapter of my ebook on building reproducible analytical pipelines, which you can read here.
If you want to follow along, you can start by downloading the data I use here. This is a smaller dataset made from the one you can get <a href=“https://dataverse.harvard. …