How This Blog Is Now Ran by Modern GitHub Actions Thanks to AI

Publishing Jupyter notebooks as blog posts on this site used to be a tedious, multi-step manual process. After not updating this blog for a while, I asked Claude (yes, an AI) to take a look at the old workflow and make it better. It read through my janky bash script, understood the whole Jekyll setup, and built a replacement that honestly makes me wonder why I spent all those hours doing it by hand. Here’s how it works now:

Predicting genres of 45,000 Project Gutenberg books using NLP - BoW Approach

Project Gutenberg is a website that offers more than 58,000 free eBooks for which U.S. copyright have expired. It is very interesting text data for Natural Language Processing (NLP), as it is a huge body of text with pretty reliable labeling such as genre, author, publication year etc… Here, I’ll attempt to process approximately 45,000 English books from Project Gutenberg in order to find patterns between words and the genre of the books using a Bag-of-Words (BoW) approach.