Click here to see the map at Huffington Post Québec First of all, a clarification. I did not really make that map. I adapted the code from Noah Veltman’s San Francisco history map, and made one for Montreal. Compare both maps, and you’ll see they are very similar in many ways. That said, the data sources […]
Using Python’s calendar module for scraping date-based data
I’ve recently fallen in love with Python’s standard calendar module. It has lots of functions to make handling dates a breeze. And for scraping data based on dates, it couldn’t be more convenient. Take Environment Canada’s historical hourly data for Montreal. Each page has 24 hours of data in a single day. If I want […]
On the ethics of web scraping
Scraping data from websites is a valuable skill for a journalist, and often (as was my case), the first incentive to learn to code. But once you acquire the power to harvest mass amount of data in a short time, ethical questions invariably pop up: Can I take this data? Can I republish this […]