This is Part I of a two part series on my attempt to collect, clean, analyze and visualize data from the Ask HN: Who is Hiring? threads. This first part is a discussion of my analysis. Part II goes…
In Part I of this article, I showed my analysis for the data I pulled and tagged from Ask HN: Who is Hiring? threads for the last 7 years. This second part is a walkthrough…
In 2011, a team of scientists led by Martin Hilbert discovered that the total amount of data in the world in 2007 was 295 optimally compressed exabytes. If you decided to store all this data on standard…
I’ve been working with Python for over a year now and I’ve grown to like the language very much. It’s made its own place in the industry and in academia. It’s also currently the most…
If you read our recent blog post, you know we’re big on using Scrapy for a lot of our large-scale website crawling needs. But there’s another kind of website crawling scenario that also requires a…