Articles tagged with 'hadoop'

Analysing Apache HTTP Server Logs With Hadoop - Part 1

The Apache HTTP Server seems to be declining in popularity but it still has huge market share. I was toying with MapReduce and Pig lately and thought that processing log files with Hadoop would be a cool little project to get the hang of things so I started with a MapReduce project using the Java API and while researching I found that I could do it much more easily with Apache Pig. This post describes the Java API approach and a follow up post will cover the Pig solution.

Read More

Post Tags: