News

A tutorial for writing a MapReduce program for Hadoop in python, and using Hive to do MapReduce with SQL-like queries. This uses the Hadoop Streaming API with python to teach the basics of using the ...
Hive-tutorial - WIP Hive is a datawarehousing layer above Hadoop. It gives SQl like semantics over Hadoop data (HDFS). Although now many SQl engine over hadoop like Impala,Drill,Presto has come but ...