Knowledge Hub: Big Data: AMAZON ELASTIC MAPREDUCE (EMR)

Thursday, August 21, 2014

Big Data- The Rise and the Future

In this blog we will discuss AMAZON ELASTIC MAPREDUCE (EMR)

AMAZON ELASTIC MAPREDUCE (EMR) Overview

What is EMR?

-Webservice on top of AWS that uses EC2 for processing and S3 for storage

-Data is pulled from S3, processed by auto-configured EC2 cluster and results pushed back to S3

-Crunch your data in the cloud without the hassle of managing your own cluster/infrastructure!!

What is an EMR Job Flow?

-Data processing wizard

-Hive,mapreduce, hbase and pig

The only thing we need to do is configure EMR Job Flow. Once its configured, rest is very easy. Even EMR JOB FLOW is very easy in amazon.

Thats it.

Knowledge Hub