Basic Hadoop Framework Learning
In my last blogs we discussed What is Big Data Hadoop and its jobs?, Apache Spark, and Introduction of Apache Spark SQL. In this blog we will discuss the basic framework of Hadoop. So let's get started.

Frameworks:

Hadoop
Hadoop is a software library written in Java. It is used for processing large amounts of data in a distributed environment, and it lets developers set up clusters of computers that start with a single node and can scale up to thousands of nodes.

HIVE
Hive is a data warehousing framework built on Hadoop. It allows for structuring data and querying it with an SQL-like language called HiveQL. Developers can use Hive and HiveQL to write complex MapReduce jobs over structured data in a distributed file system. Hive is the closest thing to a relational database in the Hadoop ecosystem.

PIG
Pig is an application for transforming large data sets. Like Hive, Pig has its own language, called Pig Latin. Pig Latin allows devel...