How To Setup Hadoop SSH Configuration In Linux CentOs


Hadoop requires SSH access to manage its nodes, i.e. remote machines plus your local machine if you want to use Hadoop on it (which is what we want to do in this short tutorial). For our single-node setup of Hadoop, we therefore need to configure SSH access to localhost for the hduser user we created in the […]

Commissioning and Decommissioning Nodes in a Hadoop Cluster


One of the best Advantage in Hadoop is commissioning and decommissioning nodes in a hadoop cluster,If any node in hadoop cluster is crashed then decommissioning is useful suppose we want to add more nodes to our hadoop cluster then commissioning concept is of the most common task of a Hadoop administrator is to commission (Add) and decommission (Remove) […]

Apache Spark Tutorial for beginners


Apache Spark is a open source processing engine.Apache Spark is a fast and general engine for large-scale data processing.Spark is a lightning-fast cluster computing designed for fast computation. Apache spark: Streaming Data Apache Spark’s key use case is its ability to process streaming data. With so much data being processed on a daily basis, it […]

what is hadoop

what is hadoop

What is Hadoop : Hadoop is one of the trending technology in IT market because the reason is Big data,Yes big data is a problem and Hadoop is a solution of that problem.Before going to what is hadoop concept first of all we have to know what is Big data and what are the main […]

Hadoop Definition and Ecosystems


Hadoop Hadoop is one of the powerful technology in today Market,Hadoop is an open source and specially designed for commodity hardware and it is open source,java based frame work  that supports both storage purpose and processing purpose. Hadoop is especially designed for large data sets in distributed computing environment. Apache Hadoop is a part of […]