

#Intellij linux install
Run the below commands on your command prompt and Ubuntu will install sdk on your machine. I've used Ubuntu 14.x Linux VM on my Windows box. So we can run them on the same machine with some simple configuration changes. The Namenode and Datanode runs on their own JVM process. These ports can be overridden using the configuration. Namenode RPC interface listen on port 8020 by default and HTTP interface on port 50070 by default.ĭatanode RPC interface listen on port 50010 by default and HTTP interface on port 50075. Namenode and Datanode are implemented as Linux daemons and both exposes RPC and HTTP interfaces for communication. I suggest reading Tom White's "Hadoop, The Definitive Guide" book. If not, please spend some time learning and come back to this article. I assume the reader has sufficient understanding of the basics of Hadoop architecture. HDFS consists of Namenode to manage metadata of file system and several Datanodes to store the data. HDFS ( Hadoop Distributed File System) that stores and manages data and MapReduce framework that does data analysis job. In general, Hadoop technology consists of two parts - data storage and analyzing the stored data. Setting up debug environment using IntelliJ However I had a strong feeling of learning this wonderful bigdata technology and that helped me cross all the hurdles.:) I hope this article helps those who want to start learning hadoop from scratch. The reason why I wrote this article is I struggled for a couple of days to setup debug environment as I didn't get complete details online. Being a Microsoft developer, I was new to Java, IntelliJ IDEA and Hadoop development. This will help in better understanding of the framework itself.

This article is intended for beginners who want to setup Hadoop debug environment, i.e., debugging Namenode, Datanode and Map Reduce APIs.
