Tuesday Big Data Series

INFOGRAPHIC – HDFS and it’s features that make it so awesome!

Read more here.

Friday "Term of the week" Series

Term of the Week : HDFS

Image Source HDFS or the Hadoop Distributed File System, is a Java-based filesystem for storing large volumes of data in the Hadoop framework. It solves the problem of storing and managing enormous amounts of data with it’s high scalability, fault-tolerance, high availability and cost efficiency that make it so popular. Related Posts: HDFS and it’s features that make… Continue reading Term of the Week : HDFS

Tuesday Big Data Series

Understanding HDFS quotas

Every Hadoop system has an Hadoop Administrator and Hadoop users/developers. The Administrator is responsible for deployment and maintenance of the entire infrastructure. He is responsible for cluster availability, file system management, security, installation of latest updates, and all other things that need to keep the system up and running. The administrator is also responsible for… Continue reading Understanding HDFS quotas

Tuesday Big Data Series

Understanding NameNode and DataNode in HDFS

HDFS has a master/slave architecture and is built-up of basically two kinds of nodes: NameNode (which acts as a Master) and DataNodes (which acts as slaves). NameNode and DataNode are pieces of software that are designed to run on commodity machines which typically run on a GNU/Linux operating system (OS). HDFS is built using Java language,… Continue reading Understanding NameNode and DataNode in HDFS

Tuesday Big Data Series

HDFS and it’s features that make it so awesome!

HDFS, Hadoop Distributed File System, is a Java-based filesystem for storing large volumes of data in the Hadoop framework. When we are thinking of dealing with enormous amounts of data, the first thing that comes to our mind is where do we store this data and how do we store it. We know that every single bit… Continue reading HDFS and it’s features that make it so awesome!