Tuesday Big Data Series

INFOGRAPHIC – HDFS and its features that make it so awesome!

Read more here.

Friday "Term of the week" Series

Term of the Week: HDFS

HDFS, or the Hadoop Distributed File System, is a Java-based filesystem for storing large volumes of data in the Hadoop framework. It solves the problem of storing and managing enormous amounts of data through its high scalability, fault tolerance, high availability and cost efficiency, the qualities that make it so popular.

Tuesday Big Data Series

Understanding NameNode and DataNode in HDFS

HDFS has a master/slave architecture built from two kinds of nodes: a NameNode (which acts as the master) and DataNodes (which act as slaves). NameNode and DataNode are pieces of software designed to run on commodity machines, which typically run a GNU/Linux operating system (OS). HDFS is built using the Java language,…
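
To make the master/slave split more concrete, here is a toy, in-memory sketch in Python. It does not use Hadoop or any HDFS API: the classes `NameNode` and `DataNode`, the helpers `write_file` and `read_file`, the round-robin placement, and the tiny block size and replication factor are all assumptions made up for illustration. The only idea it tries to show is that the master keeps metadata (which blocks make up a file and where they live) while the slaves hold the actual data.

```python
from dataclasses import dataclass, field


@dataclass
class DataNode:
    """Toy worker node: holds the actual block contents."""
    name: str
    blocks: dict = field(default_factory=dict)  # block_id -> data


class NameNode:
    """Toy master node: holds only metadata, never file contents."""

    def __init__(self, datanodes, block_size=4, replication=2):
        self.datanodes = datanodes
        self.block_size = block_size      # toy value, in characters
        self.replication = replication    # copies kept of each block (toy value)
        self.block_map = {}               # filename -> [(block_id, [DataNode, ...])]

    def allocate(self, filename, length):
        """Plan block placement for a new file (simple round-robin, an assumption)."""
        count = (length + self.block_size - 1) // self.block_size
        plan = []
        for index in range(count):
            holders = [self.datanodes[(index + r) % len(self.datanodes)]
                       for r in range(self.replication)]
            plan.append((f"{filename}#{index}", holders))
        self.block_map[filename] = plan
        return plan

    def locate(self, filename):
        """Return which DataNodes hold each block of the file."""
        return self.block_map[filename]


def write_file(namenode, filename, data):
    """Client side: ask the master for a plan, then send blocks to the workers."""
    size = namenode.block_size
    plan = namenode.allocate(filename, len(data))
    for index, (block_id, holders) in enumerate(plan):
        chunk = data[index * size:(index + 1) * size]
        for node in holders:
            node.blocks[block_id] = chunk


def read_file(namenode, filename):
    """Client side: look up locations, then read each block from any replica that has it."""
    parts = []
    for block_id, holders in namenode.locate(filename):
        node = next(n for n in holders if block_id in n.blocks)
        parts.append(node.blocks[block_id])
    return "".join(parts)


nodes = [DataNode("dn1"), DataNode("dn2"), DataNode("dn3")]
master = NameNode(nodes)
write_file(master, "log.txt", "hello hdfs")
nodes[0].blocks.clear()                 # simulate losing one worker's data
print(read_file(master, "log.txt"))     # the file is rebuilt from the other replicas
```

Because every block is stored on two different toy DataNodes, wiping one node still leaves a copy of each block elsewhere, which is a rough picture of the fault tolerance described in the posts above. Real HDFS uses much larger blocks and its own placement policy, so treat the numbers here purely as placeholders.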