Apache Storm is a system for processing streaming data in real time. Built on top of YARN, it is powerful for real-time analytics, machine learning and continuous monitoring operations.
Some of the important characteristics of Apache Storm are –
- It is a distributed real-time computation system for processing large volumes of high-velocity data.
- It is extremely fast with the ability to process one million 100 byte messages per second per node.
- It is highly scalable.
- Like other Hadoop systems, it is fault-tolerant and reliable.
- It guarantees that each unit of data will be processed at least once or exactly once.
- It is easy to operate.