Apache Storm

Apache Storm is an open-source real-time computation system. It processes unbounded streams of data and helps with real-time processing. The primary functions are real-time analytics, machine learning, continuous computation, distributed RPC, ETL, and more. It became Apache’s Top-Level Project in September 2014.

Project Background

  • Framework: Apache Storm 
  • Author: Backtype, Twitter
  • Released: September 2011
  • Type: Real-time computation system
  • License: Apache License 2.0
  • Support: Cross-platform
  • GitHub: apache/storm

Applications

  • Fault-tolerant
  • Real-time analytic
  • Real-time stream processing
  • Online machine learning
  • Use with any language
  • Continuous computation
  • Distributed RPC and ETL

Summary

  • Stream processing framework is written in Clojure.
  • It has custom-made “spouts” and “bolts” for data information.
  • Spouts and bolts work as graph vertices in Directed Acyclic Graph
  • Data is processed in real-time.
  • It is user-friendly and can be used in any programming language.
  • It has a history of over a million tuples processed per second per node.
Scroll to Top