
title: Quality Open Source Big Data Projects
category: Open Source Projects
icon: big-data

Spark: Apache Spark is a unified analytics engine for large-scale data processing.
Flink: Apache Flink is a framework and distributed processing engine for stateful computations over bounded and unbounded data streams. Flink runs in all common cluster environments and performs computations at in-memory speed and at any scale.
HBase: HBase (Hadoop Database) is a highly reliable, high-performance, column-oriented, scalable distributed storage system. With HBase, large-scale structured storage clusters can be built on inexpensive PC servers.
Flume: Apache Flume is a distributed, reliable, and available system for collecting, aggregating, and moving large amounts of log data from various sources to a centralized data store.
Storm: Apache Storm is a distributed, fault-tolerant real-time computation system.
