| title | Quality Open Source Big Data Projects |
|---|---|
| category | Open Source Projects |
| icon | big-data |
- Spark: Spark is a unified analytics engine for large-scale data processing.
- Flink: Apache Flink is a framework and distributed processing engine for stateful computations over bounded and unbounded data streams. Flink can run in all common cluster environments and can perform computations at in-memory speed and any scale.
- HBase: HBase – Hadoop Database, is a highly reliable, high-performance, column-oriented, scalable distributed storage system. Using HBase technology, large-scale structured storage clusters can be built on inexpensive PC servers.
- Flume: Apache Flume is a distributed, reliable, and available system for collecting, aggregating, and moving large amounts of log data from various sources to centralized data storage.
- Storm: A distributed, fault-tolerant real-time computation system.