Source-agnostic distributed change data capture system
Open-Source Distributed Stream and Batch Processing
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
Example codes for my Distributed Computing course at Hefei University.
A high performance replicated log service. (The development is moved to Apache Incubator)
BBoxDB is a scalable, highly available and distributed data store for multi-dimensional big data. The software supports operations like hyperrectangle queries or spatial joins.
Multi-platform Scheduling and Workflows Engine
Agreement in Asynchronous Distributed Systems
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data. The library can be used on a single machine using multi-threading or distributed computing using Spark.