Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
BBoxDB is a scalable, highly available, and distributed data store for multi-dimensional big data. The software supports operations like multi-dimensional range queries and spatial joins. In addition, data streams are supported.
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
A high performance replicated log service. (The development is moved to Apache Incubator)
Example codes for my Distributed Computing course at Hefei University.
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data. The library can be used on a single machine using multi-threading or distributed computing using Spark.