- Databus - Databus provides a timeline-consistent stream of change capture events for a database. Github link.
- Samza - Apache Samza is a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.
- Druid - Druid is open source infrastructure for Real²time Exploratory Analytics on Large Datasets. The system uses an always-on, distributed, shared-nothing, architecture designed for real-time querying and data ingestion. It leverages column-orientation and advanced indexing structures to allow for cost effective, arbitrary exploration of multi-billion-row tables with sub-second latencies.
- Grape - Grape is a realtime processing pipeline.
Monday, November 4, 2013
New Big Data Projects
Some interesting big data related projects.
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment