Monday, November 4, 2013

New Big Data Projects

Some interesting big-data-related projects:

  • Databus - Databus provides a timeline-consistent stream of change capture events for a database. GitHub link.
  • Samza - Apache Samza is a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.
  • Druid - Druid is open source infrastructure for real-time exploratory analytics on large datasets. The system uses an always-on, distributed, shared-nothing architecture designed for real-time querying and data ingestion. It leverages column orientation and advanced indexing structures to allow cost-effective, arbitrary exploration of multi-billion-row tables with sub-second latencies.
  • Grape - Grape is a real-time processing pipeline.
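
To make the "timeline-consistent stream of change capture events" idea from the Databus entry a bit more concrete, here is a minimal in-memory sketch in Python. It is not Databus's API: the names `ChangeEvent`, `ChangeLog`, `apply_changes`, and the `scn` sequence number are all hypothetical and chosen only for illustration. The assumption is that every change gets a strictly increasing sequence number in commit order, and that a consumer replays events in that order starting from its last checkpoint.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Iterator, List, Optional, Tuple


@dataclass(frozen=True)
class ChangeEvent:
    scn: int                          # hypothetical commit sequence number, strictly increasing
    table: str
    key: Any
    op: str                           # "upsert" or "delete"
    value: Optional[Dict[str, Any]]   # row image for upserts, None for deletes


class ChangeLog:
    """Toy change log that keeps events in commit order (illustration only)."""

    def __init__(self) -> None:
        self._events: List[ChangeEvent] = []

    def append(self, table: str, key: Any, op: str,
               value: Optional[Dict[str, Any]] = None) -> ChangeEvent:
        # Sequence numbers start at 1 and grow by one per committed change.
        event = ChangeEvent(len(self._events) + 1, table, key, op, value)
        self._events.append(event)
        return event

    def read_since(self, checkpoint: int) -> Iterator[ChangeEvent]:
        # The event with scn i sits at index i - 1, so this yields every scn > checkpoint.
        return iter(self._events[checkpoint:])


def apply_changes(replica: Dict[Tuple[str, Any], Dict[str, Any]],
                  events: Iterable[ChangeEvent],
                  checkpoint: int) -> int:
    """Apply events to a replica in order and return the new checkpoint."""
    for event in events:
        if event.op == "delete":
            replica.pop((event.table, event.key), None)
        else:
            replica[(event.table, event.key)] = event.value
        checkpoint = event.scn
    return checkpoint


log = ChangeLog()
log.append("users", 1, "upsert", {"name": "Ada"})
log.append("users", 2, "upsert", {"name": "Bob"})
log.append("users", 1, "upsert", {"name": "Ada L."})
log.append("users", 2, "delete")

replica: Dict[Tuple[str, Any], Dict[str, Any]] = {}
checkpoint = apply_changes(replica, log.read_since(0), 0)
```

Because the consumer applies events strictly in sequence order and remembers only its checkpoint, it can stop and resume with `log.read_since(checkpoint)` and still see the same timeline as the source.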
