Building Data Analytics Infrastructure with SMACK Stack

Building Big Data Analytics Infrastructure with SMACK Stack

Guide to Building Data Analytics Infrastructure with SMACK Stack using the following terms –

  • S – stands for Apache Spark used for Batch Processing or Real-Time Data Streaming.
  • M – stands for Apache Mesos responsible for installation and administration.
  • A – stands for Akka for Data Streaming and Data Ingestion at a faster pace.
  • C – stands for Cassandra database to write and read the stored data.
  • K – stands for Apache Kafka to perform decoupling and reduction of overhead.

Challenge for Building the Analytics Platform

  • To power Scalable Real-Time & Data Driven Application.
  • Build a system of Real-Time insights to create new opportunities and deliver new value.
  • Ingest Data at a scale without loss.
  • Trigger actions based on the analyzed data, and store the data at Cloud-scale.

Solution Offered for Building Infrastructure with SMACK Stack

Propose SMACK Stack to overcome these challenges –

The SMACK stack to build modern enterprise apps because it performs each of these objectives with a loosely coupled toolchain of technologies that are all open source, and production-proven at scale.

  • Spark – A general engine for large-scale Data Processing, enabling analytics from SQL queries to Machine Learning, Graph Analytics, and Stream Processing.
  • Mesos – Distributed systems kernel that provides resourcing and isolation across all the other SMACK stack components. Mesos is the foundation on which other SMACK stack components run.
  • Akka – A toolkit and runtime to easily create concurrent and distributed apps that are responsive to messages.
  • Cassandra – Distributed database management system that can handle a large amount of data across servers with high availability.
  • Kafka – A high throughput, low-latency platform to handle Real-Time data feeds with no data loss.
Read more