Ressource Management

Zoom on the hardware architecture

How jobs impact available servers

Schema Full Saagie

Schema Saagie on Top of a Datalake

Rules

Node typesResident ServicesScheduled or Streaming Jobs
Data Node

HDFS

Yarn/Map reduce (aslo Hive)

Impala

Drill

Docker

Spark

R

Python

Sqoop

Talend

Java-Scala

Datascience Notebook (depends of your settings)

Datamart

Mongo Db

MySQL

PostGreSQL (1.5)

Elastic Search (1.5)


Dataviz

Docker

Datascience Notebook (depends of your settings)

Kafka NodeKafka
Compute Edge Node
Datascience Notebook (depends of your settings)
GPU Edge Node