Global architecture

Global Schema

Network Architecture

Available Technologies ("Capsules")

Extraction & Processing

  • SQOOP
  • Java, Scala, Kotlin
  • Apache Spark (version 1.5, 1.6, 2.0, 2.1)
  • R
  • Python (branch 2.x and 3.X)
  • Docker
  • Notebooks :
    • Jupyter : Python
    • Jupyter: Python with PySpark 2.1
    • Jupyter: R
    • Jupyter: Julia
    • Jupyter: Haskell
    • Spark Notebook : 1.5
    • Spark Notebook : 1.6
    • Zeppelin Notebook 

Datalake

  • HDFS : 2.6
  • Hive
  • Impala : 2.5
  • Drill
  • Kafka : 0.10

Datamart

  • Mongo DB
  • MySQL
  • Postgre SQL
  • Elastic Search

Dataviz

  • Docker