WHAT WE DO

STRATEGY

We help companies to identify best practices to develop a big data strategy, what technologies might be used, how to build effective analytics. To understand how to run that process, we often practice “hacking sessions” for non- technical people with our customers: custom workshops to help identify the right use cases and what type of insight could be obtained, analyze and map the company’s data landscape, support the re-design of the business models or concepts and calculate the ROI of a possible project.

MACHINE LEARNING

Track (3 days)

Machine Learning introduction

  • Scope and motivations
  • Terminology and workflow
  • Typical pipelines
  • Approaches and algorithms
  • Algorithms in-depth
  • Use cases and demo

Advanced Machine Learning

  • Features Engineering
  • Advanced pipelines
  • Specialized Algorithms
  • Model selection and evaluation
  • Recommender systems
  • Use cases and demo

Large Scale Machine Learning

  • Spark MLlib and Big Data
  • Deep Learning
TECHNOLOGY
SPARK

Full Spark training with hands on Lab

Track 1 (2 days) – Spark Core + Spark SQL

  • BigData overview
  • Spark Story & Community
  • Spark vs Hadoop
  • Spark Integrations
  • Spark Build
  • Spark Deployment
  • How it works
  • API overview
  • First Job (LAB)
  • RddAPI (LAB)
  • RDD vs DataFrame vs DataSet
  • DataFrame API (LAB)
  • Final project (LAB)
  • Tips & Tricks
  • Spark SQL
  • SparkSQL vs Hive vs Impala
  • SparkSQL API
  • SparkSQL Job ( API )
  • Spark Thrift Server + BI connection

Track 2 (2 days) – Spark Streaming + Spark ML

  • Spark Streaming
  • Spark Streaming vs Storm vs Flink
  • Spark Streaming integrations
  • First stream Job (LAB)
  • Lamda Architecture
  • Advanced Streaming (LAB)
  • Spark for Machine learning
  • ML vs MLLib
  • Algorithms
  • Clustering: K-Means (LAB)
  • Recommendation: ALS (LAB)
  • Model Server with Lambda Architecture
  • Tips & Tricks
  • Datascience & Production
  • Spark Notebook
HADOOP

Introduction to Hadoop – Track ( 2 days )

BIG DATA PLATFORMS

  • Overview
  • NoSQL benchmarking

HADOOP + CLOUDERA COMPONENTS

  • Hadoop vs RDBMS
  • Hadoop in Enterprise

DATA STORES

  • HDFS Advanced
  • HBase Design & DataModel
  • Solr

DATA INGESTION

  • Kafka
  • Sqoop
  • Flume

DATA ANALYSIS

  • Impala
  • Hive
  • Mapreduce Concepts & Development
  • Mapreduce Input&Output

ARCHITECTURE

  • Security Authentication
  • Security Authorisation
  • Hadoop Processes
CASSANDRA

A certified architect will bring you into Cassandra internals with lot of hands on excercises

Track 1 (2 days) – Cassandra Core

  • BigData and NoSQL overview
  • Installation and configuration (LAB)
  • Tools: nodetool, cqlsh, stress (LAB)
  • Replication and Consistency
  • Gossip
  • Data Model
  • CQL (LAB)
  • Write and Read Path (LAB)
  • Compaction and Tombstoning
  • Hardware best practices

Track 2 (1 day) – Operations

  • Environment
  • Adding nodes (LAB)
  • Remove, Decommission and Replace nodes (LAB)
  • Bootstrap and Cleanup
  • Hinted Handoff (LAB)
  • Repair (LAB)
  • Backup and Recovery
  • Security
  • DR and MultiDatacenter
  • JVM tuning
  • Disk tuning

Track 3 (1 day) – Data Model

  • Logical model
  • Conceptual model
  • Physical model
  • Data Types
  • How to validate model
  • Transactions
  • Client Side Joins
  • Best practices
  • Workshop (LAB)

Track 4 (1 day) – Datastax platform and integrations

  • Datastax overview
  • Solr Overview
  • Search fundamentals
  • Solr Queries (LAB)
  • Inverted Index and Document Scoring Datastax integration
  • CQL Extensions (LAB) Cassandra Spark Connector Read from Cassandra
  • Write into Cassandra
  • Group by, Join and Partitioning Dataframe
  • Lambda architecture

We are Also Lightbend certified trainers so we can deliver certified courses

  • Lightbend Reactive Architecture – Professional
  • Lightbend Akka Streams for Scala – Professional
  • Lightbend Scala Language – Professional (formerly Fast Track to Scala)
  • Lightbend Scala Language – Expert (formerly Advanced Scala)
  • Lightbend Akka for Scala – Professional (formerly Fast Track to Akka for Scala)
  • Lightbend Akka for Java – Professional (formerly Fast Track to Akka for Java)
  • Lightbend Akka for Scala – Expert (formerly Advanced Akka for Scala)
  • Lightbend Akka for Java – Expert (formerly Advanced Akka for Java)
  • Lightbend Apache Spark for Scala – Professional (formerly Spark Workshop)
  • Fast Track to Play with Scala