Training

Helping you on your Big Data technology learning journey

YAVA Data Engineering Course

Overview

This training is designed to provide the essential knowledge and skills needed to support data-driven decision making by collecting, transforming, and visualizing data with a Hadoop cluster and its ecosystem. The course covers designing, building, maintaining, and troubleshooting data processing systems.

Duration

5 days

Format

50% Lecture/Discussion
50% Hands-on Labs

Prerequisites

Participants should be familiar with programming principles and have previous experience in software development. Previous experience with data processing and SQL is also helpful, but not required.

Target Audience

This course is for:

  • Developer
  • ETL engineer
  • Data analyst

Course Objectives

  • YAVA Data Management Platform Overview
  • Moving Data into Hadoop
  • Working with HDFS
  • Batch Processing using MapReduce and HGrid247
  • Real-Time Processing using Apache Storm and HGrid247
  • Data Exploration with Hive

Apache Spark with R Course

Overview

This training is specifically designed to provide the knowledge and skills needed to become a data analyst. The course covers the concepts of data preparation, data manipulation, data exploration, and data visualization.

Duration

3 days

Format

50% Lecture/Discussion
50% Hands-on Labs

Prerequisites

Participants should be familiar with programming principles and have previous experience in data analytics and data processing. Previous experience with SQL, R, and statistics is also helpful.

Target Audience

This course is for:

  • Developer
  • ETL engineer
  • Data analyst

Course Objectives

  • Introduction to Apache Spark, including Spark SQL and sparklyr / SparkR
  • Data preparation: loading data (HDFS/Hive), transforming, filtering, handling missing values, joining tables
  • Data exploration: viewing statistics, aggregation, viewing data distributions, data binning, viewing trends
  • Data visualization

Apache Spark with Scala Course

Overview

This training is specifically designed to provide the knowledge and skills needed to become a data analyst. The course covers the concepts of data preparation, data manipulation, data exploration, and data visualization.

Duration

3 days

Format

50% Lecture/Discussion
50% Hands-on Labs

Prerequisites

Participants should be familiar with programming principles and have previous experience in data analytics and data processing. Previous experience with SQL, Scala, and statistics is also helpful.

Target Audience

This course is for:

  • Developer
  • ETL engineer
  • Data analyst

Course Objectives

  • Introduction to Apache Spark, including Spark SQL and Spark DataFrames
  • Data preparation: loading data (HDFS/Hive), transforming, filtering, handling missing values, joining tables
  • Data exploration: viewing statistics, aggregation, viewing data distributions, data binning, viewing trends
  • Data visualization

Apache Spark with Python Course

Overview

This training is specifically designed to provide the knowledge and skills needed to become a data analyst. The course covers the concepts of data preparation, data manipulation, data exploration, and data visualization.

Duration

3 days

Format

50% Lecture/Discussion
50% Hands-on Labs

Prerequisites

Participants should be familiar with programming principles and have previous experience in data analytics and data processing. Previous experience with SQL, Python, and statistics is also helpful.

Target Audience

This course is for:

  • Developer
  • ETL engineer
  • Data analyst

Course Objectives

  • Introduction to Apache Spark, including Spark SQL and the Spark Python API
  • Data preparation: loading data (HDFS/Hive), transforming, filtering, handling missing values, joining tables
  • Data exploration: viewing statistics, aggregation, viewing data distributions, data binning, viewing trends
  • Data visualization

YAVA Cluster Administration Course

Overview

This training gives participants expertise in all the steps necessary to operate and maintain Hadoop clusters, from planning, installation, and configuration to monitoring and troubleshooting. It provides hands-on preparation for the real-world challenges faced by Hadoop administrators.

Duration

5 days

Format

50% Lecture/Discussion
50% Hands-on Labs

Prerequisites

Participants should have basic Linux knowledge. Previous experience administering Linux systems is also helpful, but not required.

Target Audience

This course is for:

  • Unix/Linux administrator
  • Database administrator
  • System administrator
  • People who want to develop a career as a Hadoop administrator

Course Objectives

  • YAVA Data Management Platform Overview
  • Cluster Planning, Installation, and Configuration
  • Identity, Authentication and Authorization
  • Resource Management
  • Cluster Maintenance & Monitoring
  • Troubleshooting

Apache HAWQ

Overview

This training is specifically designed to provide the knowledge and skills needed to become a data analyst. The course covers the concepts of data preparation, data manipulation, data exploration, and machine learning.

Duration

3 days

Format

40% Lecture/Discussion
60% Hands-on Labs

Prerequisites

Participants should be familiar with programming principles and have previous experience in data analytics and data processing. Previous experience with SQL, Python, and statistics is also helpful.

Target Audience

This course is for:

  • SQL Developer
  • ETL engineer
  • Data analyst

Course Objectives

  • Introduction to Apache HAWQ, including HAWQ system overview and architecture
  • Data preparation using SQL: loading data (HDFS/HAWQ), transforming, filtering, handling missing values, joining tables
  • Using Procedural Languages
  • Query Plans and Optimization
  • Machine Learning with MADlib

YAVA - BIG DATA SOLUTION WITHIN YOUR REACH

© 2017 Labs247. All rights reserved.
YAVA logo and HGrid247 logo are registered trademarks or trademarks of the Labs247 Company.
HADOOP, the Hadoop Elephant Logo, Apache, Flume, Ambari, Yarn, Bigtop, Phoenix, Hive, Tez, Oozie, HBase, Mahout, Pig, Solr, Storm, Spark, Sqoop, Impala, and ZooKeeper are registered trademarks or trademarks of the Apache Software Foundation.