fbpx
big_data_analytics_logo

NICF – Big Data Analytics (SF)

This five-day instructor-led course provides participants with concepts beyond the Big Data knowledge to get a head start with Hadoop.

Share on facebook
Share on twitter
Share on linkedin

Overview

Course Reference Number: CRS-Q-0033760-ICT

This five-day instructor-led course provides participants with concepts beyond the Big Data knowledge to get a head start with Hadoop. This course will also teach about data analysis using Hadoop Ecosystem for data analysts, business intelligence specialists, developers and system architects.

Associated CertificatioN(s)

Upon completion of the course, participants can take the exam on Cloudera Certified Associate (CCA) Data Analyst, MapR: Certified Data Analyst (MCDA) or Hortonworks HDP Certified Developer (HDPCD): Pig and Hive. These certifications are great differentiators to establish yourself as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.

Prerequisites

Prior knowledge of SQL is highly recommended. Linux knowledge will be helpful.

Who Should Attend?

This course is intended for executives, managers, consultants, business analysts, operation personnel, programmers, architects, administrators and data analysts who want a foundational overview of the key components required to effectively understand and analyse Big Data. Familiarity working with computers and business applications is assumed. Programming experience is beneficial but not required.

Course Outline

  • Why we need Hadoop
  • Why Hadoop is in demand in market nowadays
  • Where expensive SQL based tools are failing
  • Key Points, Why Hadoop is leading tool in current IT Industry Definition of Big Data
  • Hadoop nodes
  • Introduction to Hadoop Release-1
  • Hadoop Daemons in Hadoop Release-1
  • Introduction to Hadoop Release-2
  • Hadoop Daemons in Hadoop Release-2
  • Hadoop Cluster and Racks
  • Hadoop Cluster Demo
  • New projects on Hadoop
  • How Open Source tools is capable to run jobs in lesser time Hadoop Storage – HDFS (Hadoop Distributed file system) Hadoop Processing Framework (Map Reduce / YARN) Alternates of Map Reduce
  • Why NOSQL is in much demand instead of SQL
  • Distributed warehouse for HDFS
  • Hadoop Ecosystem and its usages
  • Data import/Export tools
  • Hadoop installation
  • Introduction to Hadoop FS and Processing Environment’s UIs How to read and write files
  • Basic Unix commands for Hadoop
  • Hadoop FS shell
  • Hadoop releases practical
  • Hadoop daemons practical
  • Pig-UDFs
  • Pig Use cases
  • Pig Assignment
  • Complex Use cases on Pig
  • Real time scenarios on Pig
  • When we should use Pig
  • When we shouldn’t use Pig
  • Hive Introduction
  • Meta storage and meta store
  • Introduction to Derby Database
  • Hive Data types
  • HQL
  • DDL, DML and sub languages of Hive
  • Internal, external and Temp tables in Hive
  • Differentiation between SQL based Datawarehouse and Hive
  • Hive releases
  • Why Hive is not best solution for OLTP OLAP in Hive
  • Partitioning
  • Bucketing
  • Hive Architecture
  • Thrift Server
  • Hue Interface for Hive
  • How to analyze data using Hive script Differentiation between Hive and Impala UDFs in Hive
  • Complex Use cases in Hive
  • Hive Advanced Assignment
  • How to load data streaming data without fixed schema
  • How to load unstructured and semi structured data in Hadoop Introduction to Flume
  • Hands-on on Flume
  • How to load Twitter data in HDFS using Hadoop
  • Introduction to Oozie
  • How to schedule jobs using Oozie
  • What kind of jobs can be scheduled using Oozie
  • How to schedule jobs which are time based
  • Hadoop releases From where to get
  • Hadoop and other components to install
  • Introduction to YARN
  • Significance of YARN
  • Introduction to NOSQL
  • Why NOSQL if SQL is in market since several years
  • Databases in market based on NOSQL CAP Theorem
  • ACID Vs. CAP
  • OLTP Solutions with different capabilities
  • Which Nosql based solution is capable to handle specific requirements Examples of companies that uses NOSQL based databases
  • HBase Architecture of column families
  • Introduction to Spark
  • Basics Features of SPARK and Scala available in Hue Why SPARK demand is increasing in market
  • How can we use Spark with Hadoop Eco System Datasets for practice purpose
  • YARN
  • Emerging Technologies of Big Data
  • Emerging use cases e.g. IoT, Industrial Internet, New Applications
  • Certifications and
  • Job Opportunities

Course fee:

Usual Price: S$3500
Promo Price: S$2450 (excluding GST)

 w/GST
Full Course Fees$3,745.00
Promo Fees$2,621.50
SG Citizen/PR aged ≥ 21 years$2,029.00
SG Citizen aged ≥ 40 years$646.50

 w/GST
Full Course Fees$3,745.00
Promo Fees$2,621.50
SG Citizen/PR aged ≥ 21 years$646.50
SG Citizen aged ≥ 40 years$646.50

 w/GST
Full Course Fees$3,745.00
Promo Fees$2,621.50
SG Citizen/PR aged ≥ 21 years$2,029.00
SG Citizen aged ≥ 40 years$646.50

Get Brochure and Pricing

More Like This

Get the course Brochure & Pricing

Our course consultant will contact you within 1 working day

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

Stay-Home-Series-main-header-v2

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

Open chat