BIG DATA

Training, project work, and hands-on practice in Big Data technologies from the industry's top experts

19990

Overview of BIG DATA

Data Science and Big Data are fast becoming pervasive in all industries and are receiving direct attention from CEOs worldwide. With a shortage of skilled professionals in this area, Big Data skills are viewed as hot property by managers and recruiters. The 3-day workshop organized and delivered by BITSKART is an opportunity for interested engineers and professionals to get hands-on experience with important components of the Big Data landscape. The workshop is designed and delivered by a team of engineers and scientists who have built award-winning products. It is aimed at both students and professional engineers who want to become familiar with Big Data technologies.

Table of contents

TOPICS

TIME IN HOURS

Introduction to File Systems

Distributed file system

Meet Hadoop.

Comparison With Other Systems

   RDBMS

   GRID COMPUTING

   VOLUNTEER COMPUTING

2

HDFS Overview and Design.

HDFS Concepts

   BLOCK

   NameNode

   DataNode

   HDFS Federation

   High Availability

HDFS CLI

Java Interface

Copying with DistCp

2

Hadoop I/O

  Integrity

  Serialization

  Compression

  Avro

1

Hadoop Single node setup (hands on)

Hadoop Cluster Setup. (hands on)

5

Understanding MapReduce

MapReduce Job Run Anatomy.

MR Scheduling

MR Failures

Mapper

Reducer

Combiner

Partitioner

MR Formats

 Input/Output

MR Features

 Counters

 Sorting

 Joins

4

 


MapReduce (Writing MapReduce Using the Java API) (hands on)

6

PIG:

Data Model

Pig Latin (I/O, Relational Operators, UDF)

Advanced Pig Latin

Embedding Pig Latin in Python

Eval and Filter Functions

4

PIG HANDS ON:

2


 

HIVE:

Data Types and Formats

HQL(DDL)

HQL(DML)

HQL Indexes

Partitioning

Hive UDF

Thrift Service

Security

Tuning

Integration With Oozie

4

HIVE HANDS ON:

3

NoSQL:

Problems With RDBMS

CAP Theorem

ACID vs. BASE

1

HBASE:

Architecture

   Seek vs. Transfer

   Write Ahead Log

Tables, Rows, Columns, Cells

AutoSharding

CRUD

Regions

Region Lifecycle

Replication

Performance Tuning

3

HBASE HANDS ON:

3

Cassandra:

Data Model

Keyspaces

Column Family

Super Column Family

Architecture:

   P2P

   Gossip And Failure Detection

   Anti-Entropy and Read Repair

   Hinted Handoff

Read And Write

3

CASSANDRA HANDS ON:

2

 

DATA SETS TO WORK WITH:

Twitter Data Set

NYSE Data Set

All India Consumer Price Index Numbers

Current Daily Price of Various Commodities from Various Markets

 
