Categories
Important question news Notes question bank Question Paper R-2021 Syllabus

CCS334 Big Data Analytics [PDF]

Anna University – CCS334 Big Data Analytics Regulation 2021 Syllabus , Notes Book , Important Questions, Question Paper with Answers Previous Year Question Paper.

UNIT I UNDERSTANDING BIG DATA  CCS334 Big Data Analytics
Introduction to big data – convergence of key trends – unstructured data – industry examples of big
data – web analytics – big data applications– big data technologies – introduction to Hadoop – open
source technologies – cloud and big data – mobile business intelligence – Crowd sourcing analytics
– inter and trans firewall analytics.

UNIT II NOSQL DATA MANAGEMENT CCS334 Big Data Analytics
Introduction to NoSQL – aggregate data models – key-value and document data models –
relationships – graph databases – schemaless databases – materialized views – distribution models
– master-slave replication – consistency – Cassandra – Cassandra data model – Cassandra
examples – Cassandra clients

UNIT III MAP REDUCE APPLICATIONS
MapReduce workflows – unit tests with MRUnit – test data and local tests – anatomy of MapReduce
job run – classic Map-reduce – YARN – failures in classic Map-reduce and YARN – job scheduling
– shuffle and sort – task execution – MapReduce types – input formats – output formats.

UNIT IV BASICS OF HADOOP CCS334 Big Data Analytics
Data format – analyzing data with Hadoop – scaling out – Hadoop streaming – Hadoop pipes –
design of Hadoop distributed file system (HDFS) – HDFS concepts – Java interface – data flow –
Hadoop I/O – data integrity – compression – serialization – Avro – file-based data structures –
Cassandra – Hadoop integration.

UNIT V HADOOP RELATED TOOLS CCS334 Big Data Analytics
Hbase – data model and implementations – Hbase clients – Hbase examples – praxis.
Pig – Grunt – pig data model – Pig Latin – developing and testing Pig Latin scripts.
Hive – data types and file formats – HiveQL data definition – HiveQL data manipulation – HiveQL
queries.

Syllabus Click Here
Notes Click Here
Important Questions Click Here
Previous Year Question Paper Click Here
Question Bank Click Here

TEXT BOOKS: CCS334 Big Data Analytics notes
1. Michael Minelli, Michelle Chambers, and AmbigaDhiraj, “Big Data, Big Analytics:
Emerging Business Intelligence and Analytic Trends for Today’s Businesses”, Wiley,
2013.
2. Eric Sammer, “Hadoop Operations”, O’Reilley, 2012.
3. Sadalage, Pramod J. “NoSQL distilled”, 2013

REFERENCES: CCS334 Big Data Analytics important questions
1. E. Capriolo, D. Wampler, and J. Rutherglen, “Programming Hive”, O’Reilley, 2012.
2. Lars George, “HBase: The Definitive Guide”, O’Reilley, 2011.
3. Eben Hewitt, “Cassandra: The Definitive Guide”, O’Reilley, 2010.
4. Alan Gates, “Programming Pig”, O’Reilley, 2011

 

Leave a Reply

Your email address will not be published. Required fields are marked *