Keynote & Introduction
What is Big Data
How big is big, why does big matter, and what does Hadoop have to do with it?
Play with large data sets and see Hadoop’s method of distributed storage: the Hadoop Distributed File System (HDFS).
Introduction to Functional Programming
Scala, short for scalable language, is future-ready, but are you? Move beyond your regular Java and check out what is brewing in this language.
As Hobbes is to Calvin, so Hadoop is to Big Data. Come on this journey to play with large data sets and see Hadoop’s method of distributed processing with:
1. MapReduce and YARN: Build your understanding of Yet Another Resource Negotiator (YARN) and gain exposure to MapReduce, the tool sets that start the processing of Big Data.
2. Moving Data into Hadoop: Open the door to move data into Hadoop and get the program working for you. This session’s emphasis is on Sqoop.
3. Accessing Hadoop Data using Hive: This topic sets the stage for how to query and analyze data using Apache Hive.
4. Workflow Management: This topic covers managing Apache Hadoop jobs with Oozie, a workflow scheduler system.
Introduction to Apache Spark, with a hands-on workshop on testing Spark-based applications using ScalaTest
Ignite your interest in Spark with an introduction to the core concepts that make this general-purpose processing engine an essential tool set for working with Big Data. The hands-on workshop covers testing travel e-commerce website logs using ScalaTest.
Tea and Networking
ThoughtWorks Technologies (India) Pvt Ltd, 6th Floor, Tower B, Bldg. 14, DLF Cyber City, Phase III, Sector 24 & 25, Gurgaon-122002 (Haryana)