Get in Touch

Course Outline

  1. Fundamentals of Big Data
    • Big Data and its role in the corporate world
    • The phases of developing a Big Data strategy within a corporation
    • The rationale behind a holistic approach to Big Data
    • Essential components of a Big Data Platform
    • Big Data storage solutions
    • Limits of traditional technologies
    • Overview of database types
    • The four dimensions of Big Data
  2. The Impact of Big Data on Business
    • The business significance of Big Data
    • Challenges associated with extracting useful data
    • Integrating Big Data with traditional data sources
  3. Big Data Storage Technologies
    • Overview of Big Data technologies
      • Data storage models
      • Hadoop
      • Hive
      • Cassandra
      • MongoDB
    • Selecting the appropriate Big Data technology
  4. Processing Big Data
    • Connecting to and extracting data from databases
    • Transforming and preparing data for processing
    • Utilizing Hadoop MapReduce for distributed data processing
    • Monitoring and executing Hadoop MapReduce jobs
    • Building blocks of the Hadoop Distributed File System
    • MapReduce and YARN
    • Handling streaming data with Spark
  5. Big Data Analysis Tools and Technologies
    • Programming Hadoop with Pig Latin
    • Querying Big Data with Hive
    • Data mining with Mahout
    • Visualization and reporting tools
  6. Implementing Big Data in Business
    • Managing and establishing Big Data requirements
    • Business importance of Big Data
    • Choosing the right Big Data tools for specific problems

Data Warehousing Concepts

  • What is a Data Warehouse?
  • Differences between OLTP and Data Warehousing
  • Data Acquisition
  • Data Extraction
  • Data Transformation
  • Data Loading
  • Data Marts
  • Dependent vs. Independent Data Marts
  • Database design

ETL Testing Concepts:

  • Introduction
  • Software Development Life Cycle
  • Testing methodologies
  • ETL Testing Workflow Process
  • ETL Testing Responsibilities in Data Stage

Big Data Fundamentals

  • Big Data and its role in the corporate world
  • The phases of developing a Big Data strategy within a corporation
  • The rationale behind a holistic approach to Big Data
  • Essential components of a Big Data Platform
  • Big Data storage solutions
  • Limits of traditional technologies
  • Overview of database types

NoSQL Databases

Hadoop

MapReduce

Apache Spark

Requirements

Participants should possess a basic understanding of storage tools and have some experience handling large datasets.

 14 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories