Delivered by Simplilearn in collaboration with IBM, this Big Data Engineer Bootcamp program is designed to help build comprehensive knowledge and job ready skills in the world of big data. Over 7 courses, you will build an in-depth understanding of working with big data engineering tools such as Data Model Creation, Database Interfaces, Advanced Architecture, Scala, Spark SQL, Flume and more.
The course will equip you with big data model data, perform ingestion, replicate data, and share data using a NoSQL database management system MongoDB and hands-on experience working with data integration tools such as Kafka Connect.
By the end of the Big Data Engineer certification course, you will receive the certificates from IBM (for IBM Data Engineering courses) and Simplilearn for each individual courses in the learning path. These certificates is a demonstration of your skills as an expert in Big Data Engineering after the completion of training, helping you gain a competitive edge in your portfolio.
- Course 1 - Big Data for Data Engineering
This Big Data for Data Engineering delivered by IBM will equip you with the in-demand techniques of Big Data, including data ingestion and processing through Apache Hadoop framework. By the end of this Data Engineering course, you will gain insights on how to improve business productivity by processing large volumes of data and extracting valuable information from them.
-Lesson 1: Welcome
-Lesson 2: What is Big Data
-Lesson 3: Beyond the Hype
-Lesson 4: Big Data and Data Science
-Lesson 6: Processing Big Data
-Lesson 7: Course Summary
Free course: Data Engineering with Hadoop
-Lesson 1: Learning Objectives
-Lesson 2: Introduction to Hadoop
-Lesson 3: Hadoop Architecture and HDFS
-Lesson 4: Hadoop administration
-Lesson 5: Hadoop Components
Free course: Data Engineering with Scala
-Lesson 1: Learning Objectives
-Lesson 2: Introduction
-Lesson 3: Basic Object Oriented Programming
-Lesson 4: Case Objects and Classes
-Lesson 5: Collections
-Lesson 6: Idiomatic Scala
Course delivery format: online learning only
- Course 2 - Big Data Hadoop and Spark Developer
Master the skills of Big Data Hadoop framework and implement Big Data tools and methodologies in this course delivered by Simplilearn. Learn how to work with Spark applications, parallel processing and functional programming in real world projects. At the end of this course, you will acehive a Big Data Hadoop certification that will bolster your job ready skills and help you stand out from the crowd.
-Lesson 1: Course Introduction
-Lesson 2: Introduction to Big Data and Hadoop
-Lesson 3: Hadoop Architecture,Distributed Storage (HDFS) and YARN
-Lesson 4: Data Ingestion into Big Data Systems and ETL
-Lesson 5: Distributed Processing - MapReduce Framework and Pig
-Lesson 6: Apache Hive
-Lesson 7: NoSQL Databases - HBase
-Lesson 8: Basics of Functional Programming and Scala
-Lesson 9: Apache Spark Next Generation Big Data Framework
-Lesson 10: Spark Core Processing RDD
-Lesson 11: Spark SQL - Processing Data Frames
-Lesson 12: Spark MLLib - Modelling Big Data with Spark
-Lesson 13: Stream Processing Frameworks and Spark Streaming
-Lesson 14: Spark GraphX
-Practice Projects
Free Course: Core Java
-Lesson 1: Introduction to Java 11 and OOPs Concepts
-Lesson 2: Utility Packages and Inheritance
-Lesson 3: Multithreading Concepts
-Lesson 4: Debugging Concepts
-Lesson 5: JUnit
-Lesson 6: Java Cryptographic Extensions
-Lesson 7: Design Pattern
Free Course: Linux Training
-Lesson 1: Course Introduction
-Lesson 2: Introduction to Linux
-Lesson 3: Ubuntu
-Lesson 4: Ubuntu Dashboard
-Lesson 5: File System Organization
-Lesson 6: Introduction to CLI01
-Lesson 7: Editing Text Files and Search Patterns
-Lesson 8: Package Management
-Practice Project
Course delivery format: online learning + live virtual classes
- Course 3 - PySpark Training Course
Prepare to take your Python coding to the next level as you master PySpark, the powerful development framework for Big Data. We'll teach you the data science tools and techniques required to become a successful PySpark developer.
-Lesson 1: A Brief Primer on PySpark
-Lesson 2: Resilient Distributed Datasets
-Lesson 3: Resilient Distributed Datasets and Actions
-Lesson 4: DataFrames and Transformations
-Lesson 5: Data Processing with Spark DataFrames
Free Course: Python for Data Science
Course delivery format: online learning only
- Course 5 - MongoDB Developer and Administrator
MongoDB is a reliable and scalable NoSQL database that helps businesses to handle massive data in order to achieve business priorities. This online certification course will help you learn how to use MongoDB as a document-oriented database, including various concepts of MongoDB, basic query language and other advanced features. This MongoDB course delivered by Simplilearn is integrated with industry projects, lab exercises and various demonstrations to help explain key concepts, helping you build your skillset in Big Data.
-Course Introduction
-Lesson 1: Introduction to NoSQL databases
-Lesson 2: MongoDB A Database for the Modern Web
-Lesson 3: CRUD Operations in MongoDB
-Lesson 4: Indexing and Aggregation
-Lesson 5: Replication and Sharding
-Lesson 6: Developing Java and Node JS Application with MongoDB
-Lesson 7: Administration of MongoDB Cluster Operations
Course delivery format: online learning + live virtual classes
- Course 6 - AWS Data Analytics Certification Training
This AWS Data Analytics course will set you up to pass the industry-recognised AWS Certified Data Analytics Specialty exam. Developed by industry experts, this AWS course will help you gain a strong understanding of various AWS concepts including: AWS QuickSight, AWS lambda and Glue, S3 and DynamoDB, Redshift, Hive on EMR and more.
Section 1 - Self-paced Curriculum
-Lesson 1: Introduction
-Lesson 2: Domain 01 - Collection
-Lesson 3: Domain 02 - Storage
-Lesson 4: Domain 03 - Processing
-Lesson 5: Domain 04 - Analysis
-Lesson 6: Domain 05 - Visualization
-Lesson 7: Domain 06 - Security
-Lesson 8: Everything Else
-Lesson 9: Preparing for the Exam
-Lesson 10: Appendix - Machine Learning Topics for the Amazon Web Services AWS Certified Big Data Exam
-Lesson 11: Wrapping Up
Section 2 - Live Virtual Class Curriculum
-Lesson 1 - Course Introduction
-Lesson 2 - AWS in Big Data Introduction
-Lesson 3 - Collection
-Lesson 4 - Storage and Data Management
-Lesson 5 - Processing - I
-Lesson 6 - Processing - II
-Lesson 7 - ETL with Redshift
-Lesson 8 - Analysis with Machine Learning
-Lesson 9 - Analysis and Visualization
-Lesson 10 - Security
-Practice Project
- Free Course - AWS Technical Essentials
-Lesson 1: Introduction to Cloud Computing
-Lesson 2: First Steps into Amazon Web Services
-Lesson 3: Identity and Access Management (IAM)
-Lesson 4: Networking in AWS - Virtual Private Clouds
-Lesson 5: Elastic Compute Cloud (EC2)
-Lesson 6: AWS Storage
-Lesson 7: Load Balancing and Autoscaling
-Lesson 8: DNS and Content Delivery Networks
-Lesson 9: Monitoring, Auditing and Alerts
-Lesson 10: Databases
-Lesson 11: Serverless Computing
-Lesson 12: Security and Compliance
-Lesson 13: AWS Pricing, Billing, and Support Services
-Lesson 14: Conclusion
-Practice Project
Course delivery format: online learning + live virtual classes
- Course 7 - Big Data Capstone
This project will give you the opportunity to apply your skills from the previous weeks in a real-world project. With expert led mentor sessions, you'll be expected to solve a real-industry aligned problem and build skills that are employable in the world of IT. By the end of this capstone project, you'll have developed an employable skill set and an opportunity to showcase your talent to your employers.
-Lesson 1: Data Engineer Capstone
Course delivery format: online learning + live virtual classes
Completion Certificate
You will gain individual certificates after completing each course
Optional electives
Optional electives are available as part of this Big Data Engineer Bootcamp Program.
These are not mandatory to complete, but are available as additional courses to study if you are interested in
expanding your knowledge and further implementing your skills.
- Elective 1: AWS Technical Essentials
- Elective 2: Java Certification Training
- Elective 3: Industry Master Class - Data Engineering