IDC 6145 Big Data for Data Science

Course Number:

IDC 6145

Credit hours:

3

Prerequisites:

Basic programming knowledge, Statistics fundamentals

Course Description

This course provides a comprehensive introduction to Big Data concepts and technologies in the context of Data Science. Students will learn about distributed computing, parallel processing, and how to handle data at scale using modern Big Data frameworks.

Learning Objectives

After completing this course, students will be able to:

  • Understand core Big Data concepts and the Big Data lifecycle

  • Implement MapReduce algorithms using Hadoop

  • Develop data processing applications using Apache Spark

  • Apply distributed computing principles to solve real-world problems

  • Design and implement scalable data processing pipelines

  • Evaluate and optimize Big Data solutions

Modules

Summary

Exam Outline

Assignment Instructions