IDC 6145 Big Data for Data Science¶
- Course Number:
IDC 6145
- Credit hours:
3
- Prerequisites:
Basic programming knowledge, Statistics fundamentals
Course Description¶
This course provides a comprehensive introduction to Big Data concepts and technologies in the context of Data Science. Students will learn about distributed computing, parallel processing, and how to handle data at scale using modern Big Data frameworks.
Learning Objectives¶
After completing this course, students will be able to:
Understand core Big Data concepts and the Big Data lifecycle
Implement MapReduce algorithms using Hadoop
Develop data processing applications using Apache Spark
Apply distributed computing principles to solve real-world problems
Design and implement scalable data processing pipelines
Evaluate and optimize Big Data solutions