Big Data Tools for Machine Learning
|Lecture hours per week||2|
|Lab hours per week||2|
In this course, students will be introduced to large scale learning: distributed learning. The concepts of distributed storage systems and parallel processing will be discussed. Storage types for big data (NoSQL) and big data tools (Hadoop eco system, YARN, Apache Spark, and Apache Mahout) will be explained and students will gain hands-on experience by applying the big data tools in real world applications.