This course introduces the fundamental concepts and computational paradigms of scalable data management systems. The focus of this course is on the systems-oriented aspects and internals of such systems for storing, updating, querying, and analyzing large datasets. 

Topics include: 

Database Architectures 
Parallel and Distributed Databases 
Data Warehousing 
MapReduce and Hadoop 
Spark and its Ecosystem 
Optional: NoSQL Databases, Stream Processing, Graph Databases, Scalable Machine Learning