CP612
Data Management and Analysis
0.5 Credit

This course provides students with a broad overview of the concepts, techniques and tools of modern data management and analysis. It compares traditional relational databases with an alternative model (a NoSQL database), and explains how to choose the most appropriate means of storing and managing data, depending on the size and structure of a particular dataset and its intended use. Students are guided through the basics of using Hadoop with MapReduce and Spark. In data analysis, both theoretical concepts and algorithms in Association Mining, Classification, Clustering, etc., and a range of advanced applications, including Link Analysis, Recommender Systems, Computational Advertising, etc., are introduced.