-
Chapter 0 Course Introduction
-
Chapter 1 Introduction & Overview
-
Chapter 2 Software Foundation
-
Chapter 3 Data Storage & Management
-
● 3.1 Distributed System
-
● 3.2 HDFS
-
● 3.3 NoSQL
-
● 3.4 HBase
-
● 3.5 Hive
-
● 章节习题
-
Chapter 4 Data Preprocessing
-
● 4.1 Data CLeaning
-
● 4.2 Data Integration & Transformation
-
● 4.3 Data Reduction & Masking
-
● 4.4 Data Preprocessing Practice
-
● Exp 3.1
-
● Exp 3.2
-
● 章节习题
-
Chapter 5 Data Analysis
-
● 5.1 Data Analysis
-
● 5.2 Supervised Leaning Algorithm1
-
● 5.3 Supervised Learning Algorithm2
-
● 5.4 Unsupervised Learning Algorithm
-
● 5.5 Model Evaluation
-
● 5.6 Artificial Neural Networks
-
● Exp4.1 EDA
-
● Exp4.2 ML
-
● 章节习题
-
Chapter 6
-
● 6.1 Data Visualization Methodology
-
● 6.2 Data Charts
-
● 6.3 Data Visualization Practice
-
● 6.4 Machine Learning Practice
-
Chapter 7 Processing Framework
-
● 7.1 MapReduce
-
● 7.2 Spark Basics
-
● 7.3 RDD Operations
-
● 7.4 Stream Processing1
-
● 7.5 Stream Processing2
-
● 章节习题
-
Chapter 8 Data Governance & Data Security and Privacy Protection