Mathematical Foundations for Data Analysis

This book was developed from two courses. Lecture material is provided here from recent versions.

This course (UofU Fall 2020) covers most of Chapters 1,2,3,5,6,7,8, and 9 and is taught to mainly college sophomores and juniors. The Lecture Video Playlist is on Youtube, and was taught over Zoom. The following has links to slides written on iPad during the lecture, and sometimes Colab python code.

- Lecture 1 : Class Overview w/ Video
- Lecture 2 : Probability Review : Sample Space, Random Variables, Independence (Chap 1-1.2) w/ Video and Code
- Lecture 3 : Probability Review : PDFs, CDFs, Expectation, Variance, Joint and Marginal Distributions (Chap 1.3-1.6) w/ Video
- Lecture 4 : Bayesâ€™ Rule : Maximum Likelihood (Chap 1.7) w/ Video
- Lecture 5 : Bayesian Inference (Chap 1.8) w/ Video
- Lecture 6 : Convergence : Central Limit Theorem and Estimation (Chap 2.1-2.2) w/ Video and Code
- Lecture 7 : Convergence : PAC Algorithms and Concentration of Measure (Chap 2.3) w/ Video
- Lecture 8 : Linear Algebra Review : Vectors, Matrices, Multiplication and Scaling (Chap 3.1-3.2) w/ Video
- Lecture 9 : Linear Algebra Review : Norms, Linear Independence, Rank (Chap 3.3-3.5) w/ Video and Code
- Lecture 10 : Linear Algebra Review : Inverse, Orthogonality (Chap 3.6-3.8) w/ Video
- Lecture 11 : Linear Regression : explanatory & dependent variables (Chap 5.1) w/ Video and Code
- Lecture 12 : Linear Regression : multiple regression, polynomial regression (Chap 5.2-5.3) w/ Video and Code for multi-regression and for poly regression
- Lecture 13 : Linear Regression : overfitting and cross-validation (Chap 5.4) w/ Video and Code
- Lecture 14 was a review
- Lecture 15 : Gradient Descent : functions, minimum, maximum, convexity & gradients (Chap 6.1-6.2) w/ Video
- Lecture 16 : Gradient Descent : algorithmic & convergence (Chap 6.3) w/ Video and Code
- Lecture 17 : Gradient Descent : fitting models to data and stochastic gradient descent (Chap 6.4) w/ Video
- Lecture 18 : Dimensionality Reduction : SVD (Chap 7.1-7.2) w/ Video
- Lecture 19 : Dimensionality Reduction : rank-k approximation and eigenvalues (Chap 7.2-7.3) w/ Video and Code
- Lecture 20 : Dimensionality Reduction : power method (Chap 7.4) w/ Video and Code
- Lecture 21 : Dimensionality Reduction : PCA, centering, and MDS (Chap 7.5-7.6) w/ Video and Code for centering and for MDS
- Lecture 22 : Clustering : Voronoi Diagrams + Assignment-based Clustering (Chap 8.1) w/ Video and Code
- Lecture 23 : Clustering : k-means (Chap 8.3) w/ Video
- Lecture 24 : Clustering : EM, Mixture of Gaussians, Mean-Shift (Chap 8.4,8.7) w/ Video
- Lecture 25 : Classification : Linear prediction (Chap 9.1) w/ Video
- Lecture 26 : Classification : Perceptron Algorithm (Chap 9.2) w/ Video
- Lecture 27 : Classification : Kernels and SVMs (Chap 9.3) w/ Video
- Lecture 28 : Classification : KNN, Decision Trees, Neural Nets (Chap 9.5,9.6,9.7) w/ Video

This course (mostly from UofU Spring 2020) covers most of Chapters 4, 5, 7, 8, 10, and 11 and is taught to college seniors and first year graduate students. It mostly follows this Lecture Playlist on YouTube, partially taught on Zoom. The following links to slides written on iPad during the lecture.

- Lecture 1 : Class Overview w/ Video
- Lecture 2 : Statistics Principles (Chap 2.2-2.3) w/ Video
- Lecture 3 : Similarity : Jaccard + k-Grams (Chap 4.3-4.4) w/ Video
- Lecture 4 : Similarity : Min Hashing (Chap 4.6.6) w/ Video
- Lecture 5 : Similarity : LSH (Chap 4.6) w/ Video
- Lecture 6 : Similarity : Distances (Chap 4-4.3)w/ Video
- Lecture 7 : Similarity : Word Embeddings + ANN vs. LSH (Chap 4.4) w/ Video
- Lecture 8 : Clustering : Hierarchical (Chap 8.5, 8.2) w/ Video
- Lecture 9 : Clustering : K-Means (Chap 8-8.3) w/ Video
- Lecture 10 : Clustering : Spectral (Chap 10.3) w/ Video
- Lecture 11 : Streaming : Model and Misra-Greis (Chap 11.1 - 11.2) w/ Video
- Lecture 12 : Streaming : Count-Min Sketch, Count Sketch (Chap 11.2) w/ Video
- Lecture 13 : Regression : Basics in 2-dimensions (Chap 5-5.3) w/ Video
- Lecture 14 : Regression : Lasso + MP + Compressed Sensing (Chap 5.5) w/ Video
- Lecture 15 : Regression : Cross-Validation and p-values (Chap 5.4) w/ Video
- Lecture 16 : Dim Reduce : SVD + PCA (Chap 7 -7.3, 7.5) w/ Video
- Lecture 17 : Dim Reduce : Matrix Sketching (Chap 11.3) w/ Video
- Lecture 18 : Dim Reduce : Metric Learning (Chap 7.6-7.8) w/ Video
- Lecture 19 : Dim Reduce : Random Projection + Noise in Data (Chap 7.10, 8.6) w/ Video
- Lecture 20 was merged into above lectures, compressed due to COVID
- Lecture 21 was a detour on privacy and related ethical concerns
- Lecture 22 : Graph Analysis : Markov Chains (Chap 10.1) w/ Video
- Lecture 23 : Graph Analysis : PageRank (Chap 10.2) w/ Video
- Lecture 24 : Graph Analysis : Communities (Chap 10.4) w/ Video