Tensor Calculus Part 1: From Vectors to Tensors – Multilinear Algebra
Introduction to tensors, their importance in ML, Einstein summation convention, and fundamental tensor algebraic operations like outer product and contraction.
A crash course on tensor calculus, focusing on definitions, notation, and operations essential for understanding advanced machine learning and optimization techniques in high-dimensional spaces.
Welcome to the Tensor Calculus Crash Course!
This mini-series is designed for learners who have a grounding in Linear Algebra, Multivariable Calculus, and the basics of Functional Analysis (as outlined in the preface and prerequisites section of the main “Mathematical Optimization Theory in ML” series). Tensors are fundamental mathematical objects for describing geometric and physical quantities, and they play an increasingly crucial role in modern machine learning. They provide a powerful framework for handling high-dimensional data, understanding complex transformations, and analyzing the geometric properties of model parameter spaces and loss landscapes.
In machine learning and optimization, we often encounter quantities like gradients of matrix-valued functions, Hessians in many dimensions, and concepts related to the curvature of optimization surfaces. Tensor calculus provides the natural language and tools to work with these objects rigorously and intuitively.
This crash course aims to:
Our focus will be on building intuition and understanding the practical relevance of these concepts for subsequent topics in the main “Mathematical Optimization Theory in ML” series, particularly for modules on Differential Geometry and Information Geometry.
This crash course is structured into the following parts:
We encourage you to work through the examples and think about how these concepts might apply to problems you’ve encountered in machine learning. Let’s begin!
Introduction to tensors, their importance in ML, Einstein summation convention, and fundamental tensor algebraic operations like outer product and contraction.
Understanding how tensor components transform under coordinate changes (covariance and contravariance, derived from basis transformations), and the fundament...
Introduction to tensor differentiation (covariant derivative), Christoffel symbols, and the role of tensors in characterizing ML concepts like gradients, Hes...
A concise summary of key definitions, notations, operations, transformation laws, and differentiation rules in Tensor Calculus, primarily drawing from the cr...