Mathematical Notation

Published

26/06/2026

A glossary of the mathematical notation used throughout the course, in roughly the order it first appears. Use this as a lookup for any symbol whose meaning isn’t obvious from context.

Variables and indexing conventions

Symbol	Meaning
t	time (independent variable)
x, y, z	spatial coordinates
\mathbf{x} = (x, y, z)	spatial position vector
z = 0	sea surface (in the column model, Unit 8)
z = -H	column bottom (depth H below the surface)
\eta(x, y, t)	sea-surface elevation anomaly (Unit 1, Unit 6)
T(z, t)	temperature, column model (Unit 9)
u, v	horizontal velocity components
w	vertical velocity component
\mathbf{u} = (u, v, w)	full velocity vector
\mathbf{u}_h = (u, v)	horizontal velocity vector
\psi(t)	unknown boundary forcing (Unit 1’s river source)
\theta, \phi	network parameter vectors (PINN unknowns)

Indices: on the continuous side, i indexes collocation / sample points, j components, and k time steps. On the discrete grid the same letters are reused: i, j run over (x, y) and n over t (e.g. u_i^n for the value at grid cell i, time step n).

Derivatives

Notation	Meaning
\dot{x}, \ddot{x}	first and second time derivatives, \frac{dx}{dt} and \frac{d^2 x}{dt^2}
\dfrac{\partial f}{\partial x}	partial derivative of f with respect to x
\partial_x f, f_x	shorthand for \partial f / \partial x
\partial_t f, \partial_z f	likewise for t and z
\partial^2_x f, f_{xx}	second partial derivative \partial^2 f / \partial x^2
\nabla f	gradient: (\partial_x f, \partial_y f, \partial_z f)
\nabla\!\cdot\!\mathbf{u}	divergence: \partial_x u + \partial_y v + \partial_z w
\nabla \times \mathbf{u}	curl
\nabla^2 f	Laplacian: \partial^2_x f + \partial^2_y f + \partial^2_z f
\nabla_h = (\partial_x, \partial_y)	horizontal gradient
\nabla_h\!\cdot\!\mathbf{u}_h	horizontal divergence

Vector and tensor operations

Notation	Meaning
\mathbf{a}\!\cdot\!\mathbf{b}	dot product (scalar): \sum_i a_i b_i
\mathbf{a} \otimes \mathbf{b}	outer / tensor product
\\|\mathbf{a}\\|, \\|\mathbf{a}\\|_2	Euclidean norm: \sqrt{\mathbf{a}\!\cdot\!\mathbf{a}}
\\|f\\|_{L^2}	L^2 norm of a function: \sqrt{\int f^2}
\hat{\mathbf{n}}	outward unit normal to a boundary
\mathbf{u}\!\cdot\!\hat{\mathbf{n}} = 0	no-flux (impermeable) boundary condition
A^\top	matrix transpose
A^{-1}	matrix inverse
\det(A), \mathrm{tr}(A)	determinant, trace

Probability, statistics, loss functions

Notation	Meaning
\mathbb{E}[X]	expectation of random variable X
\mathrm{Var}(X), \sigma^2	variance
\mathcal{N}(\mu, \Sigma)	Gaussian with mean \mu, covariance \Sigma
\sim	“distributed as” (e.g. X \sim \mathcal{N}(0, 1))
\mathcal{L}(\theta)	loss function evaluated at parameters \theta
\hat R(f), R(f)	empirical risk, true risk (Unit 2 §2.1)
\ell(\hat y, y)	per-sample loss
\arg\min_\theta f(\theta)	the \theta that minimises f

Network architectures and PINN-specific symbols

Notation	Meaning
u_\theta, T_\theta, \eta_\theta	neural-network parameterisations of a field
\theta	parameter vector of an MLP / KAN
\sigma	activation function (sigmoid, \tanh, ReLU, Swish, …)
W_\ell, b_\ell	weights and biases of layer \ell
\mathcal{L}_{\text{PDE}}	PDE-residual loss term
\mathcal{L}_{\text{BC}}	boundary-condition loss term
\mathcal{L}_{\text{IC}}	initial-condition loss term
\mathcal{L}_{\text{data}}	data-misfit loss term
\lambda_r, \lambda_b, \lambda_i, \lambda_d	weights on the above
\{t_i\}_{i=1}^{N_r}, \{(\mathbf{x}_i, t_i)\}	collocation points
r_\theta(\mathbf{x}, t)	residual of the PDE evaluated at (\mathbf{x}, t)
\gamma(\mathbf{x}) = [\sin(B\mathbf{x}), \cos(B\mathbf{x})]	Fourier-feature embedding (Unit 7 §7.3)

Optimisation

Notation	Meaning
\eta (learning rate)	step size in gradient descent (not the surface elevation!)
\nabla_\theta \mathcal{L}, g_t	gradient of the loss with respect to parameters
m_t, v_t	first- and second-moment estimates in Adam
\beta_1, \beta_2	exponential-decay rates in Adam
H_t, B_t	true Hessian / quasi-Newton approximation in L-BFGS
\mu	momentum coefficient
Adam, L-BFGS, SGD	named optimisers (Unit 2 §2.6)

Symbol re-use across units

A handful of letters carry different meanings in different contexts; check the local section:

\eta is the learning rate in optimisation and the surface-elevation anomaly in shallow-water physics.
\kappa is eddy diffusivity in oceanography and a matrix condition number in numerical analysis.
\sigma is the sigmoid activation, the standard deviation of noise, and a Rayleigh-damping rate depending on context.
H is column depth (m) in the column model and the Hessian matrix in optimisation.
N is number of training samples, the grid size, and the buoyancy frequency depending on context.

PDE classification quick-reference

For a second-order linear PDE a\, u_{xx} + 2b\, u_{xy} + c\, u_{yy} + \ldots = 0:

Type	Condition b^2 - ac	Canonical example
Elliptic	< 0	Laplace: u_{xx} + u_{yy} = 0
Parabolic	= 0	Heat: u_t = \alpha\, u_{xx}
Hyperbolic	> 0	Wave: u_{tt} = c^2\, u_{xx}

(Full discussion in Unit 6 §6.1.)

Common dimensionless groups

Group	Definition	Interpretation
Reynolds Re	UL/\nu	inertia vs. viscosity
Péclet Pe	UL/\kappa	advection vs. diffusion
Richardson Ri	N^2/(\partial_z U)^2	buoyancy vs. shear
Courant–Friedrichs–Lewy CFL	c\,\Delta t/\Delta x	numerical wave speed
Strouhal St	fL/U	unsteadiness vs. advection