Complex (Cognitive) Systems - Autonomous University of the State of Morelos

work in progress...

This marvelous course by Prof. Markus Müller revealed like a hidden treasure in the most unexpected of places: a social-sciences and humanities-oriented postgrad program, in random underfunded university (the cognitive science masters degree from the Autonomous State University of Morelos, Mexico). The aim of self-publishing notes and assignments, and assembling them into a coherent whole, goes beyond giving everyone brave enough¹ a shot to enjoy it. It also sits as a personal reminder of my interest at the intersection of mathematical modeling, biophysics and the mind.

the brain meets the operational definition of what a complex² system is:

hierarchical organisation, scale-free structure, many-to-many relations among components, interconnectedness (recurrence)
non-linear behaviour, emergent properties (hard to determine from the observation of components alone, which number in the millions)
chaotic to initial conditions (slight differences lead to widely different states)
non-stationary measurements (dynamical system whose model constants/parameters actually vary; system doesn't stay in an attractor)
phase transitions, equilibrium depends on critical states
noisy measurements, plus some rather stochastic processes (spiking), whose measurable fluctuations may be difficult to distinguish from chaotic determinism

• dynamical systems basics

we'll distinguish between two kinds of systems:

stochastic
dynamical (deterministic). e.g. the harmonic oscillator:

• harmonic oscillator

consider the mechanics of a mass attached to a spring (no other forces involved, e.g. friction, gravity). let's say this is a very simple brain model (or of one of its components thereof) whose activity behaves like an oscillating variable:

(Image "Figure 29") — Figure 29 - source

from Newton's second law and Hooke's law:

$F = m a = - k x$ (TeX formula: F = ma = -kx )

therefore³

$m \frac{d^{2} x}{d t^{2}} = - k x$ $(TeX formula: m \frac{d^2x}{dt^2} = -kx )$

$\ddot{x (t)} = - \frac{k}{m} x$ $(TeX formula: \ddot{x(t)} = - \frac{k}{m}x )$

$\ddot{x (t)} = - ω_{0}^{2} x$ $(TeX formula: \ddot{x(t)} = - ω_0^2 x )$

in order to solve the second-order linear ordinary homogeneous differential equation, we partition it into a system of two first-order equations. let $v = \dot{x}$ $(TeX formula: v = \dot{x})$ , therefore:

${\begin{matrix} \dot{v} = - ω_{0}^{2} x \\ \dot{x} = v \end{matrix}$ $(TeX formula: \left\{ \begin{array}{cc} \dot{v} = -ω_0^2 x \\ \dot{x} = v \\ \end{array} \right. )$

${\begin{matrix} \dot{v} = - ω_{0}^{2} \int v \\ \dot{x} = \int - ω_{0}^{2} x = - ω_{0}^{2} \int x \end{matrix}$ $(TeX formula: \left\{ \begin{array}{cc} \dot{v} = -ω_0^2 ∫v \\ \dot{x} = ∫-ω_0^2 x = -ω_0^2∫x \\ \end{array} \right. )$

notice that with the last substitutions both equations are the same, except in terms of different variables. both sin(ω₀t) and cos(ω₀t) satisfy it, because:

$\frac{d}{d t} s i n (ω_{0} t) = ω_{0} c o s (ω_{0} t) = - ω_{0}^{2} \int s i n (ω_{0} t) d t$ $(TeX formula: \frac{d}{dt} sin(ω_0 t) = ω_0 cos(ω_0 t) = -ω_0^2 ∫sin(ω_0t)dt )$

and $\frac{d}{d t} c o s (ω_{0} t) = - ω_{0} s i n (ω_{0} t) = - ω_{0}^{2} \int c o s (ω_{0} t) d t$ $(TeX formula: \frac{d}{dt} cos(ω_0 t) = -ω_0 sin(ω_0 t) = -ω_0^2 ∫cos(ω_0t)dt )$

more generally, solutions will conform to the linear superposition principle:

$x (t) = a s e n (ω_{0} t) + b c o s (ω_{0} t); a, b \in ℝ$ $(TeX formula: x(t) = a sen(ω_0t) + b cos(ω_0t); \; a,b ∈ ℝ)$

any possible function of position on time will look sinusoidal, of varying frequency (depending on $\sqrt{k / m}$ $(TeX formula: \sqrt{k/m})$ ), amplitude and phase (a and b together). we rewrite it as:

$x (t) = \sqrt{a^{2} + b^{2}} [\frac{a}{\sqrt{a^{2} + b^{2}}} s e n (ω_{0} t) + \frac{b}{\sqrt{a^{2} + b^{2}}} c o s (ω_{0} t)] = A [c_{1} s e n (ω_{0} t) + c_{2} s e n (ω_{0} t)]$ $(TeX formula: x(t) = \sqrt{a^2 + b^2} \left[ \frac{a}{\sqrt{a^2 + b^2}} sen(ω_0t) + \frac{b}{\sqrt{a^2 + b^2}} cos(ω_0t) \right] = A[c_1 sen(ω_0t) + c_2 sen(ω_0t)] )$

this has the property that $- 1 < = c_{1}, c_{2} < = 1$ (TeX formula: -1 <= c_1, c_2<= 1) , and $c_{1}^{2} + c_{2}^{2} = 1$ (TeX formula: c_1^2 + c_2^2 = 1) .

let $φ_{0}$ (TeX formula: φ_0) be the initial phase. then $\exists φ_{0} (\frac{a}{\sqrt{a^{2} + b^{2}}} = c o s (φ_{0}) \land \frac{b}{\sqrt{a^{2} + b^{2}}} = c o s (φ_{0}))$ $(TeX formula: ∃ φ_0 \; \left( \frac{a}{\sqrt{a^2 + b^2}} = cos(φ_0) ∧ \frac{b}{\sqrt{a^2 + b^2}} = cos(φ_0)\right))$

so:

$x (t) = A [c o s (φ_{0}) s e n (ω_{0} t) + s e n (φ_{0}) s e n (ω_{0} t)] =$ (TeX formula: x(t) = A[cos(φ_0) sen(ω_0t) + sen(φ_0) sen(ω_0t)] = )

$A [s e n (ω_{0} t + φ_{0})]$ (TeX formula: A[sen(ω_0t + φ_0)] )

graphically:

(Image "Figure 22") — Figure 22 - simple harmonic oscillating movement and its parameters (source)

• initial conditions

finally, let's say the spring is totally relaxed at t=0, so $x (0) = A = A [s e n (φ_{0})]$ (TeX formula: x(0) = A = A[sen(φ_0)]) . this implies that $φ_{0} = π / 2$ (TeX formula: φ_0 = π/2) :

$x (t) = A [s e n (ω_{0} t + π / 2)]$ (TeX formula: x(t) = A[sen(ω_0t + π/2)] )

• complex polar notation

prove that $z = e^{i φ}$ $(TeX formula: z = e^{iφ})$ is the unit circle.

following Euler's formula, $e^{i φ} = c o s (φ) + i s i n (φ) \in (ℝ \to ℂ)$ $(TeX formula: e^{iφ} = cos(φ) + isin(φ) ∈ (ℝ→ℂ))$ . therefore $|\vec{z}| = \sqrt{e^{iφ}e^{-iφ}} = \sqrt{e^{iφ-iφ}} = \sqrt{e^0} = 1.$ $(TeX formula: |\vec{z}| = \sqrt{e^{iφ}e^{-iφ}} = \sqrt{e^{iφ-iφ}} = \sqrt{e^0} = 1.)$ ⁴

(Image "Figure 23") — Figure 23 - simple harmonic oscillating movement (complex polar notation) and its parameters (source)

solutions of the form $A e^{λ t}$ $(TeX formula: Ae^{λt})$ include the aforementioned trigonometric solutions when λ, A ∈ ℂ. so the #original differential equation can be rewritten as:

$λ^{2} A e^{λ t} + ω_{0}^{2} A e^{λ t} = 0 = (λ^{2} + ω_{0}^{2}) A e^{λ t}$ $(TeX formula: λ^2 A e^{λt} + ω_0^2 Ae^{λt} = 0 = (λ^2 + ω_0^2)Ae^{λt} )$

$⊢ λ^{2} + ω_{0}^{2} = 0$ (TeX formula: ⊢ λ^2 + ω_0^2 = 0 )

$⊢ λ = \sqrt{- ω_{0}^{2}} = \pm i ω_{0}$ $(TeX formula: ⊢ λ = \sqrt{-ω_0^2} = ±iω_0 )$

the general solution becomes:

$x (t) = A e^{i ω_{0} t} + B e^{- i ω_{0} t}$ $(TeX formula: x(t) = A e^{iω_0t} + B e^{-iω_0t} )$

if we restrict x(t) to the type ℝ→ℝ (position is real), it follows that x(t) equals its complex conjugate.

moreover, $\overline{x(t)} = \overline{A e^{iω_0t} + B e^{-iω_0t}} = \overline{A e^{iω_0t}} + \overline{B e^{-iω_0t}} = \overline{A} e^{-iω_0t} + \overline{B} e^{iω_0t}$ $(TeX formula: \overline{x(t)} = \overline{A e^{iω_0t} + B e^{-iω_0t}} = \overline{A e^{iω_0t}} + \overline{B e^{-iω_0t}} = \overline{A} e^{-iω_0t} + \overline{B} e^{iω_0t})$ . this implies that $A = \bar{B}$ $(TeX formula: A = \overline{B})$ and $B = \bar{A}$ $(TeX formula: B = \overline{A})$ . so:

$x (t) = (a + i b) [c o s (ω_{0} t) + i s i n (ω_{0} t)] + (a - i b) [c o s (ω_{0} t) - i s i n (ω_{0} t)]$ $(TeX formula: x(t) = (a+ib)\left[cos(ω_0t)+isin(ω_0t)\right] + (a-ib)\left[cos(ω_0t)-isin(ω_0t)\right] )$

$= 2 a c o s (ω t) - 2 b s i n (ω t) \in (ℝ \to ℝ)$ (TeX formula: = 2a cos(ωt) - 2bsin(ωt) ∈ (ℝ→ℝ) )

compare this expression against the one using trigonometric functions.

• energy loss

(also see #dissipative systems and attractors)

(Image "Figure 28") — Figure 28 - source

now we will damp the oscillator by adding a term for friction, whose force is proportional to velocity (but in opposite direction):

$F_{T} = \sum F$ (TeX formula: F_T = ∑F )

$\ddot{x (t)} = - ω_{0}^{2} x (t) - 2 β \dot{x (t)}$ $(TeX formula: \ddot{x(t)} = - ω_0^2 x(t) - 2β\dot{x(t)} )$

assume a family of solutions of the form $x (t) = A e^{λ t}$ $(TeX formula: x(t) = Ae^{λt})$ . therefore:

$λ^{2} A e^{λ t} + 2 β λ A e^{λ t} + ω_{0}^{2} A e^{λ t} = 0$ $(TeX formula: λ^2 Ae^{λt} + 2βλAe^{λt} + ω_0^2 Ae^{λt} = 0 )$

$A e^{λ t} (λ^{2} + 2 β λ + ω_{0}^{2}) = 0$ $(TeX formula: Ae^{λt} (λ^2 + 2βλ + ω_0^2) = 0 )$

$λ^{2} + 2 β λ = - ω_{0}^{2}$ (TeX formula: λ^2 + 2βλ = -ω_0^2 )

$λ^{2} + 2 β λ + β^{2} = - ω_{0}^{2} + β^{2} = {(λ + β)}^{2}$ (TeX formula: λ^2 + 2βλ + β^2 = -ω_0^2 + β^2 = (λ+β)^2 )

$λ = - β \pm i \sqrt{ω_{0}^{2} - β^{2}}$ $(TeX formula: λ = -β ± i\sqrt{ω_0^2 - β^2} )$

plug that back into the general solution:

$x (t) = A e^{(- β + i \sqrt{ω_{0}^{2} - β^{2}}) t} + B e^{(- β - i \sqrt{ω_{0}^{2} - β^{2}}) t}$ $(TeX formula: x(t) = Ae^{(-β+i\sqrt{ω_0^2 - β^2})t} + Be^{(-β-i\sqrt{ω_0^2 - β^2})t} )$

$= A e^{- β t} e^{i \sqrt{ω_{0}^{2} - β^{2}} t} + B e^{- β t} e^{- i \sqrt{ω_{0}^{2} - β^{2}} t}$ $(TeX formula: = Ae^{-βt}e^{i\sqrt{ω_0^2 - β^2}t} + Be^{-βt}e^{-i\sqrt{ω_0^2 - β^2}t} )$

$= e^{- β t} (A e^{i \sqrt{ω_{0}^{2} - β^{2}} t} + B e^{- i \sqrt{ω_{0}^{2} - β^{2}} t})$ $(TeX formula: = e^{-βt} \left( Ae^{i\sqrt{ω_0^2 - β^2}t} + Be^{-i\sqrt{ω_0^2 - β^2}t} \right) )$

asymptotically, the system will inevitably evolve to the most entropic dynamic state. let's show this on a case-by-case basis, taking note on the different (non)monotonic dynamics:

• underdamped: ω₀ > β

let $ω = \sqrt{ω_{0}^{2} - β^{2}}$ $(TeX formula: ω = \sqrt{ω_0^2 - β^2})$ :

$x (t) = e^{- β t} (A e^{i ω t} + B e^{- i ω t})$ $(TeX formula: x(t) = e^{-βt} \left( Ae^{iωt} + Be^{-iωt} \right) )$

by restricting x(t) to ℝ→ℝ, and from our previous knowledge of the simple harmonic oscillator solution:

$\bar{x (t)} = e^{- β t} D s i n (ω t + φ_{0})$ $(TeX formula: \overline{x(t)} = e^{-βt} Dsin(ωt+φ_0) )$

(Image "Figure 24") — Figure 24 - underdamped harmonic oscillating movement

• ω₀ = β

$λ_{1} = - β \pm i \sqrt{0} = - β = λ_{2}$ $(TeX formula: λ_1 = -β ± i\sqrt{0} = -β = λ_2 )$

we leverage the fact that amplitude isn't constant anymore:

$x (t) = A (t) e^{- β t}$ $(TeX formula: x(t) = A(t)e^{-βt} )$

$\dot{x} (t) = \dot{A} (t) e^{- β t} - A (t) β e^{- β t}$ $(TeX formula: \dot{x}(t) = \dot{A}(t)e^{-βt} - A(t)βe^{-βt} )$

$\ddot{x} (t) = \ddot{A} (t) e^{- β t} - \dot{A} (t) β e^{- β t} - \dot{A} (t) β e^{- β t} + A (t) β^{2} e^{- β t}$ $(TeX formula: \ddot{x}(t) = \ddot{A}(t)e^{-βt} - \dot{A}(t)βe^{-βt} - \dot{A}(t)βe^{-βt} + A(t)β^2e^{-βt})$

and the damped harmonic oscillator equation becomes:

$e^{- β t} [\ddot{A} - 2 \dot{A} β + A β^{2} + 2 β \dot{A} - 2 β^{2} A + β^{2} A] = 0$ $(TeX formula: e^{-βt}\left[ \ddot{A}-2\dot{A}β+Aβ^2+2β\dot{A}-2β^2A+β^2A \right] = 0 )$

$e^{- β t} [\ddot{A}] = 0$ $(TeX formula: e^{-βt}\left[ \ddot{A} \right] = 0 )$

$⊢ \ddot{A} (t) = 0$ $(TeX formula: ⊢ \ddot{A}(t) = 0 )$

$⊢ \ddot{A} (t) = 0 \leftrightarrow A (t) = a_{1} + a_{2} t$ $(TeX formula: ⊢ \ddot{A}(t) = 0 ↔ A(t) = a_1 + a_2t )$

i.e., amplitude is a linear function. so the solution must be of the form:

$x (t) = a_{1} e^{- β t} + a_{2} t e^{- β t}$ $(TeX formula: x(t) = a_1 e^{-βt} + a_2 te^{-βt} )$

graphically, the spring oscillates only once:

(Image "Figure 25") — Figure 25 - damped harmonic oscillating movement, single overshoot.

• overdamped: ω₀ < β

$λ = - β \pm i \sqrt{ω_{0}^{2} - β^{2}} = - β \pm i^{2} \sqrt{β^{2} - ω_{0}^{2}} \in ℝ$ $(TeX formula: λ = -β ± i\sqrt{ω_0^2 - β^2} = -β ± i^2\sqrt{β^2 - ω_0^2} ∈ ℝ )$

$x (t) = A e^{- β t} [A e^{ω t} + B e^{- ω t}]$ $(TeX formula: x(t) = Ae^{-βt} \left[ Ae^{ωt} + Be^{-ωt} \right] )$

because $β > ω$ (TeX formula: β>ω) , $A e^{ω t}$ $(TeX formula: Ae^{ωt})$ is overshadowed by $B e^{- ω t}$ $(TeX formula: Be^{-ωt})$ . the plot never gets to complete an oscillation:

(Image "Figure 26") — Figure 26 - overdamped harmonic oscillating movement

• energy input

the damped model will get augmented with the influence of external stimuli, driving our cognitive-unit model to activity. this is known as a driven harmonic oscillator. let's go with an engine rotating at frequency $\tilde{ω}$ $(TeX formula: \tilde{ω})$ and amplitude F, which in turn pushes and pulls the rest of the system:

$\ddot{x (t)} + 2 β \dot{x (t)} + ω_{0}^{2} x (t) = F c o s (\tilde{ω} t)$ $(TeX formula: \ddot{x(t)} + 2β\dot{x(t)} + ω_0^2 x(t) = Fcos(\tilde{ω}t) )$

the equation isn't homogeneous anymore. the solution is the sum of the general and particular solutions: the homogeneous one is of the type $e^{- β t}$ $(TeX formula: e^{-βt})$ and eventually dissipates. the particular one oscillates forever:

$x_{p} (t) = A e^{i \tilde{ω} t}$ $(TeX formula: x_p(t) = Ae^{i\tilde{ω}t} )$

• resonance

(Image "Figure 30") — Figure 30 - source

starting from an exploration of the steady state ("final conditions"), we will show that amplitude is a function of the engine's frequency, and that there are privileged frequencies for which steady-state amplitude forms maxima or singularities: resonance frequencies.

$\dot{x_{p} (t)} = i \tilde{ω} A e^{i \tilde{ω} t}$ $(TeX formula: \dot{x_p(t)} = i\tilde{ω}Ae^{i\tilde{ω}t} )$

$\ddot{x_{p} (t)} = - {\tilde{ω}}^{2} A e^{i \tilde{ω} t}$ $(TeX formula: \ddot{x_p(t)} = -\tilde{ω}^2 Ae^{i\tilde{ω}t} )$

so we rewrite the full differential equation:

$[- {\tilde{ω}}^{2} A + 2 β i A \tilde{ω} + ω_{0}^{2} A] e^{i \tilde{ω} t} = F e^{i \tilde{ω} t}$ $(TeX formula: \left[ -\tilde{ω}^2A + 2βiA\tilde{ω} + ω_0^2A \right] e^{i\tilde{ω}t} = Fe^{i\tilde{ω}t} )$

$[ω_{0}^{2} - {\tilde{ω}}^{2} + 2 β i \tilde{ω}] A = F$ $(TeX formula: \left[ ω_0^2 - \tilde{ω}^2 + 2βi\tilde{ω} \right] A = F )$

$A (\tilde{ω}) = \frac{F}{[ω_{0}^{2} - {\tilde{ω}}^{2} + i 2 β \tilde{ω}]}$ $(TeX formula: A(\tilde{ω}) = \frac{F}{\left[ ω_0^2 - \tilde{ω}^2 + i2β\tilde{ω} \right]} )$

$= \frac{F}{[(ω_{0}^{2} - {\tilde{ω}}^{2}) + i 2 β \tilde{ω}]} \frac{[(ω_{0}^{2} - {\tilde{ω}}^{2}) - i 2 β \tilde{ω}]}{[(ω_{0}^{2} - {\tilde{ω}}^{2}) - i 2 β \tilde{ω}]}$ $(TeX formula: = \frac{F}{\left[ (ω_0^2 - \tilde{ω}^2) + i2β\tilde{ω} \right]} \frac{\left[ (ω_0^2 - \tilde{ω}^2) - i2β\tilde{ω} \right]}{\left[ (ω_0^2 - \tilde{ω}^2) - i2β\tilde{ω} \right]} )$

$= \frac{F [(ω_{0}^{2} - {\tilde{ω}}^{2}) - 2 β i \tilde{ω}]}{[{(ω_{0}^{2} - {\tilde{ω}}^{2})}^{2} + {(2 β \tilde{ω})}^{2}]}$ $(TeX formula: = \frac{F \left[ (ω_0^2 - \tilde{ω}^2) - 2βi\tilde{ω} \right]}{\left[ (ω_0^2 - \tilde{ω}^2)^2 + (2β\tilde{ω})^2 \right]} )$

(...magic happens here...)

$| A (\tilde{ω}) | = \frac{F}{\sqrt{{(ω_{0}^{2} - {\tilde{ω}}^{2})}^{2} + {(2 β \tilde{ω})}^{2}}}$ $(TeX formula: |A(\tilde{ω})| = \frac{F}{\sqrt{(ω_0^2 - \tilde{ω}^2)^2 + (2β\tilde{ω})^2}} )$

(Image "Figure 27") — Figure 27 - steady-state amplitude as a function of energy-source frequency

• phase space

phase space contains all information of a dynamical system, with a different axis for each variable of interest. each point in it represents a state of the whole system, and a trajectory in this virtual space corresponds to the evolution of the system.

signal: disregarding noise, taking a series of measurements is equivalent to observing variation in a single dimension of phase space. graphically, this is a projection of the system upon some vector in phase space.

(Image "Figure 31") — Figure 31 - Simple harmonic oscillator (in real space) and associated quantities (source)

(Image "Figure 5") — Figure 5 - Black dot: state trajectory in a phase space with two variables (position and momentum, for instance). Red and blue: measurements of such variables, or of two mutually independent linear combinations of them thereof. (source)

• Fourier analysis

preliminaries:

some properties of all inner products (AKA dot product):

they map the vector space to its own field: ℝⁿ·ℝⁿ → ℝ.
they follow bilinear form (linear on both arguments): B(𝒖+𝒗, 𝒘) = B(𝒖, 𝒘) + B(𝒗, 𝒘) and B(k𝒖, 𝒗) = kB(𝒖, 𝒗).

the basic idea is to create a linear combination of basis functions which span a whole space of functions. the coefficients of the superposition will come from measuring the strength of each component, with a mere projection of the function to be represented upon that component.

now consider the vector space of square-integrable functions (so that convergence is guaranteed for the product / projection):

$M = {f (x) : [a, b] \to ℝ | \int_{a}^{b} | f (x) |^{2} d x = c < \infty}$ $(TeX formula: M = \{ f(x): [a, b] → ℝ \; | \; ∫_a^b |f(x)|^2 dx = c < ∞ \} )$

• linear independence of Fourier terms

we will show that complex oscillators are an adequate basis.

let $v_{n} (x) = \frac{1}{\sqrt{2 a}} e^{i \frac{n π}{a} x}$ $(TeX formula: v_n(x) = \frac{1}{\sqrt{2a}} e^{i \frac{nπ}{a}x})$ with $n = 0, \pm 1, \pm 2, . . .$ be functions on the interval $[- a, a]$ $(TeX formula: \left[−a, a\right])$ . prove that $v_{n} \cdot v_{m} = δ_{n m}$ $(TeX formula: v_n · v_m = δ_{nm})$ , given the proper definition of the dot product. $δ_{n m} = {\begin{matrix} 1 & \Leftrightarrow n = m \\ 0 & \Leftrightarrow n \neq m \end{matrix}$ $(TeX formula: δ_{nm} = \left\{ \begin{array}{cc} 1 & ⇔ n=m \\ 0 & ⇔ n≠m \\ \end{array} \right.)$ is Dirac's delta distribution.

let $V_{n} \cdot V_{m} = \int_{- a}^{a} d x V_{n}^{⋆} (x) V_{m} (x)$ $(TeX formula: V_n·V_m = ∫_{-a}^{a} dx\; V_n^⋆(x) V_m(x))$ be the scalar product for functions of the vector basis defined by $V_{n} (x)$ (TeX formula: V_n(x)) .

proof by modus ponens follows:

restatement of conditional premise

given the above definitions; if $V_{n} \cdot V_{m} = δ_{n m}$ $(TeX formula: V_n·V_m = δ_{nm})$ , then $V_{n} (x)$ (TeX formula: V_n(x)) is an orthogonal basis. that is:

$\forall n \forall m ((V_{n} \cdot V_{n} = 1) \land (V_{n} \cdot V_{m} = 0; n \neq m) \Rightarrow V_{n} (x) is orthogonal)$ $(TeX formula: ∀n∀m \; ((V_n·V_n = 1) ∧ (V_n·V_m = 0; n≠m) ⇒ V_n(x) \text{ is orthogonal}) )$

proof of the antecedent premise

indeed: $\forall n \forall m ((V_{n} \cdot V_{n} = 1) \land (V_{n} \cdot V_{m} = 0))$ $(TeX formula: ∀n∀m \; ((V_n·V_n = 1) ∧ (V_n·V_m = 0)))$

because:

$⊢ (\int_{- a}^{a} d x V_{n}^{⋆} (x) V_{n} (x) = 1) \land (\int_{- a}^{a} d x V_{n}^{⋆} (x) V_{m} (x) = 0)$ $(TeX formula: ⊢ \left( ∫_{-a}^{a} dx\; V_n^⋆(x) V_n(x) = 1 \right) ∧ \left( ∫_{-a}^{a} dx\; V_n^⋆(x) V_m(x) = 0 \right) )$

$⊢ (\int_{- a}^{a} d x \frac{1}{\sqrt{2 a}} e^{i n π x / a} \frac{1}{\sqrt{2 a}} e^{- i n π x / a} = 1) \land (\int_{- a}^{a} d x \frac{1}{\sqrt{2 a}} e^{i n π x / a} \frac{1}{\sqrt{2 a}} e^{- i m π x / a} = 0)$ $(TeX formula: ⊢ \left( ∫_{-a}^{a} dx\; \frac{1}{\sqrt{2a}}e^{inπx/a} \frac{1}{\sqrt{2a}}e^{-inπx/a} = 1 \right) ∧ \left( ∫_{-a}^{a} dx\; \frac{1}{\sqrt{2a}}e^{inπx/a} \frac{1}{\sqrt{2a}}e^{-imπx/a} = 0 \right) )$

$⊢ (\frac{1}{2 a} \int_{- a}^{a} d x e^{i n π x / a} e^{- i n π x / a} = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x e^{i n π x / a} e^{- i m π x / a} = 0)$ $(TeX formula: ⊢ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; e^{inπx/a} e^{-inπx/a} = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; e^{inπx/a} e^{-imπx/a} = 0 \right) )$

$⊢ (\frac{1}{2 a} \int_{- a}^{a} d x e^{(i n π x / a) - (i n π x / a)} = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x e^{(i n π x / a) - (i m π x / a)} = 0)$ $(TeX formula: ⊢ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; e^{(inπx/a)-(inπx/a)} = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; e^{(inπx/a)-(imπx/a)} = 0 \right) )$

$⊢ (\frac{1}{2 a} \int_{- a}^{a} d x e^{0} = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x e^{i π x (n - m) / a} = 0)$ $(TeX formula: ⊢ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; e^0 = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; e^{iπx(n-m)/a} = 0 \right) )$

from Euler's formula:

$⊢ (\frac{1}{2 a} \int_{- a}^{a} d x = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x (c o s (\frac{π x (n - m)}{a}) + i s e n (\frac{π x (n - m)}{a})) = 0)$ $(TeX formula: ⊢ \left( \frac{1}{2a} ∫_{-a}^{a} dx = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; (cos(\frac{πx(n-m)}{a}) + i\; sen(\frac{πx(n-m)}{a})) = 0 \right) )$

$⊢ ({\frac{x}{2 a} |}_{- a}^{a} = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x (c o s (\frac{π x (n - m)}{a}) + i s e n (\frac{π x (n - m)}{a})) = 0)$ $(TeX formula: ⊢ \left( \left. \frac{x}{2a} \right|_{-a}^{a} = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; (cos(\frac{πx(n-m)}{a}) + i\; sen(\frac{πx(n-m)}{a})) = 0 \right) )$

from the integration of the odd function $s e n (\frac{π x (n - m)}{a})$ $(TeX formula: sen(\frac{πx(n-m)}{a}))$ over a zero-centered interval, $[- a, a]$ (TeX formula: [-a, a]) :

$⊢ (\frac{a}{2 a} - \frac{- a}{2 a} = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x (c o s (\frac{π x (n - m)}{a}) + 0) = 0)$ $(TeX formula: ⊢ \left( \frac{a}{2a} - \frac{-a}{2a} = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; (cos(\frac{πx(n-m)}{a}) + 0) = 0 \right) )$

$⊢ (1 = 1) \land (\frac{1}{2 a} \int_{- a}^{a} d x c o s (\frac{π x (n - m)}{a}) = 0)$ $(TeX formula: ⊢ \left( 1 = 1 \right) ∧ \left( \frac{1}{2a} ∫_{-a}^{a} dx\; cos(\frac{πx(n-m)}{a}) = 0 \right) )$

let $u = \frac{π x (n - m)}{a}$ $(TeX formula: u = \frac{πx(n-m)}{a})$ ; then $\frac{d u}{d x} = \frac{π (m - n)}{a}$ $(TeX formula: \frac{du}{dx} = \frac{π(m-n)}{a})$ .

$⊢ ⊤ \land (\frac{1}{2 a} \int_{- a}^{a} c o s (u) \frac{π (n - m)}{a} d x \frac{a}{π (n - m)} = 0)$ $(TeX formula: ⊢ ⊤ ∧ \left( \frac{1}{2a} ∫_{-a}^{a} cos(u)\frac{π(n-m)}{a} \; dx\; \frac{a}{π(n-m)} = 0 \right) )$

$⊢ ⊤ \land (\frac{a}{2 a π (n - m)} \int_{- a}^{a} c o s (u) d u = 0)$ $(TeX formula: ⊢ ⊤ ∧ \left( \frac{a}{2aπ(n-m)} ∫_{-a}^{a} cos(u) \; du = 0 \right) )$

$⊢ ⊤ \land (\frac{a}{2 a π (n - m)} {[s e n (u)]}_{- a}^{a} = 0)$ $(TeX formula: ⊢ ⊤ ∧ \left( \frac{a}{2aπ(n-m)} \left[ sen(u) \right]_{-a}^{a} = 0 \right) )$

$⊢ ⊤ \land (\frac{a}{2 a π (n - m)} [s e n (\frac{π (a) (n - m)}{a}) - s e n (\frac{π (- a) (n - m)}{a})] = 0)$ $(TeX formula: ⊢ ⊤ ∧ \left( \frac{a}{2aπ(n-m)} \left[ sen(\frac{π(a)(n-m)}{a}) - sen(\frac{π(-a)(n-m)}{a}) \right] = 0 \right) )$

note that $(n - m) \in ℤ$ (TeX formula: (n-m) ∈ ℤ) . $⊢ \forall n \forall m (s e n (π (n - m)) = 0)$ $(TeX formula: ⊢ ∀n∀m \; \left( sen(π(n-m)) = 0 \right))$

$⊢ ⊤ \land (\frac{a}{2 a π (n - m)} [0 - 0] = 0)$ $(TeX formula: ⊢ ⊤ ∧ \left( \frac{a}{2aπ(n-m)} \left[0 - 0 \right] = 0 \right) )$

$⊢ ⊤ \land ⊤$ (TeX formula: ⊢ ⊤ ∧ ⊤ )

conclusion

$⊢ V_{n} (x)$ (TeX formula: ⊢ V_n(x)) is an orthogonal basis.

• Fourier transform

the Fourier series represents a periodic function as a discrete linear combination of sines and cosines . on the other hand, the Fourier transform is capable of representing a more general set of functions, not restricted to being periodic. how is that achieved?

(for convenience, we will start from the series in complex exponential notation)

$f (x) - f_{0} = \sum_{n = 1}^{\infty} [a_{n} c o s (\frac{n π x}{a}) + b_{n} s e n (\frac{n π x}{a})]$ $(TeX formula: f(x) -f_0 = ∑_{n=1}^∞ \left[ a_n cos\left( \frac{nπx}{a} \right) + b_n sen\left( \frac{nπx}{a} \right) \right] )$

$= \sum_{- \infty}^{\infty} \underset{α_{n}}{\underset{⏟}{(\frac{1}{2 a} \int_{- a}^{a} f (x) e^{- i n π x / a} d x)}} e^{i n π x / a}$ $(TeX formula: = ∑_{-∞}^∞ \underbrace{ \left( \frac{1}{2a} ∫_{-a}^{a} f(x)e^{-inπx/a} \;dx \right) }_{α_n} e^{inπx/a} )$

if $f (x)$ (TeX formula: f(x)) weren't periodic, the series would be representing it only at $[- a, a]$ (TeX formula: [-a, a]) . nonetheless, we can make the interval arbitrarily large:

$= lim_{a \to \infty} \sum_{- \infty}^{\infty} \underset{α_{n}}{\underset{⏟}{(\frac{1}{2 a} \int_{- a}^{a} f (x) e^{- i n π x / a} d x)}} e^{i n π x / a}$ $(TeX formula: = \lim_{a → ∞} ∑_{-∞}^∞ \underbrace{ \left( \frac{1}{2a} ∫_{-a}^{a} f(x)e^{-inπx/a} \;dx \right) }_{α_n} e^{inπx/a} )$

let $k_{n} = \frac{n π}{a}$ $(TeX formula: k_n = \frac{nπ}{a})$ , then $Δ k = \frac{π}{a}$ $(TeX formula: Δk = \frac{π}{a})$ . substituting in the last expression yields:

$= lim_{a \to \infty} \sum_{- \infty}^{\infty} Δ k {(Δ k)}^{- 1} (\frac{1}{2 a}) (\int_{- a}^{a} f (x) e^{- i k_{n} x} d x) (e^{i k_{n} x})$ $(TeX formula: = \lim_{a → ∞} ∑_{-∞}^∞ Δk \left( Δk \right)^{-1} \left( \frac{1}{2a} \right) \left( ∫_{-a}^{a} f(x)e^{-ik_nx} \;dx \right) \left( e^{ik_nx} \right) )$

$= lim_{a \to \infty} \sum_{- \infty}^{\infty} (\frac{π}{a}) (\frac{a}{2 a π}) (\int_{- a}^{a} f (x) e^{- i k_{n} x} d x) (e^{i k_{n} x})$ $(TeX formula: = \lim_{a → ∞} ∑_{-∞}^∞ \left( \frac{π}{a} \right) \left( \frac{a}{2aπ} \right) \left( ∫_{-a}^{a} f(x)e^{-ik_nx} \;dx \right) \left( e^{ik_nx} \right) )$

$= lim_{Δ k \to 0} \frac{1}{2 π} \sum_{- \infty}^{\infty} Δ k (\int_{- \infty}^{\infty} f (x) e^{- i k_{n} x} d x) (e^{i k_{n} x})$ $(TeX formula: = \lim_{Δk → 0} \;\frac{1}{2π} ∑_{-∞}^∞ Δk \left( ∫_{-∞}^{∞} f(x)e^{-ik_nx} \;dx \right) \left( e^{ik_nx} \right) )$

which is a Riemman sum for variable $k_{n}$ (TeX formula: k_n) :

$= \frac{1}{2 π} \int_{- \infty}^{\infty} d k_{n} (\int_{- \infty}^{\infty} f (x) e^{- i k_{n} x} d x) (e^{i k_{n} x})$ $(TeX formula: = \frac{1}{2π} ∫_{-∞}^∞ dk_n\; \left( ∫_{-∞}^{∞} f(x)e^{-ik_nx} \;dx \right) \left( e^{ik_nx} \right) )$

$= {(\sqrt{\frac{1}{2 π}})}^{2} \int_{- \infty}^{\infty} (\int_{- \infty}^{\infty} f (x) e^{- i k_{n} x} d x) e^{i k_{n} x} d k_{n}$ $(TeX formula: = \left(\sqrt{\frac{1}{2π}}\right)^2 ∫_{-∞}^∞ \left( ∫_{-∞}^{∞} f(x)e^{-ik_nx} \;dx \right) e^{ik_nx} \;dk_n )$

$= \underset{Inverse Fourier transform}{\underset{⏟}{\frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} \underset{Fourier transform}{\underset{⏟}{(\frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} f (x) e^{- i k_{n} x} d x)}} e^{i k_{n} x} d k_{n}}}$ $(TeX formula: = \underbrace{ \frac{1}{\sqrt{2π}} ∫_{-∞}^∞ \underbrace{ \left( \frac{1}{\sqrt{2π}} ∫_{-∞}^{∞} f(x)e^{-ik_nx} \;dx \right) }_{\text{Fourier transform}} e^{ik_nx} \;dk_n }_{\text{Inverse Fourier transform}} )$

• properties

which symmetry properties does H(f) has if h(t) is:

odd real

odd imaginary

odd complex

and which symmetry properties does h(t) has if H(f ) is:

odd real

even complex

even imaginary

recall that:

$H (f) = \int_{- \infty}^{\infty} [r (t) + i j (t)] [c o s (2 π f t) + i s e n (2 π f t)] d t$ $(TeX formula: H(f) = ∫_{-∞}^∞ [r(t)+ij(t)][cos(2πft) + isen(2πft)]dt )$

$= \int_{- \infty}^{\infty} [r (t) c o s (2 π f t) + j (t) s e n (2 π f t)] d t + i \int_{- \infty}^{\infty} [j (t) c o s (2 π f t) - r (t) s e n (2 π f t)] d t;$ $(TeX formula: = ∫_{-∞}^∞ [r(t)cos(2πft)+j(t)sen(2πft)]dt + i∫_{-∞}^∞ [j(t)cos(2πft)-r(t)sen(2πft)]dt; )$

$h (t) = \int_{- \infty}^{\infty} [R (f) + i J (f)] [c o s (2 π f t) + i s e n (2 π f t)] d t$ $(TeX formula: h(t) = ∫_{-∞}^∞ [R(f)+iJ(f)][cos(2πft) + isen(2πft)]dt )$

$= \int_{- \infty}^{\infty} [R (f) c o s (2 π f t) + J (f) s e n (2 π f t)] d t + i \int_{- \infty}^{\infty} [J (f) c o s (2 π f t) - R (f) s e n (2 π f t)] d t .$ $(TeX formula: = ∫_{-∞}^∞ [R(f)cos(2πft)+J(f)sen(2πft)]dt + i∫_{-∞}^∞ [J(f)cos(2πft)-R(f)sen(2πft)]dt. )$

odd real h(t)

we get rid of the imaginary part of H(f):

$H (f) = \int_{- \infty}^{\infty} [r (t) c o s (2 π f t) + 0)] d t + i \int_{- \infty}^{\infty} [0 - r (t) s e n (2 π f t)] d t$ $(TeX formula: H(f) = ∫_{-∞}^∞ [r(t)cos(2πft)+{0})]dt + i∫_{-∞}^∞ [{0}-r(t)sen(2πft)]dt )$

from integration of an odd function (r(t)cos(2πft)) at a symmetric interval:

$H (f) = 0 + i \int_{- \infty}^{\infty} [r (t) s e n (2 π f t)] d t$ $(TeX formula: H(f) = {0} + i∫_{-∞}^∞ [r(t)sen(2πft)]dt )$

$s e n (2 π f t)$ (TeX formula: sen(2πft)) is odd. $∴ H (f)$ (TeX formula: ∴ { H(f)}) is odd and imaginary.

odd imaginary h(t)

same reasoning:

$H (f) = \int_{- \infty}^{\infty} [0 + j (t) s e n (2 π f t)] d t + i \int_{- \infty}^{\infty} [j (t) c o s (2 π f t) - 0] d t$ $(TeX formula: H(f) = ∫_{-∞}^∞ [{0}+j(t)sen(2πft)]dt + i∫_{-∞}^∞ [j(t)cos(2πft)-{0}]dt )$

$H (f) = \int_{- \infty}^{\infty} [j (t) s e n (2 π f t)] d t + 0$ $(TeX formula: H(f) = ∫_{-∞}^∞ [j(t)sen(2πft)]dt + {0} )$

$s e n (2 π f t)$ (TeX formula: sen(2πft)) is odd. $∴ H (f)$ (TeX formula: ∴ {H(f)}) is odd and real.

odd h(t)

$H (f) = \int_{- \infty}^{\infty} [0 + j (t) s e n (2 π f t)] d t + i \int_{- \infty}^{\infty} [0 - r (t) s e n (2 π f t)] d t$ $(TeX formula: H(f) = ∫_{-∞}^∞ [{0}+j(t)sen(2πft)]dt + i∫_{-∞}^∞ [{0}-r(t)sen(2πft)]dt )$

$s e n (2 π f t)$ (TeX formula: sen(2πft)) is odd. $∴ H (f)$ (TeX formula: ∴ {H(f)}) is odd and complex.

odd real H(f)

$h (t) = \int_{- \infty}^{\infty} [R (f) c o s (2 π f t) + 0] d t + i \int_{- \infty}^{\infty} [0 - R (f) s e n (2 π f t)] d t$ $(TeX formula: h(t) = ∫_{-∞}^∞ [R(f)cos(2πft)+{0}]dt + i∫_{-∞}^∞ [{0}-R(f)sen(2πft)]dt )$

$h (t) = 0 + i \int_{- \infty}^{\infty} R (f) s e n (2 π f t) d t$ $(TeX formula: h(t) = {0} + i∫_{-∞}^∞ R(f)sen(2πft)dt )$

$s e n (2 π f t)$ (TeX formula: sen(2πft)) is odd. $∴ h (t)$ (TeX formula: ∴ {h(t)}) is imaginary and odd.

complex even H(f)

$h (t) = \int_{- \infty}^{\infty} [R (f) c o s (2 π f t) + 0] d t + i \int_{- \infty}^{\infty} [J (f) c o s (2 π f t) - 0] d t$ $(TeX formula: h(t) = ∫_{-∞}^∞ [R(f)cos(2πft)+{0}]dt + i∫_{-∞}^∞ [J(f)cos(2πft)-{0}]dt )$

$c o s (2 π f t)$ (TeX formula: cos(2πft)) is even. $∴ h (t)$ (TeX formula: ∴ {h(t)}) is even and complex.

imaginary even H(f)

$h (t) = \int_{- \infty}^{\infty} [0 + J (f) s e n (2 π f t)] d t + i \int_{- \infty}^{\infty} [J (f) c o s (2 π f t) - 0] d t$ $(TeX formula: h(t) = ∫_{-∞}^∞ [{0}+J(f)sen(2πft)]dt + i∫_{-∞}^∞ [J(f)cos(2πft)-{0}]dt )$

$h (t) = 0 + i \int_{- \infty}^{\infty} J (f) c o s (2 π f t) d t$ $(TeX formula: h(t) = {0} + i∫_{-∞}^∞ J(f)cos(2πft)dt )$

$c o s (2 π f t)$ (TeX formula: cos(2πft)) is even. $∴ h (t)$ (TeX formula: ∴ {h(t)}) is even and imaginary.

• uncertainty principle

the more short-lived the wave in the time domain, the more widespread its power spectrum (a.k.a. frequency spectrum). vice versa.

• convolution theorem

$y (t) = x (t) * h (t) = \int_{- \infty}^{\infty} x (τ) h (t - τ) d τ$ $(TeX formula: y(t) = x(t)∗h(t) = ∫_{-∞}^∞ x(τ)h(t-τ)dτ)$ is the convolution of functions x(t) y h(t). F[y(t)] denotes its Fourier transform:

$F [y (t)] = \int_{- \infty}^{\infty} y (t) e^{- 2 π i f t} d t$ $(TeX formula: F[y(t)] = ∫_{-∞}^∞ y(t)e^{-2πift}dt )$

$= \int_{- \infty}^{\infty} [\int_{- \infty}^{\infty} x (τ) h (t - τ) d τ] e^{- 2 π i f t} d t$ $(TeX formula: = ∫_{-∞}^∞ \left[ ∫_{-∞}^∞ x(τ)h(t-τ)dτ \right] e^{-2πift}dt )$

$= \int_{- \infty}^{\infty} x (τ) [\int_{- \infty}^{\infty} h (t - τ) e^{- 2 π i f t} d t] d τ$ $(TeX formula: = ∫_{-∞}^∞ x(τ) \left[ ∫_{-∞}^∞ h(t-τ) e^{-2πift}dt \right]dτ )$

let u=t-τ.

$= \int_{- \infty}^{\infty} x (τ) [\int_{- \infty}^{\infty} h (u) e^{- 2 π i f (u + τ)} d u] d τ$ $(TeX formula: = ∫_{-∞}^∞ x(τ) \left[ ∫_{-∞}^∞ h(u)e^{-2πif(u+τ)}du \right]dτ )$

$= \int_{- \infty}^{\infty} x (τ) [\int_{- \infty}^{\infty} h (u) e^{- 2 π i f u} e^{- 2 π i f τ} d u] d τ$ $(TeX formula: = ∫_{-∞}^∞ x(τ) \left[ ∫_{-∞}^∞ h(u)e^{-2πifu}e^{-2πifτ}du \right]dτ )$

$= [\int_{- \infty}^{\infty} x (τ) e^{- 2 π i f τ} d τ] [\int_{- \infty}^{\infty} h (u) e^{- 2 π i f u} d u]$ $(TeX formula: = \left[ ∫_{-∞}^∞ x(τ) e^{-2πifτ}dτ \right] \left[ ∫_{-∞}^∞ h(u)e^{-2πifu}du \right] )$

$= X (f) H (f)$ (TeX formula: = X(f)H(f))

therefore:

$x (t) * h (t) = F^{- 1} [X (f) H (f)]$ $(TeX formula: x(t)∗h(t) = F^{-1} \left[ X(f)H(f) \right] )$

time-domain convolution is equivalent to frequency-domain product!

• cross-correlation theorem

$z_{x, h} (t) = \int_{- \infty}^{\infty} x (τ) h (t + τ) d τ$ $(TeX formula: z_{x,h}(t) = ∫_{-∞}^∞ x(τ)h(t+τ)dτ)$ is the correlation (covariance, actually) of functions x(t) y h(t) as a function of a time lag between them. F[z(t)] denotes its Fourier transform:

$F [z (t)] = \int_{- \infty}^{\infty} z (t) e^{- 2 π i f t} d t$ $(TeX formula: F[z(t)] = ∫_{-∞}^∞ z(t)e^{-2πift}dt )$

$= \int_{- \infty}^{\infty} [\int_{- \infty}^{\infty} x (τ) h (t + τ) d τ] e^{- 2 π i f t} d t$ $(TeX formula: = ∫_{-∞}^∞ \left[ ∫_{-∞}^∞ x(τ)h(t+τ)dτ \right] e^{-2πift}dt )$

$= \int_{- \infty}^{\infty} x (τ) [\int_{- \infty}^{\infty} h (t + τ) e^{- 2 π i f t} d t] d τ$ $(TeX formula: = ∫_{-∞}^∞ x(τ) \left[ ∫_{-∞}^∞ h(t+τ) e^{-2πift}dt \right]dτ )$

let u=t+τ.

$= \int_{- \infty}^{\infty} x (τ) [\int_{- \infty}^{\infty} h (u) e^{- 2 π i f (u - τ)} d u] d τ$ $(TeX formula: = ∫_{-∞}^∞ x(τ) \left[ ∫_{-∞}^∞ h(u)e^{-2πif(u-τ)}du \right]dτ )$

$= \int_{- \infty}^{\infty} x (τ) [\int_{- \infty}^{\infty} h (u) e^{- 2 π i f u} e^{2 π i f τ} d u] d τ$ $(TeX formula: = ∫_{-∞}^∞ x(τ) \left[ ∫_{-∞}^∞ h(u)e^{-2πifu}e^{2πifτ}du \right]dτ )$

$= [\int_{- \infty}^{\infty} x (τ) e^{2 π i f τ} d τ] [\int_{- \infty}^{\infty} h (u) e^{- 2 π i f u} d u]$ $(TeX formula: = \left[ ∫_{-∞}^∞ x(τ) e^{2πifτ}dτ \right] \left[ ∫_{-∞}^∞ h(u)e^{-2πifu}du \right] )$

$= X^{*} (f) H (f)$ (TeX formula: = X^*(f)H(f))

therefore:

$z_{x, h} = F^{- 1} [X^{*} (f) H (f)]$ $(TeX formula: z_{x,h} = F^{-1} \left[ X^*(f)H(f) \right] )$

cross-correlation is equivalent to frequency domain product (with conjugate X(f)).

• discrete Fourier transform

the computation of a DFT is done segment by segment. the signal is multiplied by some "window" function (e.g. a square pulse) which is zero outside the window. this is known as windowing.

the effect of using a finite window is that unexisting frequencies appear at the power spectrum (around the original frequencies). this phenomenon is known as leakage. the explanation is evident from the #convolution theorem: one can use similar reasoning to show that $h (t) x (t) = F^{- 1} [H (f) * X (f)]$ $(TeX formula: h(t)x(t) = F^{-1} \left[ H(f)∗X(f) \right])$ . therefore, multiplication by a window will induce a convolution in the frequency domain.

low frequencies (and therefore #non-stationary systems) are the most problematic to compute a DFT for, since the window may not be long enough to capture them. there's no single #test for stationarity. it can be tested for nonetheless, measuring the effect of varying window sizes on different parameters (centrality and spread statistics, Lyapunov exponent, etc.).

• Whittaker–Shannon (sinc) interpolation

what function should we convolve the signal's Fourier transform with in order to obtain the same result as windowing it with a rectangular function? we will analyze the closely-related case of a square pulse window:

calculate the first three terms of the Fourier series for function $f (x) = {\begin{matrix} 1 & \Leftrightarrow 2 n π \leq x \leq (2 n + 1) π \\ - 1 & \Leftrightarrow (2 n + 1) π \leq x \leq (2 n + 2) π \end{matrix}$ $(TeX formula: f(x) = \left\{ \begin{array}{cc} 1 & ⇔ 2nπ ≤ x ≤ (2n+1)π \\ -1 & ⇔ (2n+1)π ≤ x ≤ (2n+2)π \\ \end{array} \right.)$ , with $n = 0, \pm 1, \pm 2, . . .$

in terms of the Fourier series:

$f (x) = \underset{f_{0}}{\underset{⏟}{\frac{1}{2 a} \int_{- a}^{a} f (x) d x}} + \sum_{n = 1}^{\infty} [\underset{a_{n}}{\underset{⏟}{(\frac{1}{a} \int_{- a}^{a} d x f (x) c o s (\frac{n π x}{a}))}} c o s (\frac{n π x}{a}) + \underset{b_{n}}{\underset{⏟}{(\frac{1}{a} \int_{- a}^{a} d x f (x) s e n (\frac{n π x}{a}))}} s e n (\frac{n π x}{a})]$ $(TeX formula: f(x) = \underbrace{ \frac{1}{2a}∫_{-a}^{a}f(x)dx}_{f_0} + ∑_{n=1}^∞ \left[ \underbrace{ \left( \frac{1}{a} ∫_{-a}^{a}dx\; f(x)cos\left( \frac{nπx}{a} \right) \right)}_{a_n} cos\left( \frac{nπx}{a} \right) + \underbrace{ \left( \frac{1}{a} ∫_{-a}^{a}dx\; f(x)sen\left( \frac{nπx}{a} \right) \right)}_{b_n} sen\left( \frac{nπx}{a} \right) \right] )$

from the integration of odd functions ( $f (x) c o s (\frac{n π x}{a})$ $(TeX formula: f(x)cos\left(\frac{nπx}{a}\right))$ and $f (x)$ (TeX formula: f(x)) ) at symmetrical interval ( $[- a, a]$ (TeX formula: [-a, a]) ):

$⊢ f (x) = \underset{f_{0}}{\underset{⏟}{0}} + \sum_{n = 1}^{\infty} [\underset{a_{n}}{\underset{⏟}{(0)}} (c o s (\frac{n π x}{a})) + \underset{b_{n}}{\underset{⏟}{(\frac{1}{a} \int_{- a}^{a} d x f (x) s e n (\frac{n π x}{a}))}} s e n (\frac{n π x}{a})]$ $(TeX formula: ⊢ f(x) = \underbrace{0}_{f_0} + ∑_{n=1}^∞ \left[ \underbrace{\left(0\right)}_{a_n} \left( cos\left( \frac{nπx}{a} \right)\right) + \underbrace{ \left( \frac{1}{a} ∫_{-a}^{a}dx\; f(x)sen\left( \frac{nπx}{a} \right) \right)}_{b_n} sen\left( \frac{nπx}{a} \right) \right] )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{(\frac{1}{a} \int_{- a}^{a} d x f (x) s e n (\frac{n π x}{a}))}} s e n (\frac{n π x}{a})$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \left( \frac{1}{a} ∫_{-a}^{a}dx\; f(x)sen\left( \frac{nπx}{a} \right) \right)}_{b_n} sen\left( \frac{nπx}{a} \right) )$

we will consider the interval $[- a, a] = [- π, π]$ (TeX formula: [-a, a] = [-π, π]) , which includes a full period of $f (x) s e n (n π x / π)$ (TeX formula: f(x)sen(nπx/π)) for any n. therefore:

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{(\frac{1}{π} \int_{- π}^{π} d x f (x) s e n (\frac{n π x}{π}))}} s e n (\frac{n π x}{π})$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \left( \frac{1}{π} ∫_{-π}^{π}dx\; f(x)sen\left( \frac{nπx}{π} \right) \right)}_{b_n} sen\left( \frac{nπx}{π} \right) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{\frac{1}{π} (\int_{- π}^{0} d x (- 1) s e n (n x) + \int_{0}^{π} d x (+ 1) s e n (n x))}} s e n (n x)$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \frac{1}{π} \left( ∫_{-π}^{0} dx\; (-1)sen(nx) + ∫_{0}^{π} dx\; (+1)sen(nx) \right)}_{b_n} sen(nx) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{\frac{1}{π} (\frac{1}{n} {[c o s (n x)]}_{- π}^{0} - \frac{1}{n} {[c o s (n x)]}_{0}^{π})}} s e n (n x)$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \frac{1}{π} \left( \frac{1}{n} \left[ cos(nx) \right]_{-π}^{0} - \frac{1}{n} \left[ cos(nx) \right]_{0}^{π} \right)}_{b_n} sen(nx) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{\frac{1}{n π} (c o s (0) - c o s (- n π) - c o s (n π) + c o s (0))}} s e n (n x)$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \frac{1}{nπ} \left( cos(0) - cos(-nπ) -cos(nπ) + cos(0) \right)}_{b_n} sen(nx) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{\frac{1}{n π} (2 c o s (0) - 2 c o s (n π))}} s e n (n x)$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \frac{1}{nπ} \left( 2cos(0) - 2cos(nπ) \right)}_{b_n} sen(nx) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{\frac{1}{n π} (2 - 2 c o s (n π))}} s e n (n x)$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \frac{1}{nπ} \left( 2 - 2cos(nπ) \right)}_{b_n} sen(nx) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} \underset{b_{n}}{\underset{⏟}{\frac{1}{n π} ({\begin{matrix} 2 - 2 & \Leftrightarrow n even \\ 2 + 2 & \Leftrightarrow n odd \end{matrix})}} s e n (n x)$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \underbrace{ \frac{1}{nπ} \left( \left\{ \begin{array}{cc} 2-2 & ⇔ n \text{ even} \\ 2+2 & ⇔ n \text{ odd} \\ \end{array} \right. \right)}_{b_n} sen(nx) )$

$⊢ f (x) = \sum_{n = 1}^{\infty} {\begin{matrix} 0 & \Leftrightarrow n even \\ \frac{4}{n π} s e n (n x) & \Leftrightarrow n odd \end{matrix}$ $(TeX formula: ⊢ f(x) = ∑_{n=1}^∞ \left\{ \begin{array}{cc} 0 & ⇔ n \text{ even} \\ \frac{4}{nπ}sen(nx) & ⇔ n \text{ odd} \\ \end{array} \right. )$

$⊢ f (x) = \sum_{n = 0}^{\infty} \frac{4}{π (2 n + 1)} s e n (x (2 n + 1))$ $(TeX formula: ⊢ f(x) = ∑_{n=0}^∞ \frac{4}{π(2n+1)}sen(x(2n+1)) )$

$= \frac{4}{π} \sum_{n = 0}^{\infty} \frac{s e n (x (2 n + 1))}{2 n + 1}$ $(TeX formula: = \frac{4}{π} ∑_{n=0}^∞ \frac{sen(x(2n+1))}{2n+1} )$

$⊢ f (x) = \underset{f_{0}}{\underset{⏟}{0}} + \frac{4}{π} s e n (x) + 0 + \frac{4}{3 π} s e n (3 x) + 0 + \frac{4}{5 π} s e n (5 x) + . . .$ $(TeX formula: ⊢ f(x) = \underbrace{0}_{f_0} + \frac{4}{π}sen(x) + 0 + \frac{4}{3π}sen(3x) + 0 + \frac{4}{5π}sen(5x) + ...)$

f(x) could be just a discrete sample of a continuous signal, but because convolution with sinc(x) produces a continuous function (a sum of sine waves), the operation will fill in all the missing time points.

• Nyquist-Shannon sampling theorem

continuous functions of bounded frequency bandwidth ("band-limited") contain a bounded amount of information. they can be perfectly represented using a function of countable (i.e. discrete) sets and an interpolation operation.

the Nyquist frequency is the maximum sampling frequency at which the digitization of a continuous signal still losses fidelity. that is, aliasing occurs:

$f_{N y q u i s t} = 2 f_{m a x}$ $(TeX formula: f_{Nyquist} = 2 \; f_{max} )$

sampling frequency should be greater than twice the maximum frequency in the power spectrum of the original signal.

undersampling results in the unconsidered high frequencies aliasing (adding up) over $f_{N y q u i s t} - (f - f_{N y q u i s t})$ $(TeX formula: f_{Nyquist} - (f - f_{Nyquist}))$ at the frequency domain. oversampling is harmless, beyond being a waste of disk space.

• testing for undersampling

what if the maximum frequency isn't known in advance? it is possible to test that the maximum frequency has been considered, because aliasing will make two power spectra of the same signal look dissimilar under two different sampling frequencies:

register using two sampling frequencies, one greater than the one to test for the Nyquist criterion.
compute both power spectra ( ${| F [x (t)] |}^{2}$ $(TeX formula: \left|F[x(t)]\right|^2)$ ).
if they are simply scaled versions of one another, aliasing didn't occur. i.e. the ratio between two features (e.g. maxima) should stay constant.

• other time-series topics

• stochastic correlations

suppose the signal is generated by an stochastic process. if low frequencies dominate the power spectrum, then the process is non-stationary (time-series mean value will be trending over long periods of time) and autocorrelation will be large.

corollary: even two independent sources of noise can display high statistical dependence.

averaging the time series would be no good. prediction should take autocovariance times into account, detrending methods are also available.

• stationarity

strict/particular stationarity: system's parameters are truly constant. too hard a requirement, since we must know the system's differential equations in advance.
weak/general stationarity: empirical transition probabilities in the F map stay constant. affected by low-band frequencies. alternatively, 1st moment and autocovariance stay constant and finite. this requires greater data availability, because short acquisition times may not be enough to establish statistical relevance.

• linear prediction

suppose there's a time series $\vec{s} = (s_{1}, s_{2}, . . ., s_{N})$ $(TeX formula: \vec{s} = (s_1, s_2, ..., s_N))$ whose value $s_{N + 1}$ $(TeX formula: s_{N+1})$ we want to predict. in a linear prediction algorithm we define ${\hat{s}}_{N + 1}$ $(TeX formula: \hat{s}_{N+1})$ as a linear combination of the last m points:

${\hat{s}}_{N + 1} = \sum_{j = 1}^{m} a_{j} s_{n - m + j}$ $(TeX formula: \hat{s}_{N+1} = ∑_{j=1}^m a_j s_{n-m+j} )$

it's clear that there are as many linear combinations of the last m points as there are permutations (with substitution) of coefficients $a_{j}$ (TeX formula: a_j) . in order to find the optimal coefficients we try to minimize the mean square error predicting the previous m - 1 values:

$\sum_{n = m}^{N - 1} {({\hat{s}}_{n + 1} - s_{n + 1})}^{2}$ $(TeX formula: ∑_{n=m}^{N-1} (\hat{s}_{n+1} - s_{n+1})^2 )$

• cross-validation

in-sample accuracy: estimated from predictions made for the same data segment used during training/optimisation phase. it doesn't reflect the algorithm's true predictive power (overestimate), since tests aren't predictions but "post-dictions".
out-sample accuracy: error estimated from true predictions, using the values of data never "seen" before by our predictive model.

• phase-space methods

• dissipative systems and attractors

dissipative system: on average, the volume of the manifold describing the state trajectory contracts under dynamics, after certain initial conditions. alternatively, it can be said that #energy loss creates an attractor for that system, reducing the set of possible states that it can take with time. nonetheless, the system still operates far from thermodynamic equilibrium.

in order to verify the presence of an attractor, the L1 metric of the determinant of the system's Jacobian (i.e., of the state transition function's (F) partial derivatives) must be smaller than 1, on average:

$| d e t ((\begin{matrix} \frac{\partial f_{1}}{\partial x_{1}} & \dots & \frac{\partial f_{1}}{\partial x_{n}} \\ ⋮ & ⋮ \\ \frac{\partial f_{n}}{\partial x_{1}} & \dots & \frac{\partial f_{n}}{\partial x_{n}} \end{matrix})) | < 1$ $(TeX formula: \left| det \left( \begin{pmatrix} \frac{∂f_1}{∂x_1} & ⋯ & \frac{∂f_1}{∂x_n} \\ ⋮ & & ⋮ \\ \frac{∂f_n}{∂x_1} & ⋯ & \frac{∂f_n}{∂x_n} \\ \end{pmatrix} \right) \right| < 1 )$

this means that the transformation's scaling factor produced by change on the different directions is a contracting one. visually, if phase space contains an attractor then the system is dissipative.

• visual inspection

Poincaré section: allows to tell whether the trajectory behavior is classically deterministic, chaotic deterministic or random. it amounts to finding the intersection between trajectories and a secant plane.

stroboscopic map: graphical representation of Poincaré's section technique for attractors. section is adjusted until the scatter plot is maximally decorrelated. also used for qualitative detection of strange/fractal attractors.

"phase portrait": also known as delay representation. it is a bidimensional representation of any m-dimensional phase space. each point is given by a pair of subsequent states in phase space (see also #Takens-Whitney embedding theorem). it eases the visualisation of multidimensional systems and helps identify determinism, periodicity, linearity and chaos.

• Takens-Whitney embedding theorem

it is possible to reconstruct a phase space manifold that is isomorphic to the original one, even when all information we have is a time series, through the method known as delay reconstruction.

requirements:

stationary data
low-dimensional system
negligible noise

the basic idea is that for each subset of m equidistant points in the signal, there's an associated unique point in m-dimensional phase space. each value in this subset of the time series becomes a component of the state vector:

note that the closer $s_{1}$ (TeX formula: s_1) and $s_{2}$ (TeX formula: s_2) are, the more correlated the reconstructed attractor; rendering it less useful:

the spacing parameter among points in the time series is called the time lag τ. to set an adequate value for τ, we want τ > L; with L being the autocovariance length. that is, we want the time between two measurements to be long enough so that they aren't correlated anymore.

finding the right number m of dimensions involves increasing the number of embedding components per point, until the manifold can't unfold anymore. we say there are false neighbors (and therefore that the embedding isn't high-dimensional enough) whenever an increase in m separates seemingly close points considerably in reconstructed phase space.

• non-linear prediction

unlike the #linear prediction algorithm for time series; a non-linear algorithm doesn't use the last m points combined. rather, it builds upon the intuition that the most relevant states for making a prediction some time Δn after the present are the ones that most resembled the current state. it then averages those similar states after Δn:

${\hat{s}}_{N + Δ n} = \frac{1}{| 𝒰 (\vec{s_{N}}) |} \sum_{\vec{s_{n}} \in 𝒰 (\vec{s_{N}})} s_{n + Δ n}$ $(TeX formula: \hat{s}_{N+Δn} = \frac{1}{\left| 𝒰(\vec{s_N})\right|} ∑_{\vec{s_n} ∈ 𝒰(\vec{s_N})} s_{n+Δn} )$

where $| 𝒰 (\vec{s_{N}}) |$ $(TeX formula: \left| 𝒰(\vec{s_N})\right|)$ denotes the cardinality of the current state vector's neighborhood (in phase space).

• as stationarity test

divide the series in intervals so short so as to have lineal segments.
predict and cross-validate using all possible segment pairs.
if some training segments display unusually different performance, then they don't generalise to the whole series. therefore we can conclude that the attractor changed as we swapped training segments; and the time series isn't stationary.

• as a noise smoother

suppose the measured time series is the sum of its true value and noise:

$s_{n} = x_{n} + z_{n} .$ (TeX formula: s_n = x_n + z_n.)

where $z_{n}$ (TeX formula: z_n) is the random component. also suppose $x_{n}$ (TeX formula: x_n) and $z_{n}$ are independent, and that temporal correlation is short.

we want to substitute $s_{n}$ (TeX formula: s_n) with ${\hat{s}}_{n}$ $(TeX formula: \hat{s}_n)$ . starting from the #non-linear prediction algorithm:

in phase space, use trajectories which are close to the current one (both in the past and future portions), since the points inside that neighborhood belong to an n-dimensional sphere with center at the vector state $s_{n}$ under consideration.
apply the nonlinear prediction algorithm afterwards: average the vectors in the neighborhood. thus the stochastic component will be largely cancelled out.

${\hat{s}}_{n_{0} - m / 2} = \frac{1}{| 𝒰 (\vec{s_{n_{0}}}) |} \sum_{\vec{s_{n}} \in 𝒰 (\vec{s_{n_{0}}})} s_{n - m / 2}$ $(TeX formula: \hat{s}_{n_0-m/2}=\frac{1}{\left|𝒰(\vec{s_{n_0}})\right|} ∑_{\vec{s_n}∈𝒰(\vec{s_{n_0}})}s_{n-m/2})$

• chaos theory

• Lyapunov coefficient

quantifies chaos levels in a dynamical system. it is the exponent in the exponential divergence measurement of phase space. the system has as many Lyapunov exponents as dimensions, positive ones are chaotic dimensions:

Lyapunov coefficient	kind of attractor
λ < 0	stable fixed point
λ = 0	stable limit cycle
λ > 0	chaos
λ → ∞	noise

• estimation from time series

it is advisable to perform a #phase portrait beforehand, so as to rule out that the system is nondeterministic.

$\hat{λ} (Δ t) = \frac{1}{N} \sum_{n_{0} = 1}^{N} l n (\frac{1}{| 𝒰 (\vec{s_{n_{0}}}) |} \sum_{\vec{s_{n}} \in 𝒰 (\vec{s_{n_{0}}})} | s_{n_{0} + Δ_{n}} - s_{n + Δ_{n}} |)$ $(TeX formula: \hat{λ}(Δt) = \frac{1}{N} ∑_{n_0=1}^{N} ln \left( \frac{1}{\left|𝒰(\vec{s_{n_0}})\right|} \; ∑_{\vec{s_n}∈𝒰(\vec{s_{n_0}})} \left| s_{n_0+Δ_n} - s_{n+Δ_n} \right| \right) )$

caveats:

embedding: if the #embedding dimension is too low divergence will be underestimated because of false neighbors.
noise level: overestimated because of noise
#stochastic correlations: if autocorrelation is big, the attractor will look compressed along a diagonal, underestimating true divergence

• fractal dimension

strange attractors are the flagship of chaos: trajectories are still deterministic (no crossroads) yet non-cyclic (exact knowledge is required to tell accurately which trajectory the system is in). since some chaotic systems are also #dissipative, all of this implies that a subset of infinitely-dense phase space is being relentlessly filled by a line of infinite detail.

• box counting

box counting is one of many methods to assess scaling property of fractals.

let ε be the side of each box (n-dimensional squares), which are in turn used to segment the figure. then let N(ε) be the number of boxes. note that:

$lim_{N (ε) \to \infty} N (ε) = {(\frac{1}{ε})}^{D}$ $(TeX formula: \lim_{N(ε)→∞} \; N(ε) = \left(\frac{1}{ε}\right)^D )$

D being the box-counting fractal dimension.

$⊢ lim_{N (ε) \to \infty} l n (N (ε)) = D l n (1 / ε)$ $(TeX formula: ⊢ \lim_{N(ε)→∞} \; ln(N(ε)) = D \; ln(1/ε) )$

$⊢ D = lim_{N (ε) \to \infty} \frac{l n (N (ε))}{l n (1 / ε)} .$ $(TeX formula: ⊢ D = \lim_{N(ε)→∞} \; \frac{ln(N(ε))}{ln(1/ε)}. )$

Example:

(Image "Figure 17") — Figure 17 - Sierpinski's triangle

N(ε)	ε
3⁰	2⁰
3¹	2¹
3²	2²
...	...
3ⁿ	2ⁿ

$D = lim_{n \to \infty} \frac{l n (3^{n})}{l n (2^{n})} = lim_{n \to \infty} \frac{n l n (3)}{n l n (2)} = \frac{l n (3)}{l n (2)} = 1.5849 . . .$ $(TeX formula: D = \lim_{n→∞} \; \frac{ln(3^n)}{ln(2^n)} = \lim_{n→∞} \; \frac{n\;ln(3)}{n\;ln(2)} = \frac{ln(3)}{ln(2)} = 1.5849... )$

• correlation dimension

the correlation dimension can be used as an estimator of an attractor's dimension.(under certain circumstances), including fractal dimensions.

the idea now isn't to take the ratio of the number of boxes and their length ε in the limit to infinity. instead, we take the ratio of points (among all possible valid pairs) which fall within the neighborhood (defined by a radius of length ε) of some other point:

$D = lim_{ε \to 0} lim_{N \to \infty} \frac{\partial l n C (ε, N)}{\partial l n ε};$ $(TeX formula: D = \lim_{ε→0}\;\lim_{N→∞} \; \frac{∂ln\;C(ε,N)}{∂lnε};)$

where C(N, ϵ) is the correlation sum:

$C (N, ϵ) = \frac{2}{(N) (N - 1)} \sum_{i = 1}^{N} \sum_{j = i + 1 + n_{m i n}}^{N} 𝛩 (ϵ - | | \vec{s_{i}} - \vec{s_{j}} | |) .$ $(TeX formula: C(N,ϵ) = \frac{2}{(N)(N-1)} ∑_{i=1}^{N}\;∑_{j=i+1+n_{min}}^N 𝛩(ϵ - ||\vec{s_i}-\vec{s_j}||). )$

𝛩 is Heaviside's step function

• estimation from time series

given the lack of a phase space, it is necessary to #reconstruct it first. the right choice of delay τ during reconstruction of embedded vectors provides a suitable delay to estimate correlation dimension.

the aforementioned estimator of C(m, ϵ) is biased to underestimate the dimension, because data that are close in a trajectory of phase space tend to retain some temporal correlation.

(Image "Figure 18") — Figure 18 - this figure shows how for point B the attractor seems to have a dimension of 1, however, B's neighbors aren't statistically independent.

the corrected estimator of C(m, ϵ) excludes sufficiently-close vector pairs (in phase space); which usually correspond to subsequent vectors in the time series:

$C (m, ϵ) = \frac{2}{(N - n_{m i n}) (N - n_{m i n} - 1)} \sum_{i = 1}^{N} \sum_{j = i + 1 + n_{m i n}}^{N} 𝛩 (ϵ - | | \vec{s_{i}} - \vec{s_{j}} | |)$ $(TeX formula: C(m, ϵ) = \frac{2}{(N-n_{min})(N-n_{min}-1)} ∑_{i=1}^{N}\;∑_{j=i+1+n_{min}}^N 𝛩(ϵ - ||\vec{s_i}-\vec{s_j}||) )$

with $n_{m i n} = t_{m i n} / Δ t$ $(TeX formula: n_{min} = t_{min}/Δt)$ . $t_{m i n}$ $(TeX formula: t_{min})$ doesn't necessarily have to be the average correlation time.

the following plots from Kantz & Schreiber's book⁵ come from a map-like system. for a $t_{m i n} = 500 s t e p s$ $(TeX formula: t_{min} = 500\;steps)$ only 2.5% of all pairs is lost in the corrected correlation dimension estimator. this is statistically negligible.

(Image "Figure 19") — Figure 19 - correlation sum (*C(m,ϵ)*) as a function of the neighborhood threshold (ϵ) for various embeddings. for low-dimensional attractors (small m, upper plots) state vectors are denser, and estimated dimension is consequently greater. nonetheless, this only happens up to a certain point, when m isn't big enough to capture all dimensions of the system. Log scale allows scaling to be identified as straight lines. for small ϵ's point neighborhoods are indistinguishable from noise, so dimension slopes look exaggerated, after which a scaling region arises.

(Image "Figure 20") — Figure 20 - correlation dimension (*D(m,ϵ)*). as in the previous figure, scaling is observed for *ϵ > 100*, which makes those values a good choice (we don't want the estimation to depend on a parameter). the scaling region reveals that the hypothetical strange attractor is of dimension less than 2. if *ϵ → ∞* then the whole of phase space reduces to a single point, as far as the correlation sum estimation is concerned. that's why dimension eventually drops to 0 (macroscopic regime).

possible error sources:

the correlation is a well-defined construction (for finite data sets), whereas the correlation dimension is just an extrapolation that must be handled with care.
it isn't enough to observe fractal scaling for a single choice of m. saturation should also occur for many values.
if measurements are too noisy, a scaling regime may never be observed. the dimension estimate can be as big as m whenever ϵ is small enough, so that only noise is captured by it.
noise affects scales 3 orders of magnitude as big as its own level. in order to observe a scaling region, noise should be at most 1/8 of the whole attractor.
it is therefore advisable to use a #noise reduction algorithm in advance.
certain stochastic systems may pass for low-dimension deterministic systems.
low-resolution digitization will make the estimate look smaller
self-similarity may never be observed, even for good-quality deterministic data, depending on the manifold's large-scale structure.

• minimum delay

a space-time separation plot shows the proportion of points inside the neighborhood as a function of Δt and ϵ. we hope to find some value of Δt after which increments in ϵ don't add extra points. such delay marks the ideal choice of Δt when computing the correlation sum.

this is more rigorous a technique than using the autocorrelation distance.

• as a noise meter

figures 19 and 20 introduced the value ε that marks the limit between noise and scaling regimes. we found a resolution limit in the data, when neighborhoods were so tiny that they only capture randomly distributed points in all directions.

in a nutshell, given that ε is a distance in phase space units, we can define the noise level as the ratio between ε and the attractor's "size".

• neuroscience applications

how not to use it:

Lehnertz and Elger (1998)⁶ analysed localized intracranial EEG data from 16 epilepsy patients. they claimed that a loss of complexity in brain activity should result from the synchronization of pathological cells before and during seizures.

linear techniques can only predict seizures a few seconds in advance. here the intention is to use the correlation dimension estimate as a complexity detector, even though it cannot actually estimate the fractal dimension for these data.

a state called state 2 coincides with pre-ictal synchronization some 10 to 30 minutes in advance to the seizure. nothing is mentioned with regard to confounding sources of complexity reduction, though.

quasi-scaling regions were observed for state 2 (D₂^eff = 3.7) and the seizure state (D₂^eff = 5.3), but not the control state. in other words, complexity apparently increased, counter to theory, and there's no base level to compare it with.

plots don't really show a clear power law either that would justify the reliability of their estimates. nor was any statistical hypothesis test performed using surrogate data.

a more credible one?

Martinerie et al (1998)⁷ attempted to predict seizures by looking at phase transitions between two subsections in reconstructed phase space, also built from intracranial EEG series. it is claimed that for most cases (17 out of 19) it is possible to foretell seizure onset 2 to 6 seconds in advance, and that all of them follow a similar trajectory in phase space.

the alternative hypothesis is actually explored with surrogate data, and authors made sure to smooth noise out previous to the embedding. the number of dimensions (16) may be too unwieldy, though.

• hypothesis testing

the maximal Lyapunov coefficient and the correlation dimension come handy in detecting non-linear dynamics, however they cannot establish such a fact by themselves. it is necessary to observe a clear scaling region.

a stricter analysis would test the hypothesis that observed data are due to a linear process. this implies that certain parameter's conditional distribution (given H₀) has to be empirically estimated, so that it can be contrasted with the real data.

• surrogate data

surrogate data is a akin to resampling methods (permutation, Monte Carlo, bootstrapping) for time series. surrogate data share many similarities with true data for some respects, while consistent with H₀ for others. when testing for non-linearity, Fourier phases are generated randomly while the power spectrum is retained, since the imaginary part is where non-linearity could emerge from.

the general procedure for generating surrogate data is as follows:

decompose the signal in amplitudes and phases using the (fast, discrete) Fourier transform:

$F [s_{0} (t)] \to {A (f), φ (f)}$ $(TeX formula: F[s_0(t)] \; → \; \{A(f), \; φ(f)\} )$
substitute the alleged non-linear properties with random numbers drawn from the uniform distribution ranging from 0 to 2π radians (trying to make $ζ_{N - k} = - ζ_{k}$ $(TeX formula: ζ_{N-k} = -ζ_{k})$ , so that the imaginary part being odd will result in a substitute series belonging to ℝ. see the #Fourier transform properties section):

${A (f), ζ (f)}$ $(TeX formula: \{A(f), \; ζ(f)\} )$
wrap the surrogate series back with the inverse Fourier transform:

$s_{1} (t) \leftarrow F^{- 1} [{A (f), ζ (f)}]$ $(TeX formula: s_1(t) \; ← \; F^{-1}[\{A(f), \; ζ(f)\}] )$
test the series out (dimension estimation, Lyapunov coefficient, autocorrelation, non-linear prediction error, etc.).
rinse and repeat till the parameter's distribution given H₀ is dense enough to yield good type-I error estimates.

• statistical significance

as in standard hypothesis testing, for a significance level of α (type I error: probability that H₀ is erroneously rejected) we would like at most α% of the area under the surrogate data curve to overlap the original measurements, or be equal or more extreme than our point measurement. such a rank-based test is a non-parametric way of deriving a p-value.

for instance, in order to reject H₀ with 95% of success, it would be enough to perform $\frac{1}{.05} - 1 = 19$ $(TeX formula: \frac{1}{.05} - 1 = 19)$ tests (for a one-side test. $\frac{2}{.05} - 1 = 39$ $(TeX formula: \frac{2}{.05} - 1 = 39)$ for the two-tail case).

• as a determinism detector

does passing the test also mean that the system is deterministic (since we tested for non-linear determinism)?

partially yes, with some probability. recall that all statistical inference is based on limited evidence and inductive reasoning:

	accept H₀	reject H₀
H₀ is true	1-α (confidence)	α (type I error)
H₀ is false	β (type II error)	1-β (power)

more caveats:

we have assumed that the available power spectrum captures at least one whole period in the time domain (i.e. no systemic sampling bias due to long-term process). otherwise, random events in a non-stationary process could be misinterpreted as deviations from H₀.
the process may not follow the conjectured probability distribution (this problem is mitigated by non-parametric tests).
multiple comparisons problem: performing many independent tests overblows type I error. for those cases, significance levels should be made more stringent according to some correction method (Bonferroni e.g.)

• as a phase transition detector

the same non-linearity tests have been used to distinguish states of a system.

compute the same statistic for different time segments and compare with one another. the reasoning follows what we did in the #stationarity test.

complexity doesn't imply advanced math, for it can emerge from very simple descriptions (see rule 110 for instance). that said, this post relies on calculus (including some differential equations), inferential statistics (also the fundamentals of information theory), bits of graph theory and linear algebra; most (or all) of which should be familiar to STEM undergraduates. topics like Fourier analysis and chaos theory are introduced from there. ↩
not to be confused with "complicated" or "difficult". ↩
we could have arrived at the same harmonic oscillator equation using the mechanics of a pendulum, or an LC circuit. See system equivalence. ↩
$\forall \vec{z} \in ℂ (| \vec{z} | = \sqrt{\vec{z} \cdot \bar{\vec{z}}})$ $(TeX formula: ∀\vec{z}∈ℂ \; \left(|\vec{z}| = \sqrt{\vec{z}·\overline{\vec{z}}} \right))$ , because $\sqrt{\vec{z} \cdot \bar{\vec{z}}} = \sqrt{z_{1}^{2} + i z_{1} z_{2} - i z_{1} z_{2} + z_{2}^{2}}$ $(TeX formula: \sqrt{\vec{z}·\overline{\vec{z}}} = \sqrt{z_1^2 + iz_1z_2 - iz_1z_2 + z_2^2})$ ↩
Holger Kantz, Thomas Schreiber (2003). Nonlinear Time Series Analysis. Cambridge University Press. ↩
Klaus Lehnertz, Christian E. Elger. Can Epileptic Seizures be Predicted? Evidence from Nonlinear Time Series Analysis of Brain Electrical Activity (1998). Phys. Rev. Lett. 80, 5019. https://doi.org/10.1103/PhysRevLett.80.5019 ↩
Martinerie, J., Adam, C., Le Van Quyen, M., Baulac, M., Clemenceau, S., Renault, B., & Varela, F. J. (1998). Epileptic seizures can be anticipated by non-linear analysis. Nature medicine, 4(10), 1173. https://doi.org/10.1038/2667 ↩

Complex (Cognitive) Systems - Autonomous University of the State of Morelos

• dynamical systems basics

• harmonic oscillator

• initial conditions

• complex polar notation

• energy loss

• underdamped: ω0 > β

• ω0 = β

• overdamped: ω0 < β

• energy input

• resonance

• phase space

• Fourier analysis

• linear independence of Fourier terms

• Fourier transform

• properties

• uncertainty principle

• convolution theorem

• cross-correlation theorem

• discrete Fourier transform

• Whittaker–Shannon (sinc) interpolation

• Nyquist-Shannon sampling theorem

• testing for undersampling

• other time-series topics

• stochastic correlations

• stationarity

• linear prediction

• cross-validation

• phase-space methods

• dissipative systems and attractors

• visual inspection

• Takens-Whitney embedding theorem

• non-linear prediction

• as stationarity test

• as a noise smoother

• chaos theory

• Lyapunov coefficient

• estimation from time series

• fractal dimension

• box counting

• correlation dimension

• estimation from time series

• minimum delay

• as a noise meter

• neuroscience applications

• hypothesis testing

• surrogate data

• statistical significance

• as a determinism detector

• as a phase transition detector

• underdamped: ω₀ > β

• ω₀ = β

• overdamped: ω₀ < β