A simple example#
Let’s start from the ideal gas model and comparing the deterministic approach and the statistical approach explicitly.
Deterministic approach#
To describe the system in a deterministic fashion, we need to specify the initial condition and the equation of motion.
To specify the initial configuration of the system of \(N\) particles. We need to specify the case with one particle first. We know we need a three component position vector, \(\textbf{R}\), and a three component velocity vector, \(\textbf{V}\), to describe the initial condition for a single particle. When we have \(N\) particles, we will need \(N\) such vectors for the position and \(N\) such vectors for the velocity. That is, the initial condition of the system is specified by \(\{\textbf{r},\textbf{v}\}\equiv\{\textbf{R}_1,\cdots,\textbf{R}_N;\textbf{V}_1,\cdots,\textbf{V}_N\}\). Here \(\textbf{r}\) and \(\textbf{v}\) are vectors of length \(3N\).
With the initial condition specified by \(\{\textbf{r},\textbf{v}\}\), we still need to construct the equation of motion. In general, they are described by
In practice, we usually do not have the detailed information of the system with \(N\sim \mathcal{O}(10^{23})\). Also, the coupled equation of motion is very complicated. Even if we manage to have the solution, it is impossible to make sense out of this complicated deterministic result.
Statistical description#
Now we understand the challenge from the deterministic approach. We known we have a huge space that describes the possible initial condition of our macroscopic system. Are all the initial condition equivalent? Suppose we have a \(1m^3\) box, it is possible that we have a “homogeneous” distribution of or particle as our initial condition. The initial condition that all the particles are concentrated at the top left corner of \(1cm^3\) volume of our box is also valid. However, from the intuition, we know it is more likely that the gas molecules will distribute evenly, instead of choosing to sit at one corner.
This intuition motivates us to make some postulation about the initial conditions. Following is the Maxwell’s postulates:
\(\{\textbf{R}_i,\textbf{V}_i\}\) and \(\{\textbf{R}_j,\textbf{V}_j\}\) are independent variables for different \(i,j\). Let’s use \(f(\textbf{r},\textbf{v})\) to denote the density of particles in the space at \(\textbf{r},\textbf{v}\). The number of particles in this volume is \(dN=f(\textbf{r},\textbf{v})d\textbf{r}d\textbf{v}\).
\(\textbf{r}\) and \(\textbf{v}\) are independent variables. So we expect \(f(\textbf{r},\textbf{v})=f_r(\textbf{r})f_v(\textbf{v})\).
The density is unifor in the space. So we know \(f_r(\textbf{r})=N/V\).
The velocity components, the \(x,y,z\) components, are mutually independent. So we have \(f_v(\textbf{v})=f_x(v_x)f_y(v_y)f_z(v_z)\)
The distribution of the velocity only depends on the magnitude of the velocity. Also, the system is isotropic. That is, \(f_v(\textbf{v})=F(|\textbf{v}|^2)=F(v_x^2+v_y^2+v_z^2)\). Similarly, we expect \(f_x(v_x)=\phi(v_x^2), f_y(v_y)=\phi(v_y^2),f_z(v_z)=\phi(v_z^2)\). The isotropic assumption suggest \(f_x,f_y,f_z\) should have the same functional form \(\phi\).
From 4 and 5, we have
Here, we choose the normalization \(\phi(0)=1\) for simplicity. Once we make such a choice, we have \(F(v_x^2)=\phi(v_x^2)\) and \(F(v_x^2+v_y^2)=\phi(v_x^2)\phi(v_y^2)=F(v_x^2)+F(v_y^2)\).
Let \(G(x)=\ln[F(x)]\), we have
This is the Cauchy functional equation. If we assume \(G(x)\) is continuous, the general solution is \(G(x)=cx\). This implies \(F(x)\propto e^{cx}\) where \(c\) is a normalization constant to be determined. To have a normalizable distribution function, we can choose \(c=-|c|<0\). So we have
Once we figure out the form of the probability density function, we know the normalization condition is
So we have the normalization condition
Performing the Gaussian integral, we have \(f_{\mu}(u)=\sqrt{\frac{\lambda}{\pi}}\exp[-\lambda u^2]\).
At this point, we don’t know what is \(\lambda\). But we can already evaluate the expectation value, \(\langle v^2\rangle=\langle v_x^2+v_y^2+v_z^2\rangle=3\int v_x^2 f_{\mu}(v_x)dv_x=\frac{3}{2\lambda}\).
To figure out the expression of \(\lambda\) using the microscopic modeling, we can consider the ideal gas equation \(PV=Nk_BT\) and the microscopic expression of the concept of pressure, \(P\).
Consider the surface expand in the \(y-z\) plan, the particles collide with the container without energy lost, so we have the momentum transfer to be
We also know the magnitude of pressure is defined as
So we have \(\lambda=\frac{m}{2k_BT}\).
What do we achieve?#
Let’s review what we have done here.
We assume the initial condition satisfy some statistical properties. Here, it is the statistical independence between particles, statistical independence between \(\textbf{r},\textbf{v}\) and the statistical independence of the projection of \(\textbf{r},\textbf{v}\) on the \(x,y,z\) axes.
We then can argue the form of the distribution function with \(\lambda\) undetermined.
We use the microscopic picture and the ideal gas equation \(PV=Nk_BT\) to derive the expression of \(\lambda\).
Then we find the corresponding distribution function.
However, it is not clear that this distribution function is compatible with the deterministic approach. That is, even if we assume the initial condition satisfied this distribution function, it does not mean the distribution function will be satisfied for arbitrary time after we evolve the particles using the deterministic equation of motion!
To rationalize this picture, we postulate that the statistical independence assumption captures the most relevant initial conditions. Even if the deterministic evolution of the system might drive the system outside of this statistical independence assumption, the chance that the system evolved into such status is small and measure zero when the system is at the macroscopic scale.
So one can imagine that if we start from this particular distribution function as our initial condition. As the system evolved in a deterministic way, we expect the distribution function to fluctuates in time. The statistical independence picture we derived previously suggest the fluctuation to be estimated by the relative standard deviation we discussed in the random walk section. At the limit \(N\to\infty\), our distribution function will play dominant role and will be a very good approximation for our system. This is the spirit of the statistical description.