5.1 KiB
ECE 204: Numerical Methods
Linear regression
Given a regression \(y=mx+b\) and a data set \((x_{i..n}, y_{i..n})\), the residual is the difference between the actual and regressed data:
\[E_i=y_i-b-mx_i\]
Method of least squares
This method minimises the sum of the square of residuals.
\[\boxed{S_r=\sum^n_{i=1}E_i^2}\]
\(m\) and \(b\) can be found by taking the partial derivative and solving for them:
\[\frac{\partial S_r}{\partial m}=0, \frac{\partial S_r}{\partial b}=0\]
This returns, where \(\overline y\) is the mean of the actual \(y\)-values:
\[ \boxed{m=\frac{n\sum^n_{i=1}x_iy_i-\sum^n_{i=1}x_i\sum^n_{i=1}y_i}{n\sum^n_{i=1}x_i^2-\left(\sum^n_{i=1}x_i\right)^2}} \\ b=\overline y-m\overline x \]
The total sum of square around the mean is based off of the actual data:
\[\boxed{S_t=\sum(y_i-\overline y)^2}\]
Error is measured with the coefficient of determination \(r^2\) — the closer the value is to 1, the lower the error.
\[ r^2=\frac{S_t-S_r}{S_t} \]
If the intercept is the origin, \(m\) reduces down to a simpler form:
\[m=\frac{\sum^n_{i=1}x_iy_i}{\sum^n_{i=1}x_i^2}\]
Non-linear regression
Exponential regression
Solving for the same partial derivatives returns the same values, although bisection may be required for the exponent coefficient (\(e^{bx}\)) Instead, linearising may make things easier (by taking the natural logarithm of both sides. Afterward, solving as if it were in the form \(y=mx+b\) returns correct
!!! example \(y=ax^b\implies\ln y = \ln a + b\ln x\)
Polynomial regression
The residiual is the offset at the end of a polynomial.
\[y=a+bx+cx^2+E\]
Taking the relevant partial derivatives returns a system of equations which can be solved in a matrix.
Interpolation
Interpolation ensures that every point is crossed.
Direct method
To interpolate \(n+1\) data points, you need a polynomial of a degree up to \(n\), and points that enclose the desired value. Substituting the \(x\) and \(y\) values forms a system of equations for a polynomial of a degree equal to the number of points chosen - 1.
Newton’s divided difference method
This method guesses the slope to interpolate. Where \(x_0\) is an existing point:
\[\boxed{f(x)=b_0+b_1(x-x_0)}\]
The constant is an existing y-value and the slope is an average.
\[ \begin{align*} b_0&=f(x_0) \\ b_1&=\frac{f(x_1)-f(x_0)}{x_1-x_0} \end{align*} \]
This extends to a quadratic, where the second slope is the average of the first two slopes:
\[\boxed{f(x)=b_0+b_1(x-x_0)+b_2(x-x_0)(x-x_1)}\]
\[ b_2=\frac{\frac{f(x_2)-f(x_1)}{x_2-x_1}-\frac{f(x_1)-f(x_0)}{x_1-x_0}}{x_2-x_0} \]
Derivatives
Derivatives are estimated based on first principles:
\[f'(x)=\frac{f(x+h)-f(x)}{h}\]
Derivatives of continuous functions
At a desired \(x\) for \(f'(x)\):
- Choose an arbitrary \(h\)
- Calculate derivative via first principles
- Shrink \(h\) and recalculate derivative
- If the answer is drastically different, repeat step 3
Derivatives of discrete functions
Guesses are made based on the average slope between two points.
\[f'(x_i)=\frac{f(x_{i+1})-f(x_i)}{x_{i+1}-x_i}\]
Divided differences
- Using the next term, or a \(\Delta x > 0\) indicates a forward divided difference (FDD).
- Using the previous term, or a \(\Delta x < 0\) indicates a backward divided difference (BDD).
The central divided difference averages both if \(h\) or \(\Delta x\) of the forward and backward DDs are equal.
\[f'(x)=CDD=\frac{f(x+h)-f(x-h)}{2h}\]
Higher order derivatives
Taking the Taylor expansion of the function or discrete set and then expanding it as necessary can return any order of derivative. This also applies for \(x-h\) if positive and negative are alternated.
\[f(x+h)=f(x)+f'(x)h+\frac{f''(x)}{2!}h^2+\frac{f'''(x)}{3!}h^3\]
!!! example To find second order derivatives:
\begin{align*}
f''(x)&=\frac{2f(x+h)-2f(x)-2f'(x)h}{h^2} \\
&=\frac{2f(x+h)-2f(x)-(f(x+h)-f(x-h))}{h^2} \\
&=\frac{f(x+h)-2f(x)+f(x-h)}{h^2}
\end{align*}
!!! example \(f''(3)\) if \(f(x)=2e^{1.5x}\) and \(h=0.1\):
\begin{align*}
f''(3)&=\frac{f(3.1)-2\times2f(3)+f(2.9)}{0.1^2} \\
&=405.08
\end{align*}
For discrete data:
- If the desired point does not exist, differentiating the surrounding points to create a polynomial interpolation of the derivative may be close enough.
!!! example | t | 0 | 10 | 15 | 20 | 22.5 | 30 | | — | — | — | — | — | — | — | | v(t) | 0 | 227.04 | 362.78 | 517.35 | 602.47 | 901.67 |
$v'(16)$ with FDD:
Using points $t=15,t=20$:
\begin{align*}
v'(x)&=\frac{f(x+h)-f(x)}{h} \\
&=\frac{f(15+5)-f(15)}{5} \\
&=\frac{517.35-362.78}{5} \\
&=30.914
\end{align*}
$v'(16)$ with Newton's first-order interpolation:
\begin{align*}
v(t)&=v(15)+\frac{v(20)-v(15)}{20-15}(t-15) \\
&=362.78+30.914(t-15) \\
&=-100.93+30.914t \\
v'(t)&=\frac{v(t+h)-v(t)}{2h} \\
&=\frac{v(16.1)-v(15.9)}{0.2} \\
&=30.914
\end{align*}
- If the spacing is not equal (to make DD impossible), again creating an interpolation may be close enough.
- If data is noisy, regressing and then solving reduces random error.