# ECE 204: Numerical Methods

## Linear regression

Given a regression line $y=mx+b$ and a data set $(x_i, y_i)$ for $i=1..n$, the **residual** is the difference between the actual and regressed data:

$$E_i=y_i-b-mx_i$$

### Method of least squares

This method minimises the sum of the squares of the residuals.

$$\boxed{S_r=\sum^n_{i=1}E_i^2}$$

$m$ and $b$ can be found by setting the partial derivatives to zero and solving for them:

$$\frac{\partial S_r}{\partial m}=0, \frac{\partial S_r}{\partial b}=0$$

This returns, where $\overline x$ and $\overline y$ are the means of the actual $x$- and $y$-values:

$$
\boxed{m=\frac{n\sum^n_{i=1}x_iy_i-\sum^n_{i=1}x_i\sum^n_{i=1}y_i}{n\sum^n_{i=1}x_i^2-\left(\sum^n_{i=1}x_i\right)^2}} \\
b=\overline y-m\overline x
$$

The total sum of squares around the mean is based solely on the actual data:

$$\boxed{S_t=\sum^n_{i=1}(y_i-\overline y)^2}$$

Error is measured with the **coefficient of determination** $r^2$ — the closer the value is to 1, the lower the error.

$$r^2=\frac{S_t-S_r}{S_t}$$

If the intercept is at the **origin** ($b=0$), $m$ reduces down to a simpler form:

$$m=\frac{\sum^n_{i=1}x_iy_i}{\sum^n_{i=1}x_i^2}$$

## Non-linear regression

### Exponential regression

For a model such as $y=ae^{bx}$, setting the same partial derivatives to zero returns the coefficients, although the resulting equations are nonlinear, so a root-finding method such as bisection may be required for the exponent coefficient $b$ in $e^{bx}$.

Instead, linearising may make things easier by taking the natural logarithm of both sides. Afterward, solving as if it were in the form $y=mx+b$ returns the correct coefficients.

!!! example
    $y=ax^b\implies\ln y = \ln a + b\ln x$

### Polynomial regression

The residual is the error term appended to the end of the polynomial model:

$$y=a+bx+cx^2+E$$

Setting the relevant partial derivatives to zero returns a system of linear equations which can be solved as a matrix.

## Interpolation

Interpolation fits a curve that passes through every data point exactly.

### Direct method

To interpolate $n+1$ data points, you need a polynomial of a degree **up to $n$**, and points that enclose the desired value. Substituting the $x$ and $y$ values forms a system of equations for a polynomial of degree equal to the number of points chosen minus one.

### Newton's divided difference method

This method builds the interpolating polynomial out of successive slope estimates. Where $x_0$ is an existing point:

$$\boxed{f(x)=b_0+b_1(x-x_0)}$$

The constant is an existing $y$-value and the slope is the divided difference between two points.

$$
\begin{align*}
b_0&=f(x_0) \\
b_1&=\frac{f(x_1)-f(x_0)}{x_1-x_0}
\end{align*}
$$

This extends to a quadratic, where the new coefficient is the divided difference of the first two slopes:

$$\boxed{f(x)=b_0+b_1(x-x_0)+b_2(x-x_0)(x-x_1)}$$

$$
b_2=\frac{\frac{f(x_2)-f(x_1)}{x_2-x_1}-\frac{f(x_1)-f(x_0)}{x_1-x_0}}{x_2-x_0}
$$

## Derivatives

Derivatives are estimated based on first principles:

$$f'(x)=\frac{f(x+h)-f(x)}{h}$$

### Derivatives of continuous functions

At a desired $x$ for $f'(x)$:

1. Choose an arbitrary $h$.
2. Calculate the derivative via first principles.
3. Shrink $h$ and recalculate the derivative.
4. If the answer is drastically different, repeat step 3.

### Derivatives of discrete functions

Estimates are based on the slope between two adjacent points.

$$f'(x_i)=\frac{f(x_{i+1})-f(x_i)}{x_{i+1}-x_i}$$

### Divided differences

- Using the next term, or a $\Delta x > 0$, indicates a **forward divided difference (FDD)**.
- Using the previous term, or a $\Delta x < 0$, indicates a **backward divided difference (BDD)**.

The **central divided difference (CDD)** averages both if the $h$ or $\Delta x$ of the forward and backward DDs are equal.

$$f'(x)=\text{CDD}=\frac{f(x+h)-f(x-h)}{2h}$$
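As a quick sketch of the three differences (assuming Python; the test function $2e^{1.5x}$, the point $x=3$, and the step sizes are arbitrary illustrative choices), comparing against a known derivative shows the CDD is noticeably more accurate for the same $h$:

```python
import math

def fdd(f, x, h):
    """Forward divided difference: slope toward the next point."""
    return (f(x + h) - f(x)) / h

def bdd(f, x, h):
    """Backward divided difference: slope from the previous point."""
    return (f(x) - f(x - h)) / h

def cdd(f, x, h):
    """Central divided difference: the average of the FDD and BDD."""
    return (f(x + h) - f(x - h)) / (2 * h)

# Arbitrary test function with a known derivative: f'(x) = 3e^(1.5x)
f = lambda x: 2 * math.exp(1.5 * x)
exact = 3 * math.exp(1.5 * 3)

for h in (0.1, 0.01):
    print(f"h={h}: FDD err={fdd(f, 3, h) - exact:+.4f}, "
          f"BDD err={bdd(f, 3, h) - exact:+.4f}, "
          f"CDD err={cdd(f, 3, h) - exact:+.4f}")
```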
### Higher order derivatives

Taking the Taylor expansion of the function or discrete set and rearranging it as necessary can return any order of derivative. The same expansion applies for $f(x-h)$ if the signs of the odd-power terms are alternated.

$$f(x+h)=f(x)+f'(x)h+\frac{f''(x)}{2!}h^2+\frac{f'''(x)}{3!}h^3+\dots$$

!!! example
    To find second order derivatives, truncate the expansion after the $h^2$ term, then substitute the CDD for $f'(x)$:

    \begin{align*}
    f''(x)&=\frac{2f(x+h)-2f(x)-2f'(x)h}{h^2} \\
    &=\frac{2f(x+h)-2f(x)-(f(x+h)-f(x-h))}{h^2} \\
    &=\frac{f(x+h)-2f(x)+f(x-h)}{h^2}
    \end{align*}

!!! example
    $f''(3)$ if $f(x)=2e^{1.5x}$ and $h=0.1$:

    \begin{align*}
    f''(3)&=\frac{f(3.1)-2f(3)+f(2.9)}{0.1^2} \\
    &\approx405.84
    \end{align*}

    (The exact value is $4.5e^{4.5}\approx405.08$.)

For discrete data:

- If the desired point does not exist, differentiating the surrounding points to create a polynomial interpolation of the derivative may be close enough.

!!! example
    | $t$ | 0 | 10 | 15 | 20 | 22.5 | 30 |
    | --- | --- | --- | --- | --- | --- | --- |
    | $v(t)$ | 0 | 227.04 | 362.78 | 517.35 | 602.47 | 901.67 |

    $v'(16)$ with FDD, using points $t=15$ and $t=20$:

    \begin{align*}
    v'(t)&=\frac{v(t+h)-v(t)}{h} \\
    &=\frac{v(15+5)-v(15)}{5} \\
    &=\frac{517.35-362.78}{5} \\
    &=30.914
    \end{align*}

    $v'(16)$ with Newton's first-order interpolation:

    \begin{align*}
    v(t)&=v(15)+\frac{v(20)-v(15)}{20-15}(t-15) \\
    &=362.78+30.914(t-15) \\
    &=-100.93+30.914t \\
    v'(t)&=\frac{v(t+h)-v(t-h)}{2h} \\
    &=\frac{v(16.1)-v(15.9)}{0.2} \\
    &=30.914
    \end{align*}

- If the spacing is not equal (making divided differences impossible), again creating an interpolation may be close enough.
- If the data is noisy, regressing first and then differentiating the fit reduces random error (as sketched below).
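A minimal sketch of both ideas on the table above, assuming NumPy is available (`numpy.polyfit` performs the least-squares regression; the quadratic degree is an illustrative choice, not prescribed by the notes):

```python
import numpy as np

# Data from the example table above
t = np.array([0, 10, 15, 20, 22.5, 30])
v = np.array([0, 227.04, 362.78, 517.35, 602.47, 901.67])

# First-order Newton interpolation between t=15 and t=20,
# as in the worked example: the slope is the FDD itself.
slope = (v[3] - v[2]) / (t[3] - t[2])
print(slope)  # 30.914

# Regress a quadratic through the unevenly spaced data, then
# differentiate the fitted polynomial analytically at t=16.
coeffs = np.polyfit(t, v, 2)   # least-squares quadratic fit
dv = np.polyder(coeffs)        # coefficients of the fit's derivative
print(np.polyval(dv, 16))      # v'(16) from the smoothed fit
```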