Machine Learning Quizzes

Quiz 1

Question

$y = A x + v$ , where $v$ is a Gaussian noise.

What is the optimal solution for $x$ ?
What is the optimal solution for $x$ if $v \sim N (0, R)$ ?
What is the optimal solution for $x$ if $v \sim N (0, R)$ and $X \sim N (0, a I)$ ?
$A$ and $X$ are unknown, what is the optimal solution for $x$ ?

Answer

$J (x) = \frac{1}{2} (y - A x)^{T} (y - A x)$ , $\frac{\partial J}{\partial x} = 0$ , $x = (A^{T} A)^{- 1} A^{T} y$
$J (x) = \frac{1}{2} (y - A x)^{T} R^{- 1} (y - A x)$ , $\frac{\partial J}{\partial x} = 0$ , $x = (A^{T} R^{- 1} A)^{- 1} A^{T} R^{- 1} y$
$J (x) = \frac{1}{2} (y - A x)^{T} R^{- 1} (y - A x) + \frac{1}{2} x^{T} (a I)^{- 1} x$ , $\frac{\partial J}{\partial x} = 0$ , $x = (A^{T} R^{- 1} A + a I)^{- 1} A^{T} R^{- 1} y$
We can distinguish two cases:
1. For x: $J (x) = \frac{1}{2} (y - A x)^{T} R^{- 1} (y - A x) + \frac{1}{2} x^{T} (a I)^{- 1} x$ , $\frac{\partial J}{\partial x} = 0$ , $x = (A^{T} R^{- 1} A + a I)^{- 1} A^{T} R^{- 1} y$
2. For A: $Y^{T} = X^{T} A^{T}$ $J (A) = \frac{1}{2} (Y - X A)^{T} R^{- 1} (Y - X A)$ , $\frac{\partial J}{\partial A} = 0$ , $A^{T} = (X R^{- 1} X^{T})^{- 1} X R^{- 1} Y^{T}$

Quiz 2

Question

$Y = A X + ω$ , where $ω \sim N (0, Q)$ and $X \sim N (μ_{0}, Σ_{0})$

What is $p (Y | X)$ ?
What is $p (Y)$ ?
What is $p (X | Y)$ ?
What is $p (Y^{'} | Y)$ ?

Answer

$p (Y | X) \sim N (A X, Q)$ We regard $X$ as a constant under conditional probability.
$p (Y) \sim \int p (Y | X) p (X) d x \sim N (A μ_{0}, A Σ_{0} A^{T} + Q)$ . $v a r [Y] = v a r [A X] + v a r [ω] = A Σ_{0} A^{T} + Q$
Assume that $p (X | Y) \sim N (m, L)$ , then we can use the equality of quadratic from to solve the problems.
1. $p (X | Y) \sim p (Y | X) p (X) = N (Y | A X, Q) N (X | μ_{0}, Σ_{9})$
2. $- \frac{1}{2} (x - m)^{T} L^{- 1} (x - m) \propto - \frac{1}{2} (y - A x)^{T} Q^{- 1} (y - A x) - \frac{1}{2} (x - μ_{0})^{T} Σ_{0}^{- 1} (x - μ_{0})$
3. We can get the result:
$\begin{aligned} L^{- 1} & = A^{T} Q^{- 1} A + Σ_{0}^{- 1} \\ L^{- 1} m & = A^{T} Q^{- 1} y + Σ_{0}^{- 1} μ_{0} \end{aligned}$
1. By applying $[A + B C D]^{- 1} = A^{- 1} - A^{- 1} B [C^{- 1} + D A^{- 1} B]^{- 1} D A^{- 1}$
$\begin{aligned} L & = (I - K A) Σ_{0} \\ m & = μ_{0} + K (y - A μ_{0}) \end{aligned}$ where $K = Σ_{0} A^{T} (A^{T} Σ_{0} A + Q)^{- 1}$
$p (Y^{'} | Y) \sim \int p (Y^{'} | X) p (X | Y) d x \sim N (A m, A L A^{T} + Q)$ . The same format as question 2.

Quiz 3

TIP

Learning: $p (θ | D) \propto p (D | θ) p (θ)$
Prediction: $p (D^{n e w} | D) = \int p (D^{n e w} | θ) p (θ | D) d θ$
Evaluation: $p (D) = \int p (D | θ) p (θ) d θ$

Question

Given $t = Φ (x) ω + v$ where $Φ (x) = [1, x, x . . ., x^{M}]$ and $v \sim N (0, β^{- 1})$ , $D = {[x_{1}, . . ., x_{N}], [t_{1}, . . ., t_{N}]}$

What is the solution of $ω_{M L}$ ?
What is the solution of $ω_{M A P}$ if $ω \sim N (0, α^{- 1} I)$ ?
What is the predictive distribution if $D^{n e w} = {x^{n e w}, t^{n e w}}$ ?
What is the model evaluation?

Answer

$J (ω) = \frac{β}{2} (T - Φ ω)^{T} (T - Φ ω) \to ω_{M L} = (Φ^{T} Φ)^{- 1} Φ^{T} T$
$J (ω) = \frac{β}{2} (T - Φ ω)^{T} (T - Φ ω) + \frac{α}{2} ω^{T} ω \to ω_{M A P} = (β Φ^{T} Φ + α I)^{- 1} β Φ^{T} T$
$N (Φ (x^{n e w}) ω_{M A P}, Φ (x^{n e w}) Σ_{M A P} Φ (x^{n e w})^{T} + β I)$
$N (0, α^{- 1} Φ Φ^{T} + β^{- 1} I)$

Quiz 4

Question

For $y = σ (Φ (x) w)$ , and $D = {[x_{1}, . . ., x_{N}], [t_{1}, . . ., t_{N}]}$ , where $σ (x) = \frac{1}{1 + e^{- x}}$ .

What is the solution of $w_{M L}$ ?
What is the solution of $w_{M A P}$ if $w \sim N (m_{0}, S_{0})$ ?
What is the predictive distribution if $D^{n e w} = {x^{n e w}, t^{n e w} = 1}$ ?
What is the model evaluation?

Answer

$J (w) = - \sum_{n = 1}^{N} {t_{n} \log y_{n} + (1 - t_{n}) \log (1 - y_{n})} b = ▽ J (w) = \sum_{n = 1}^{N} ϕ^{T} (y_{n} - t_{n}) H = ▽ ▽ J (w) = \sum_{n = 1}^{N} y_{n} (1 - y_{n}) ϕ_{n}^{T} ϕ$
Because $σ$ is not a linear function, there are no explicit solution to find $w_{M L}$ . We can use the gradient descent method to find the solution.
$w^{+} \larr w - H^{- 1} b$
$J (w) = - \sum_{n = 1}^{N} {t_{n} \log y_{n} + (1 - t_{n}) \log (1 - y_{n})} + \frac{1}{2} (w - m_{0})^{T} S_{0}^{- 1} (w - m_{0})$
Therefore, $b = \triangledown J(w) = \sum_{n=1}^N \phi^T(y_n - t_n) + S_0^{-1}(w - m_0) $ and $H = ▽ ▽ J (w) = \sum_{n = 1}^{N} y_{n} (1 - y_{n}) ϕ_{n}^{T} ϕ + S_{0}^{- 1}$
$p (t^{n e w} = 1 | x^{n e w}, D) = \int p (t^{n e w} = 1 | w) p (w | D) d w = \int σ (ϕ^{n e w} w) N (w_{M A P}, H^{- 1}) d w$
$σ (κ (σ_{a}^{2}) μ_{a})$
$\sum_{n = 1}^{N} [t_{n} \ln y_{n} + (1 - t_{n}) \ln (1 - y_{n})]_{M A P} - \frac{1}{2} (w_{M A P} - m_{0})^{T} S_{0}^{- 1} (w_{M A P} - m_{0}) + \frac{M}{2} \ln 2 π - \frac{1}{2} \ln | H |_{M A P}$

Machine Learning Quizzes ​

Quiz 1 ​

Question ​

Answer ​

Quiz 2 ​

Question ​

Answer ​

Quiz 3 ​

Question ​

Answer ​

Quiz 4 ​

Question ​

Answer ​

Machine Learning Quizzes

Quiz 1

Question

Answer

Quiz 2

Question

Answer

Quiz 3

Question

Answer

Quiz 4

Question

Answer