对于线性回归, 成本函数为–
![J(\ Theta)= \ frac {1} {m} \ sum_ {i = 1} ^ {m} \ frac {1} {2} [h _ {\ Theta}(x ^ {(i)})-y ^ {(i)}] ^ {2}](https://www.lsbin.com/wp-content/uploads/2021/05/quicklatex.com-3716654884181f571d24bf33f1cb49bb_l3.png)
但是对于Logistic回归,
data:image/s3,"s3://crabby-images/25c0f/25c0f186cdf9c42a977ae461aa8e174c28bb2ca3" alt="h _ {\ Theta}(x)= g(\ Theta ^ {T} x)"
这将导致非凸成本函数。但这会导致成本函数具有局部最优值, 这对于梯度下降计算全局最优值来说是一个很大的问题。
data:image/s3,"s3://crabby-images/074d8/074d8fe2c8c6cf63a4a85903a1390779fbb6fc01" alt="ML Logistic回归中的成本函数1"
因此, 对于Logistic回归, 成本函数为
data:image/s3,"s3://crabby-images/43f05/43f055b81383a68de7c7fc885e01716c28be150a" alt="Cost(h _ {\ Theta}(x),y)= \ left \ {\ begin {matrix} -log(h _ {\ Theta}(x))&if&y = 1 \\ -log(1-h _ {\ Theta }(x))&if&y = 0 \ end {matrix} \ right。"
如果y = 1
data:image/s3,"s3://crabby-images/34332/34332010b0fc15916eb98034cd641192a6f8963a" alt="ML Logistic回归中的成本函数2"
Cost = 0 if y = 1, hθ(x) = 1
但是
hθ(x) -> 0
Cost -> Infinity
data:image/s3,"s3://crabby-images/ab9d4/ab9d46664fad086706527c0c36a0e7679f1ce0fa" alt="ML Logistic回归中的成本函数3"
所以,
data:image/s3,"s3://crabby-images/8ea4b/8ea4b8df55920c157d1bea7b24dde5ca90878d01" alt="Cost(h _ {\ Theta}(x),y)= \ left \ {\ begin {matrix} 0&if&h _ {\ Theta}(x)= y \\ \ infty&if&y = 0&and&h _ {\ Theta }(x)\ rightarrow 1 \\ \ infty&如果&y = 1&and&h _ {\ Theta}(x)\ rightarrow 0 \ end {matrix} \ right。"
data:image/s3,"s3://crabby-images/8a44e/8a44e6d7dca438d9dd40900e5eaeffcd162b9e6b" alt="成本(h _ {\ Theta}(x),y)= -y log(h _ {\ Theta}(x))-(1-y)log(1-h _ {\ Theta}(x))"
data:image/s3,"s3://crabby-images/7d37f/7d37f25ea0ae1cf31305bd7e516f31b3da050509" alt="J({\ Theta})= \ frac {-1} {m} \ sum_ {i = 1} ^ {m} Cost(h _ {\ Theta}(x),y)"
拟合参数θ, 必须将J(θ)最小化, 并且为此需要梯度下降。
梯度下降 -看起来与线性回归相似, 但区别在于假设hθ(X)
data:image/s3,"s3://crabby-images/42ed2/42ed2c68318fb023c87b46717f945fa436746865" alt="\ Theta_ {j}:= \ Theta_ {j}-\ alpha \ sum_ {i = 1} ^ {m}(h_ \ Theta(x ^ {(i)})-y ^ {(i)})x_j ^ {(一世)}"
首先, 你的面试准备可通过以下方式增强你的数据结构概念:Python DS课程。