打印维度是（100，2），函数内运行是（100，1）？哪里错了？

来源：6-4 实现线性回归中的梯度下降法

Jason_chen0755

2020-02-08

from playML.LinearRegression import LinearRegression
import numpy as np
import matplotlib.pyplot as plt

if __name__ == "__main__":

    x = np.random.random(size=100)
    y = x * 3. + 4. + np.random.normal(size=100)
    # plt.scatter(x,y)
    # plt.show()
    X = x.reshape(-1,1)
    X_b = np.hstack([np.ones((len(y),1)),X])
    print(X_b.shape)
    lin_reg = LinearRegression()
    lin_reg.fit_gd(X,y)

def fit_gd(self, X, y, eta=0.01, n_iters=1e4):
        assert X.shape[0] == y.shape[0], \
             "The size of X_train and y_train must be same!"
        X_b = np.hstack([np.ones((len(y), 1)),X])
        initial_theta = np.zeros(X_b.shape[1])

        def J(theta, X_b, y):
            try:
                return (y -X_b.dot(theta)) ** 2 / len(y)
            except:
                return float('inf')

        def dJ(theta, X_b,y):
            # res = np.empty(len(theta))
            # res[0] = np.sum(X_b.dot(theta) - y)
            # for i in range(1,len(theta)):
            #     res[i] = (X_b.dot(theta - y)).dot(X_b[:,i])
            return X_b.T.dot(X_b.dot(theta) - y) * 2 / len(theta)
        
        def gradient_descend(X_b,y,initial_theta,eta=0.01,n_iters=1e4,epsilon=1e-8):
            theta = initial_theta
            i_iters = 0
            while i_iters < n_iters:
                gradient = dJ(theta,X_b,y)
                last_theta = theta
                theta = theta - eta * gradient
                if abs(J(last_theta,X_b,y) - J(theta,X_b,y)) < epsilon:
                    break
                i_iters += 1
            return theta

        self._theta = gradient_descend(X,y,initial_theta,eta,n_iters=1e4,epsilon=1e-8)
        self.interception_ = self._theta[0]
        self.coef_ = self._theta[1:]
        return self

运行结果：
/Users/jason/anaconda3/bin/python3.7 /Users/jason/Desktop/MarchineLearning/main-gradient-descendent.py
Traceback (most recent call last):
File “/Users/jason/Desktop/MarchineLearning/main-gradient-descendent.py”, line 15, in
lin_reg.fit_gd(X,y)
File “/Users/jason/Desktop/MarchineLearning/playML/LinearRegression.py”, line 52, in fit_gd
self._theta = gradient_descend(X,y,initial_theta,eta,n_iters=1e4,epsilon=1e-8)
File “/Users/jason/Desktop/MarchineLearning/playML/LinearRegression.py”, line 44, in gradient_descend
gradient = dJ(theta,X_b,y)
File “/Users/jason/Desktop/MarchineLearning/playML/LinearRegression.py”, line 38, in dJ
return X_b.T.dot(X_b.dot(theta) - y) * 2 / len(theta)
ValueError: shapes (100,1) and (2,) not aligned: 1 (dim 1) != 2 (dim 0)
**(100, 2)**这是打印的维度

写回答

1回答

liuyubobobo

2020-02-09

你传的是 X 不是 X_b？

liuyubobobo

Jason_chen0755

这是正常的。因为 Jupyter Notebook 是一个展示环境，不是一个程序运行环境，程序在 Jupyter Notebook 上要经过浏览器才能开始计算。无人车或者人脸识别系统里不可能转一个 Jupyter Notebook，notebook 的意思是笔记本，它更多的是用来展示程序的测试过程，而非满足性能需求。

2020-02-13

共5条回复

Python3入门机器学习经典算法与应用

Python3+sklearn，兼顾原理、算法底层实现和框架使用。

5904 学习 · 2455 问题

查看课程

相似问题

对于多特征少样本（特征数大于样本数）的数据采用高斯核函数是不是相当于降维？

回答 2

关于维度和决策边界的问题

回答 1

关于高斯核函数的映射

回答 1

您好想问一下，是不是针对不同的算法写的梯度下降法是不一样的，大概就是说不同的学习模型有着不同的函数类型，在写梯度下降法对其优化时也是针对其改部分代码

回答 1

老师，我想问一下return的问题

回答 1

打开慕课网App查看更多内容