动手学深度学习：多层感知机

主要介绍了softmax，ReLU,tanh激活函数，以及权重衰退和dropout暂退法。

snowful world

548人浏览 · 2025-02-28 13:24:38

snowful world · 2025-02-28 13:24:38 发布

多层感知机

之前学习了线性模型，我们可以很容易想到，现实生活中很多的现象用线性是无法拟合的，所以研究人员就想到了在线性之后添加一些非线性的激活函数，使得整个网络获得非线性，以拟合更加复杂的情况。

常见的激活函数

数学公式自行查找

ReLU

import torch
import matplotlib.pyplot as plt
x=torch.arange(-10.0,10.0,0.1,requires_grad=True)
x

在这里插入图片描述

y=torch.relu(x)
y

在这里插入图片描述

可视化图像

plt.plot(x.detach(),y.detach())

在这里插入图片描述

可视化梯度

y.backward(torch.ones_like(x),retain_graph=True)
plt.clf()
plt.plot(x.detach(),x.grad)

在这里插入图片描述

sigmoid

y=torch.sigmoid(x)
y

在这里插入图片描述

可视化图像

plt.clf()
plt.plot(x.detach(),y.detach())

在这里插入图片描述

可视化梯度

x.grad.data.zero_()
y.backward(torch.ones_like(x),retain_graph=True)
plt.clf()
plt.plot(x.detach(),x.grad)

在这里插入图片描述

tanh

y=torch.tanh(x)
y

在这里插入图片描述

可视化图像

plt.clf()
plt.plot(x.detach(),y.detach())

在这里插入图片描述

可视化梯度

x.grad.data.zero_()
y.backward(torch.ones_like(x),retain_graph=True)
plt.clf()
plt.plot(x.detach(),x.grad)

在这里插入图片描述

读取数据

我上一篇博客写过，点击直接跳转

模型（ReLU激活+权重衰减+dropout）

上一篇博客train和test的acc中间就有1%的差距，似乎是有一点过拟合，在添加ReLU激活函数将线性模型变成多层感知机之后（整体准确率提高1%）会发现两者之间仍然有差距，所以我尝试了权重衰减和dropout。

权重衰减0.01会发现两者差距几乎没有，但是train的acc降低了1%，几乎跟线性效果差不多了，所以不可取。

dropout0.3会发现两者几乎重合，train下降很少，同时test也有所上升。

from torch import nn
net=nn.Sequential(nn.Flatten(),
                  nn.Linear(28*28,256),
                  nn.ReLU(),
                  nn.Dropout(0.3),
                  nn.Linear(256,8))
def init_weight(m):
    if type(m)==nn.Linear:
        nn.init.normal_(m.weight,std=0.01)
net.apply(init_weight)

在这里插入图片描述
权重衰退就是在优化器传参的时候对Linear（）的参数进行限制。

wd=0.00 #权重衰减的值
loss_fn=nn.CrossEntropyLoss()
optimer=torch.optim.SGD([{"params": net[1].weight,"weight_decay": wd},{"params": net[1].bias},{"params": net[4].weight,"weight_decay": wd},{"params": net[4].bias}],lr=0.1)

epochs_num=10
train_len=len(train_iter.dataset)
all_acc=[]
all_loss=[]
test_all_acc=[]
for epoch in range(epochs_num):
    acc=0
    loss=0
    for x,y in train_iter:
        hat_y=net(x)
        l=loss_fn(hat_y,y)
        loss+=l
        optimer.zero_grad()
        l.backward()
        optimer.step()
        acc+=(hat_y.argmax(1)==y).sum()
    all_acc.append(acc/train_len)
    all_loss.append(loss.detach().numpy())
    test_acc=0
    test_len=len(test_iter.dataset)
    with torch.no_grad():
        for x,y in test_iter:
            hat_y=net(x)
            test_acc+=(hat_y.argmax(1)==y).sum()
    test_all_acc.append(test_acc/test_len)
    print(f'{epoch}的test的acc{test_acc/test_len}')

在这里插入图片描述

可视化

import matplotlib.pyplot as plt

损失函数可视化

plt.plot(range(1,epochs_num+1),all_loss,'.-',label='train_loss')
plt.text(epochs_num, all_loss[-1], f'{all_loss[-1]:.4f}', fontsize=12, verticalalignment='bottom')

在这里插入图片描述

准确率可视化

plt.plot(range(1,epochs_num+1),all_acc,'-',label='train_acc')
plt.text(epochs_num, all_acc[-1], f'{all_acc[-1]:.4f}', fontsize=12, verticalalignment='bottom')
plt.plot(range(1,epochs_num+1),test_all_acc,'-.',label='test_acc')
plt.legend()

在这里插入图片描述

预测结果

with torch.no_grad():
    all_num=5
    index=1
    plt.figure(figsize=(12,5))
    for i,label in zip(test_data_path,test_labels):
        if index<=all_num:
            img=cv2.imread(i)
            input_img=cv2.cvtColor(img,cv2.COLOR_BGR2GRAY)
            img=cv2.cvtColor(input_img,cv2.COLOR_BGR2RGB)
            input_img=cv2.resize(input_img,size) 
            input_img=transforms.ToTensor()(input_img)
            result=net(input_img).argmax(1)
            plt.subplot(1,all_num,index)
            plt.imshow(img)
            plt.title(f'true{label},predict{result.detach().numpy()}')
            plt.axis("off")
            index+=1

在这里插入图片描述

九章云极普惠算力

更多推荐

@github/relative-time-element 与标准＜time＞元素的对比分析

在现代Web开发中，时间展示是用户体验的重要组成部分。标准HTML5的`<time>`元素虽然能够标记时间，但在动态显示和本地化方面存在局限。而`@github/relative-time-element`作为一款强大的Web组件扩展，为开发者提供了更灵活、智能的时间处理方案。本文将深入对比这两种时间元素的功能特性，帮助你快速掌握它们的差异与应用场景。## 核心功能对比：静态标记 vs 动态智

九章云极普惠算力

March7thAssistant企业合作：探索与游戏开发商的创新合作模式

March7thAssistant作为一款专注于崩坏：星穹铁道的全自动辅助工具，正通过其强大的自动化任务处理能力，为游戏生态带来新的可能性。本文将深入探讨该工具与游戏开发商之间的潜在合作空间，以及如何通过技术创新实现双赢。## 工具核心价值：重新定义玩家体验March7thAssistant的核心优势在于其全面的自动化任务系统，能够帮助玩家高效完成日常任务、资源收集和活动参与。从自动战斗到

九章云极普惠算力

为什么选择Topcoat？探索轻量级CSS框架的独特优势 ✨

Topcoat是一个专注于构建**干净且快速Web应用**的轻量级CSS框架。它通过精心设计的样式规则和组件系统，帮助开发者轻松创建具有专业外观的用户界面，同时保持代码的简洁性和高性能。无论是桌面端还是移动端应用，Topcoat都能提供一致且优雅的设计体验。## 🚀 Topcoat的核心优势### 1. 极致轻量化设计Topcoat的核心理念是"轻装上阵"。相比其他动辄数百KB的CSS