softmax(3)--简洁实现

2024-05-24 约 182 字预计阅读 1 分钟

注意

本文主要记录代码，优化细节，添加注释。

加载数据

import torch
from torch import nn
from d2l import torch as d2l
batch_size = 256
train_iter, test_iter = d2l.load_data_fashion_mnist(batch_size)

初始化参数

# PyTorch不会隐式地调整输入的形状。因此，
# 我们在线性层前定义了展平层（flatten），来调整网络输入的形状
net = nn.Sequential(nn.Flatten(), nn.Linear(784, 10))

def init_weights(m):
    if type(m) == nn.Linear:
        nn.init.normal_(m.weight, std=0.01)

net.apply(init_weights);

损失函数

loss = nn.CrossEntropyLoss(reduction='none')

这里的交叉熵包括了softmax,也就是说，这一层直接接受原始输出就可以

优化器

trainer = torch.optim.SGD(net.parameters(), lr=0.1)

训练

num_epochs = 10
d2l.train_ch3(net, train_iter, test_iter, loss, num_epochs, trainer)

赞赏支持

微信打赏

支付宝打赏

目录

softmax(3)--简洁实现

加载数据

初始化参数

损失函数

优化器

训练