PyTorch Lightning Study Notes

🕗 Published 2024-07-06 17:40 pytorch

Introduction

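PyTorch Lightning differs from plain PyTorch in how the code is organized. I find Lightning's structure more disciplined: when reading a large, tedious project you can locate the relevant part quickly, and more of the boilerplate is encapsulated. The main components are introduced below.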

Training

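Highlights: tensors no longer have to be moved to CUDA by hand, and a lot of boilerplate disappears, such as zero_grad, backward, and optimizer.step.

In plain PyTorch, the training loop usually looks like this: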

# Assumes model, loss, optimizer and train_loader are already defined.
num_epoch = 5
for epoch in range(num_epoch):
    losses = list()
    accuracies = list()
    model.train()
    for batch in train_loader:
        x, y = batch

        # x: b x 1 x 28 x 28
        b = x.size(0)
        x = x.view(b, -1).cuda()

        # 1. forward
        l = model(x)

        # 2. compute the objective function
        J = loss(l, y.cuda())

        # 3. clear the gradients
        model.zero_grad()                    # boilerplate

        # 4. accumulate the partial derivatives of J wrt params
        J.backward()                         # boilerplate

        # 5. step in the opposite direction of the gradient
        optimizer.step()                     # boilerplate

In PyTorch Lightning, this step is simplified:

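# Import path from older Lightning releases; newer versions moved the
# metrics into the separate torchmetrics package.
from pytorch_lightning.metrics.functional import accuracy

def training_step(self, batch, batch_idx):
    x, y = batch

    # x: b x 1 x 28 x 28
    b = x.size(0)
    x = x.view(b, -1)      # no need to call .cuda()

    # 1. forward
    l = self(x)            # inside the class, self(x) calls forward

    # 2. compute the objective function
    J = self.loss(l, y)    # no .cuda() needed; self.loss must be defined in the class's __init__

    acc = accuracy(l, y)
    pbar = {'train_acc': acc}

    # Older releases read the 'progress_bar' key; newer ones use
    # self.log('train_acc', acc, prog_bar=True) instead.
    return {'loss': J, 'progress_bar': pbar}    # return the loss in a dict
    # return J               # or return the loss tensor directly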
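To make concrete which steps Lightning takes over, here is a minimal sketch of the loop that runs around training_step. It is a toy stand-in written with plain torch, not Lightning's actual implementation; TinyClassifier, toy_fit and fake_loader are hypothetical names invented for this illustration.

```python
import torch
from torch import nn


class TinyClassifier(nn.Module):
    """Hypothetical model exposing Lightning-style hooks on a plain nn.Module."""

    def __init__(self):
        super().__init__()
        self.net = nn.Linear(28 * 28, 10)
        self.loss = nn.CrossEntropyLoss()

    def forward(self, x):
        return self.net(x)

    def training_step(self, batch, batch_idx):
        x, y = batch
        x = x.view(x.size(0), -1)        # flatten b x 1 x 28 x 28 to b x 784
        return self.loss(self(x), y)     # only the loss is returned

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=1e-2)


def toy_fit(module, loader, num_epochs=1):
    """Toy trainer: performs the steps the user no longer writes by hand."""
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    module.to(device)
    optimizer = module.configure_optimizers()
    module.train()
    history = []
    for _ in range(num_epochs):
        for batch_idx, (x, y) in enumerate(loader):
            batch = (x.to(device), y.to(device))   # device placement
            loss = module.training_step(batch, batch_idx)
            optimizer.zero_grad()                  # clear old gradients
            loss.backward()                        # backpropagate
            optimizer.step()                       # update parameters
            history.append(loss.item())
    return history


torch.manual_seed(0)
# Random tensors standing in for a few MNIST-shaped batches.
fake_loader = [(torch.randn(8, 1, 28, 28), torch.randint(0, 10, (8,)))
               for _ in range(3)]
losses = toy_fit(TinyClassifier(), fake_loader)
```

The model code only describes one step and the optimizer; everything inside toy_fit is the part that the framework owns in a real Lightning project.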

Validation

Largely the same as Training.
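As a rough illustration of what "the same as Training" means, the sketch below computes validation metrics with plain torch so that it runs without Lightning installed. In a LightningModule the same body would live in validation_step(self, batch, batch_idx) and the values would be reported with self.log; Lightning switches to eval mode and disables gradients during validation by itself. The function name validation_metrics and the random batch are assumptions made for this example.

```python
import torch
from torch import nn


def validation_metrics(model, loss_fn, batch):
    """Compute loss and accuracy for one batch without tracking gradients."""
    x, y = batch
    x = x.view(x.size(0), -1)                 # flatten b x 1 x 28 x 28
    with torch.no_grad():                     # no backward pass in validation
        logits = model(x)
        loss = loss_fn(logits, y)
        acc = (logits.argmax(dim=1) == y).float().mean()
    return {"val_loss": loss.item(), "val_acc": acc.item()}


torch.manual_seed(0)
model = nn.Linear(28 * 28, 10)
model.eval()                                  # Lightning does this for you
batch = (torch.randn(8, 1, 28, 28), torch.randint(0, 10, (8,)))
metrics = validation_metrics(model, nn.CrossEntropyLoss(), batch)
```

The difference from the training step is what is missing: no zero_grad, no backward, and no optimizer step.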

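Loss Function

In plain PyTorch you typically define the network first and then write a separate train function that sets up the loss. In PyTorch Lightning the loss is set up in advance in the network's __init__, for example: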

def __init__(self):
    super().__init__()
    self.loss = nn.CrossEntropyLoss()
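A small check of what nn.CrossEntropyLoss expects may help here: it takes raw logits of shape (batch, classes) and integer class indices of shape (batch,), and applies log-softmax internally. The tensors below are made up for illustration. With all-zero logits every class gets probability 1/10, so the loss should equal ln(10).

```python
import math

import torch
from torch import nn

loss_fn = nn.CrossEntropyLoss()

logits = torch.zeros(4, 10)                # raw scores, no softmax applied
targets = torch.tensor([0, 3, 7, 9])       # class indices, not one-hot vectors

uniform_loss = loss_fn(logits, targets).item()
expected = math.log(10)                    # -log(1/10) for a uniform prediction
```

This is also why the training_step above passes the network output to self.loss directly, without a softmax layer in between.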

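Optimizers

Single optimizer: when there is only one optimizer, return the optimizer object directly and PyTorch Lightning handles the rest. Returning [optimizer] also works, but a list is normally unnecessary for a single optimizer: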

def configure_optimizers(self):
    optimizer = optim.SGD(self.parameters(), lr=1e-2)
    return optimizer

Multiple optimizers: when there are several optimizers, return them in a list:

def configure_optimizers(self):
    lr_g = self.lr_g_factor * self.learning_rate
    lr_d = self.learning_rate
    opt_ae = torch.optim.Adam(list(self.encoder.parameters()) +
                              list(self.decoder.parameters()) +
                              list(self.quant_conv.parameters()) +
                              list(self.post_quant_conv.parameters()),
                              lr=lr_g, betas=(0.5, 0.9))
    opt_disc = torch.optim.Adam(self.loss.discriminator.parameters(),
                                lr=lr_d, betas=(0.5, 0.9))

    return [opt_ae, opt_disc]
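Returning two optimizers also changes how the training step is driven. To my understanding, Lightning 1.x passes an extra optimizer_idx argument to training_step, while Lightning 2.x expects manual optimization (self.automatic_optimization = False, together with self.optimizers() and self.manual_backward). The sketch below shows the underlying alternating update with plain torch only; the two Linear layers and both losses are hypothetical stand-ins for the autoencoder and the discriminator, not the original model.

```python
import torch
from torch import nn

torch.manual_seed(0)
generator = nn.Linear(4, 2)        # stand-in for encoder/decoder
discriminator = nn.Linear(2, 1)    # stand-in for self.loss.discriminator

opt_ae = torch.optim.Adam(generator.parameters(), lr=1e-3, betas=(0.5, 0.9))
opt_disc = torch.optim.Adam(discriminator.parameters(), lr=1e-3, betas=(0.5, 0.9))

gen_before = generator.weight.detach().clone()
disc_before = discriminator.weight.detach().clone()

x = torch.randn(8, 4)

# Step 1: update the generator only.
opt_ae.zero_grad()
g_loss = -discriminator(generator(x)).mean()
g_loss.backward()
opt_ae.step()

# Step 2: update the discriminator only. zero_grad drops the gradients that
# step 1 left on the discriminator; detach() keeps step 2 out of the generator.
opt_disc.zero_grad()
d_loss = discriminator(generator(x).detach()).mean()
d_loss.backward()
opt_disc.step()

gen_changed = not torch.equal(gen_before, generator.weight)
disc_changed = not torch.equal(disc_before, discriminator.weight)
```

Each optimizer owns a disjoint set of parameters, which is why configure_optimizers builds opt_ae and opt_disc from different parameter lists.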

Multiple optimizers plus schedulers: return a list of optimizers and a list of schedulers:

def configure_optimizers(self):
    lr_g = self.lr_g_factor * self.learning_rate
    lr_d = self.learning_rate
    opt_ae = torch.optim.Adam(list(self.encoder.parameters()) +
                              list(self.decoder.parameters()) +
                              list(self.quant_conv.parameters()) +
                              list(self.post_quant_conv.parameters()),
                              lr=lr_g, betas=(0.5, 0.9))
    opt_disc = torch.optim.Adam(self.loss.discriminator.parameters(),
                                lr=lr_d, betas=(0.5, 0.9))

    scheduler_ae = torch.optim.lr_scheduler.StepLR(opt_ae, step_size=10, gamma=0.1)
    scheduler_disc = torch.optim.lr_scheduler.StepLR(opt_disc, step_size=10, gamma=0.1)

    return [opt_ae, opt_disc], [scheduler_ae, scheduler_disc]
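To see what StepLR(step_size=10, gamma=0.1) does to the learning rate, the following sketch steps a scheduler by hand with plain torch. The single dummy parameter and the starting rate of 1e-2 are assumptions for the demonstration; when schedulers are returned as a plain list like above, Lightning by default steps them once per epoch, which the loop below imitates.

```python
import torch
from torch import nn

param = nn.Parameter(torch.zeros(1))       # dummy parameter for the demo
optimizer = torch.optim.SGD([param], lr=1e-2)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.1)

lrs = []
for epoch in range(25):
    lrs.append(optimizer.param_groups[0]["lr"])   # rate used in this epoch
    optimizer.step()                              # no gradients, so a no-op
    scheduler.step()                              # one scheduler step per epoch

first_lr, tenth_epoch_lr, twentieth_epoch_lr = lrs[0], lrs[10], lrs[20]
```

The rate stays at 1e-2 for epochs 0 to 9, drops to 1e-3 for epochs 10 to 19, and to 1e-4 from epoch 20 on.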

Original article: https://blog.csdn.net/weixin_45696917/article/details/140078749
