
Notes on EPSANet (2021)


Source:

EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural Network

Related work:

#attention-mechanism #multi-scale-feature-representation

Innovations:

The core innovation is the Pyramid Squeeze Attention (PSA) block: a Split-and-Concat (SPC) step extracts multi-scale features with parallel group convolutions of different kernel sizes, an SE module computes channel weights for each scale branch, and a softmax across the branches recalibrates the attention. (The two figures illustrating the block in the original post are not preserved.)

Contributions:

  1. Builds long-range channel dependencies
  2. Effectively extracts and exploits the spatial information of feature maps at different scales

Issues:

  • The released code does not match the processing steps described in the paper
  • Performs poorly when trained and tested on small-sample datasets

Code:
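
The reference implementation below is from the authors' repository (comments added in this note). SEWeightModule produces per-channel SE weights; PSAModule splits the computation across four convolution branches with kernel sizes 3/5/7/9 (the SPC step), computes SE weights for each branch, normalizes them with a softmax across the four branches, and then reweights and reassembles the feature map.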


# ---------------------------------------
# Paper: EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural Network (AICV 2021)
# GitHub: https://github.com/murufeng/EPSANet
# ---------------------------------------
import torch  
from torch import nn  
  
  
def conv(in_planes, out_planes, kernel_size=3, stride=1, padding=1, dilation=1, groups=1):  
    """standard convolution with padding"""  
    return nn.Conv2d(in_planes, out_planes, kernel_size=kernel_size, stride=stride,  
                     padding=padding, dilation=dilation, groups=groups, bias=False)  
  
  
def conv1x1(in_planes, out_planes, stride=1):  
    """1x1 convolution"""  
    return nn.Conv2d(in_planes, out_planes, kernel_size=1, stride=stride, bias=False)  
  
  
class SEWeightModule(nn.Module):
    """Squeeze-and-Excitation weights: global average pooling followed by a
    1x1-conv bottleneck and a sigmoid, producing one weight per channel."""

    def __init__(self, channels, reduction=16):  
        super(SEWeightModule, self).__init__()  
        self.avg_pool = nn.AdaptiveAvgPool2d(1)  
        self.fc1 = nn.Conv2d(channels, channels // reduction, kernel_size=1, padding=0)  
        self.relu = nn.ReLU(inplace=True)  
        self.fc2 = nn.Conv2d(channels // reduction, channels, kernel_size=1, padding=0)  
        self.sigmoid = nn.Sigmoid()  
  
    def forward(self, x):  
        out = self.avg_pool(x)  
        out = self.fc1(out)  
        out = self.relu(out)  
        out = self.fc2(out)  
        weight = self.sigmoid(out)  
  
        return weight  
  
  
class PSAModule(nn.Module):
    """Pyramid Squeeze Attention: multi-scale group convolutions (SPC),
    per-branch SE weights, and a softmax across branches for recalibration."""

    def __init__(self, inplans, planes, conv_kernels=[3, 5, 7, 9], stride=1, conv_groups=[1, 4, 8, 16]):  
        super(PSAModule, self).__init__()  
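        # SPC: four parallel branches with kernel sizes 3/5/7/9; larger
        # kernels use more groups to keep the extra parameters and FLOPs modest.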
        self.conv_1 = conv(inplans, planes // 4, kernel_size=conv_kernels[0], padding=conv_kernels[0] // 2,  
                           stride=stride, groups=conv_groups[0])  
        self.conv_2 = conv(inplans, planes // 4, kernel_size=conv_kernels[1], padding=conv_kernels[1] // 2,  
                           stride=stride, groups=conv_groups[1])  
        self.conv_3 = conv(inplans, planes // 4, kernel_size=conv_kernels[2], padding=conv_kernels[2] // 2,  
                           stride=stride, groups=conv_groups[2])  
        self.conv_4 = conv(inplans, planes // 4, kernel_size=conv_kernels[3], padding=conv_kernels[3] // 2,  
                           stride=stride, groups=conv_groups[3])  
        self.se = SEWeightModule(planes // 4)  
        self.split_channel = planes // 4  
        self.softmax = nn.Softmax(dim=1)  
  
    def forward(self, x):  
        batch_size = x.shape[0]  
        x1 = self.conv_1(x)  
        x2 = self.conv_2(x)  
        x3 = self.conv_3(x)  
        x4 = self.conv_4(x)  
  
        # Stack the four branch outputs along a scale axis: (B, 4, C/4, H, W)
        feats = torch.cat((x1, x2, x3, x4), dim=1)
        feats = feats.view(batch_size, 4, self.split_channel, feats.shape[2], feats.shape[3])
  
        # Per-branch SE weights, then a softmax across the four branches so
        # the scale weights for each channel compete with one another.
        x1_se = self.se(x1)
        x2_se = self.se(x2)
        x3_se = self.se(x3)
        x4_se = self.se(x4)

        x_se = torch.cat((x1_se, x2_se, x3_se, x4_se), dim=1)
        attention_vectors = x_se.view(batch_size, 4, self.split_channel, 1, 1)
        attention_vectors = self.softmax(attention_vectors)
        # Reweight each scale branch, then flatten back to (B, C, H, W).
        feats_weight = feats * attention_vectors
        for i in range(4):
            x_se_weight_fp = feats_weight[:, i, :, :]
            if i == 0:
                out = x_se_weight_fp
            else:
                # Note: prepending reverses the branch order in the output
                # channels (x4..x1); this matches the released code.
                out = torch.cat((x_se_weight_fp, out), 1)

        return out
  
  
# Quick shape check: input (N, C, H, W) -> output (N, C, H, W)
if __name__ == '__main__':
    x = torch.randn(3, 64, 32, 32)
    psa = PSAModule(inplans=64, planes=64)
    out = psa(x)
    print(out.shape)  # torch.Size([3, 64, 32, 32])
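
For context on how the block is deployed: the paper builds EPSANet by replacing the 3x3 convolution inside a ResNet bottleneck with the PSA module. The sketch below is a minimal illustration of that wiring, reusing conv1x1 and PSAModule from above; the class name EPSABottleneck and the exact layer layout here are assumptions for illustration, not the repository's verbatim code.


class EPSABottleneck(nn.Module):
    """Minimal sketch: ResNet bottleneck with PSA in place of the 3x3 conv."""
    expansion = 4

    def __init__(self, inplanes, planes, stride=1, downsample=None):
        super(EPSABottleneck, self).__init__()
        self.conv1 = conv1x1(inplanes, planes)                 # reduce channels
        self.bn1 = nn.BatchNorm2d(planes)
        self.psa = PSAModule(planes, planes, stride=stride)    # replaces 3x3 conv
        self.bn2 = nn.BatchNorm2d(planes)
        self.conv3 = conv1x1(planes, planes * self.expansion)  # expand channels
        self.bn3 = nn.BatchNorm2d(planes * self.expansion)
        self.relu = nn.ReLU(inplace=True)
        self.downsample = downsample

    def forward(self, x):
        identity = x
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.relu(self.bn2(self.psa(out)))
        out = self.bn3(self.conv3(out))
        if self.downsample is not None:  # project identity when shapes differ
            identity = self.downsample(x)
        return self.relu(out + identity)


# Example shape check: 64 -> 256 output channels, so the identity branch
# needs a 1x1 projection, as in a standard ResNet.
block = EPSABottleneck(64, 64,
                       downsample=nn.Sequential(conv1x1(64, 256),
                                                nn.BatchNorm2d(256)))
print(block(torch.randn(3, 64, 32, 32)).shape)  # torch.Size([3, 256, 32, 32])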
    

Original article: https://blog.csdn.net/qq_52964132/article/details/145248230
