李宏毅机器学习2022-HW8-Anomaly Detection

🕗 发布于 2024-10-05 22:03 机器学习 人工智能

文章目录

Task
Baseline
Report
- Question2
Code Link

Task

异常检测Anomaly Detection

在这里插入图片描述

将data经过Encoder，在经过Decoder，根据输入和输出的差距来判断异常图像。training data是100000张人脸照片，testing data有大约10000张跟training data相同分布的人脸照片(label 0)，还有10000张不同分布的照片(anomaly, label 1)，每张照片都是(64,64,3)，.npy file

以训练集的前三张照片为例，auto-encoder的输入和输出如下：

在这里插入图片描述

Baseline

Auto-encoder model一共有五种模型

fcn: fully-connected network
cnn: convolutional network
VAE
Resnet
Multi-encoder autoencoder
- encoder(fcn+fcn+fcn)+decoder(fcn)
- encoder(cnn+cnn+cnn)+decoder(cnn)
- encoder(fcn+fcn+conv)+decoder(fcn)

通过FCN+调节超参数的方式可以轻易的达到strong，Resnet也是但是Multi-encoder的方式表现并不好，也许是我处理方式有问题，具体代码可以参考GitHub中的文件

Report

Question2

Train a fully connected autoencoder and adjust at least two different element of the latent representation. Show your model architecture, plot out the original image, the reconstructed images for each adjustment and describe the differences.

import matplotlib.pyplot as plt
# sample = train_dataset[random.randint(0,100000)]
sample = train_dataset[0]
print("sample shape:{}".format(sample.size()))
sample = sample.reshape(1,3,64,64)

model.eval()
with torch.no_grad():
    img = sample.cuda()
            
    # 只调整fcn中的latent representation的其中两维，其他模型都是正常输出
    if model_type in ['res']:
        output = model(img)
        output = decoder(output)
        print("res output shape:{}".format(output.size()))
        output = output[0] # 第一个重建图像，当然只有一个图像
        
    if model_type in ['fcn']:
        img = img.reshape(img.shape[0], -1)
        x = model.encoder(img)
        x[0][2] = x[0][2]*3
        output = model.decoder(x)
        print("fcn output shape:{}".format(output.size()))
        output = output.reshape(3,64,64)
        
    if model_type in ['vae']:
        output = model(img)
        print("vae output shape:{}".format(output.size()))
        output = output[0][0] # output[0]是重建后的图像，output[0][0]重建后的第一个图像
        
    if model_type in ['cnn']:
        output = model(img)[0]
        
    print("output shape:{}".format(output.size()))
       
sample = sample.reshape(3,64,64)   

# 创建画布
fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(5, 5))

# plt sample image
axes[0].imshow(transforms.ToPILImage()((sample+1)/2)) #imshow的输入(H,W,C)
axes[0].axis('off')
axes[0].annotate('sample input', xy=(0.5, -0.15), xycoords='axes fraction',ha='center', va='center')
# plt output image
axes[1].imshow(transforms.ToPILImage()((output+1)/2))
axes[1].axis('off')
axes[1].annotate('sample output', xy=(0.5, -0.15), xycoords='axes fraction',ha='center', va='center')

plt.show()

在这里插入图片描述

Code Link

具体代码在Github

原文地址：https://blog.csdn.net/qq_42875127/article/details/142602973

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：CSS中height设置100vh和100%的区别
下一篇：点击按钮提示气泡信息（Toast）

linux信号 | 学习信号四步走 | 透析信号是如何被处理的？
linux信号的处理
阅读更多2024-10-06
No.2 笔记 | 网络安全攻防：PC、CS工具与移动应用分析
Cobalt Strike是一款基于Java的高级渗透测试工具,支持多人协同操作。它被称为"线上多人运动平台",功能强大且灵活。通过学习PC端和移动端的恶意程序利用技术,以及Cob
阅读更多2024-10-06
Linux 传输层UDP
表示UDP数据报文的长度，包括头部和数据部分，占用2个字节。由于UDP报头长度固定为8字节，因此实际UDP报文的数据长度最大为65527字节。Udp协议首部中有一个16位的最大长度，也就是说一个Udp
阅读更多2024-10-06
【Codeforces】CF 2007 E
树形结构 #贪心 #数学。
阅读更多2024-10-06
如何在电脑上浏览手机界面
如何在电脑上浏览手机界面
阅读更多2024-10-06
MAC备忘录空白解决方案
取消勾选同步此MAC后再次勾选，然后点击完成即可。打开icloud->备忘录。
阅读更多2024-10-06
Spring Boot ⽇志
我们可以通过⽇志记录这个系统的运⾏状态，对数据进⾏分析, 设置不同的规则, 超过阈值时进⾏报警。如果我们的⽇志都放在⼀个⽂件中, 随着项⽬的运⾏, ⽇志⽂件会越来越⼤, 需要对⽇志⽂件进⾏分割。2.E
阅读更多2024-10-06
如何解决 MySQL ERROR 1040 (08004): Too many connections ?
我们会经常遇到一个错误，特别是在高流量系统中，error 1040 (08004): Too many connections，让我们详细探讨此错误，了解其原因，并给出可能的解决方案。
阅读更多2024-10-06
【Qt】Qt学习笔记(一)：Qt界面初识
Qt 是一个跨平台应用程序和 UI 开发框架。使用 Qt 您只需一次性开发应用程序，无须重新编写源代码，便可跨不同桌面和嵌入式操作系统部署这些应用程序。Qt Creator是跨平台的Qt集成开发环境。
阅读更多2024-10-06
Spring Boot中获取application.yml中属性的几种方式
在Spring Boot应用程序中，可以通过多种方式从文件中获取配置属性。
阅读更多2024-10-06

李宏毅机器学习2022-HW8-Anomaly Detection

文章目录

Task

Baseline

Report

Question2

Code Link

相关文章