【nnUNet V2系列】nnUNet V2在Ubuntu下安装调试篇

🕗 发布于 2024-07-24 17:46 健康医疗 深度学习 pytorch centos 图像处理

安装之前网上很多教程，很多是nnUNet V1的安装过程，有的V1和V2混在一起讲解，导致V1的转化指令用到V2中，产生不少误解。这篇是针对V2整理出来的安装过程，有什么不妥之处请指出会及时修改。

1. 创建虚拟环境

conda create -n nnUNetV2 python=3.9
2. 激活虚拟环境

conda activate nnUNetV2
3. 输入nvidia-smi查询CUDA的版本号（如服务器是CUDA12.1），然后选择不超过此CUDA的torch版本（CUDA11.8）进行安装：

conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia

安装成功如下：

测试：

python

import torch

print(torch.__version__) 输出：版本号

print(print(torch.cuda.is_available()) 输出为：True

print(torch.backends.cudnn.version()) 输出

4. 最后，参考官方文档安装nnUNet V2程序：

git clone https://github.com/MIC-DKFZ/nnUNet.git
cd nnUNet
pip install -e .
注意：别丢掉小圆点。
【可选项】如果下载不方便，这里提供下载好的压缩包，直接解压包【提取码:x2FK】从压缩包里安装：
cd nnUNet 
pip install -e .
【可选项】安装hiddenlayer，让nnU-net可以生成网络拓扑图，指令如下（用不到可以不装）：
pip install --upgrade git+https://github.com/FabianIsensee/hiddenlayer.git

5.修改环境变量，可以在.bashrc(XXX：自己的用户名)中最下面添加：

export nnUNet_raw="/home/xxx/nnUNetV2-master/DATASET/nnUNet_raw"
export nnUNet_preprocessed="/home/xxx/nnUNetV2-master/DATASET/nnUNet_preprocessed"
export nnUNet_results="/home/xxx/nnUNetV2-master/DATASET/nnUNet_trained_models"

然后source .bashrc。这里直接修改的程序不会影响环境或者影响V1版本，修改nnunetv2文件下的paths.py程序：

6.需要下载官网数据，登录Medical Segmentation Decathlon的谷歌网盘。

7.由于nnUNet V1跟nnUNet V2版有一些差别，需要进行数据转换：

nnUNetv2_convert_MSD_dataset -i /home/XXX/nnUNetV2-master/DATASET/nnUNet_raw/Task04_Hippocampus -overwrite_id 004

文件名由Task04_Hippocampus变成Dataset004_Hippocampus，另外就是dataset.json的变化，主要是V2不再需要记录样本名字和标签的记录方式发生了变化：

别V2和V1操作别搞混了，有的博客上写用nnUNet_convert_decathlon_task，这个是V1版本用的；然后，要想给V2用还需要在经过一次转换：

nnUNetv2_convert_old_nnUNet_dataset Task004_Hippocampus Dataset004_Hippocampus

8. 对样本进行预处理:

模版：nnUNetv2_plan_and_preprocess -d DATASET_ID --verify_dataset_integrity

nnUNetv2_plan_and_preprocess -d 004 --verify_dataset_integrity
注：--verify_dataset_integrity首次需要校验之后就不需要。

预处理的结果：

发现DATASET的子目录nnUNet_preprocessed生成预处理的文件：

9. 训练nnUNet模型:

模版：nnUNetv2_train DATASET_NAME_OR_ID UNET_CONFIGURATION FOLD --val --npz

nnUNetv2_train 4 3d_fullres 0

DATASET_NAME_OR_ID：指定应在哪个数据集上进行训练；

UNET_CONFIGURATION：是一个字符串，用于标识请求的 U-Net 配置（默认值：2d、3d_fullres、3d_lowres、3d_cascade_lowres）；

FOLD：指定训练 5 折交叉验证的哪个折叠。

注：nnU-Net 每 50 个周期存储一个模型权重checkpoint。如果需要继续之前的训练，只需在训练命令中添加 --c 即可。

这里为了做测试大概跑了100epochs，默认训练周期为1000次，可以搜一下nnUNetTrainer.py中改为self.num_epochs = 100。

插播一下棘手的问题：

刚开始训练遇到一个错误，困扰了一天：

1）

RuntimeError: One or more background workers are no longer alive. Exiting. Please check the print statements above for the actual error message

解决：

重新安装一下cuda，在pip install -e .的时候，系统有时会重新安装torch导致它从gpu版变成CPU版，先测试一下torch.is_available是否为True，如果不是建议重新装一下torch：

conda install pytorch torchvision torchaudio pytorch-cuda=xx.x -c pytorch -c nvidia （具体你的cuda版本根据自己系统而定）

确认torch正常之后，可以用如下命令训练：

nnUNet_n_proc_DA=0 CUDA_VISIBLE_DEVICES=0 nnUNetv2_train 4 3d_fullres 0 -device cuda

2）

/tmp/tmpzj43o3qe/main.c:354:3: note: use option -std=c99 or -std=gnu99 to compile your code
/tmp/tmpzj43o3qe/main.c: In function ‘list_to_cuuint32_array’:
/tmp/tmprdfpoq6h/main.c: In function ‘list_to_cuuint64_array’:
/tmp/tmpzj43o3qe/main.c:365:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmprdfpoq6h/main.c:354:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmprdfpoq6h/main.c:354:3: note: use option -std=c99 or -std=gnu99 to compile your code
/tmp/tmprdfpoq6h/main.c: In function ‘list_to_cuuint32_array’:
/tmp/tmprdfpoq6h/main.c:365:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpdlxf7jri/main.c: In function ‘list_to_cuuint64_array’:
/tmp/tmpdlxf7jri/main.c:354:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpdlxf7jri/main.c:354:3: note: use option -std=c99 or -std=gnu99 to compile your code
/tmp/tmpdlxf7jri/main.c: In function ‘list_to_cuuint32_array’:
/tmp/tmpdlxf7jri/main.c:365:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpb2y0zzge/main.c: In function ‘list_to_cuuint64_array’:
/tmp/tmpb2y0zzge/main.c:354:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpb2y0zzge/main.c:354:3: note: use option -std=c99 or -std=gnu99 to compile your code
/tmp/tmpb2y0zzge/main.c: In function ‘list_to_cuuint32_array’:
/tmp/tmpb2y0zzge/main.c:365:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpgv8r5a3m/main.c: In function ‘list_to_cuuint64_array’:
/tmp/tmpgv8r5a3m/main.c:354:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpgv8r5a3m/main.c:354:3: note: use option -std=c99 or -std=gnu99 to compile your code
/tmp/tmpgv8r5a3m/main.c: In function ‘list_to_cuuint32_array’:
/tmp/tmpgv8r5a3m/main.c:365:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpatkm0ctm/main.c: In function ‘list_to_cuuint64_array’:
/tmp/tmpatkm0ctm/main.c:354:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {
   ^
/tmp/tmpatkm0ctm/main.c:354:3: note: use option -std=c99 or -std=gnu99 to compile your code
/tmp/tmpatkm0ctm/main.c: In function ‘list_to_cuuint32_array’:
/tmp/tmpatkm0ctm/main.c:365:3: error: ‘for’ loop initial declarations are only allowed in C99 mode
   for (Py_ssize_t i = 0; i < len; i++) {

解决方案：

可以先试试这个方案：

pip install triton==2.1.0

如果问题依旧，再修改如下：

主要是由于服务器的编译器默认使用了较旧的C标准，不支持在for循环中声明变量。

从/home/XXX/anaconda3/envs/nnUNetV2/lib/python3.9/site-packages/triton/common/找到build.py修改如下：

10. 成功运行大概如下，随便截了几个图：

大功告成，可以开心的训练你的模型了！对你有帮助的，请点个赞！

2.nnUNet_v2（Linux）_nnunetv2-CSDN博客

3.在Windows上实现nnU-Net v2的环境配置_nnunetv2-CSDN博客

4.论文解读- nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation（附实现教程）_nnunet self adaptinng-CSDN博客

原文地址：https://blog.csdn.net/qqbb1987/article/details/140553160

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：一个打印问题引发的思考(js中变量创建的流程以及引发的错误)
下一篇：Git下载安装

Java学习，基本数据类型
System.out.println("最小值：Double.MIN_VALUE=" + Double.MIN_VALUE);System.out.println("最小
阅读更多2024-11-17
创建第一个react项目
通过以上步骤，你已经成功创建并运行了你的第一个React项目。接下来，你可以继续探索React的更多功能，编写更复杂的组件和应用程序。希望这个教程对你有所帮助！如果有任何问题，欢迎随时提问。参考资料R
阅读更多2024-11-17
从零开始的c++之旅——二叉搜索树
这与之前实现的二叉树类似，只不过用上了模板跟构造函数，因为构造函数我们在后面需要用来生成节点。K _key;:_key(key){}//这里也能体现封装思想，不管我们如何实现的类此处我们只需定义成No
阅读更多2024-11-17
c/c++内存管理
int main()// new/delete 和 malloc/free最大区别是 new/delete对于【自定义类型】除了开空间还会调用构造函数和析构函数free(p1);delete p2;/
阅读更多2024-11-17
1、PyTorch介绍与张量的创建
【代码】1、PyTorch介绍与张量的创建。
阅读更多2024-11-17
‌REST风格（Representational State Transfer）
REST风格的核心思想是将Web应用程序的功能作为资源来表示，使用统一的标识符（URI）来对这些资源进行操作，并通过HTTP协议（如GET、POST、PUT、DELETE等）来定义对这些资源的操作。‌
阅读更多2024-11-17
软件测试 —— 自动化基础
自动化是指自动的代替人的行为完成操作，自动化在生活中可以说是随处可见，如：自动洒水机、自动洗手液等，这些生活中的自动案例有效的减少了我们人力的消耗，同时也提高了我们的生活质量，在我们软件中的自动化测试
阅读更多2024-11-17
Python爬虫下载新闻，Flask展现新闻（2）
Python爬虫下载新闻和Flask展现新闻的主要技术
阅读更多2024-11-17
【CSS in Depth 2 精译_057】第九章 CSS 的模块化与作用域 + 9.1 CSS 模块的定义（上）
本篇为《CSS in Depth》全新第2版9.1小节内容的上篇，主要介绍了 CSS 模块化的产生背景及相关概念，并结合上一节层叠图层（cascade layer）的知识，通过一个简单的 messag
阅读更多2024-11-17
分布式事务seata基于docker安装和项目集成seata
分布式系统节点通过网络连接，一定会出现分区问题（P）当分区出现时,系统的一致性和可用性就无法同时满足cp-->不同节点的角色不同ap-->不同节点的角色相同。
阅读更多2024-11-17

【nnUNet V2系列】nnUNet V2在Ubuntu下安装调试篇

相关文章