MLU370-M8运行llama3.2-11B手册

🕗 发布于 2024-10-11 03:34 人工智能 langchain 语言模型 python 深度学习

文章目录

前言
一、平台环境准备
二、模型下载
三、环境安装
四、代码修改
五、运行代码

前言

Llama 3.2-Vision多模态大型语言模型（LLM）集合是11B和90B大小（文本+图像输入/文本输出）的预训练和指令调优图像推理生成模型的集合。Llama 3.2-Vision指令调优模型针对视觉识别、图像推理、字幕和回答有关图像的一般问题进行了优化。在常见的行业基准上，这些模型的表现优于许多可用的开源和封闭式多模式模型。

一、平台环境准备

卡选择：MLU370系列
驱动选择：5.10.29以上
镜像选择：SDK版本>v24.0 pytorch >= 2.1
本次操作镜像如下：
cambricon_pytorch_container:v24.08.02-torch2.4.0-catch1.22.1-ubuntu22.04-py310

二、模型下载

git-lfs clone https://www.modelscope.cn/LLM-Research/Llama-3.2-11B-Vision-Instruct.git

三、环境安装

pip install transformers pillow accelerate

四、代码修改

本次操作参考链接：https://www.modelscope.cn/models/LLM-Research/Llama-3.2-11B-Vision-Instruct/files

1.在运行得py文件头处添加

import torch_mlu
from torch_mlu.utils.model_transfer import transfer

2.将代码中torch_dtype=torch,bfloat16 改成torch_dtype=torch.float16

五、运行代码

import torch_mlu
from torch_mlu.utils.model_transfer import transfer
import requests
import torch
from PIL import Image
from transformers import MllamaForConditionalGeneration, AutoProcessor

model_id = "/workspace/volume/guojunceshi2/Llama-3.2-11B-Vision-Instruct"

model = MllamaForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)

url = "rabbit.jpg"
image = Image.open(url)

messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": "If I had to write a haiku for this one, it would be: "}
    ]}
]
input_text = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(image, input_text, return_tensors="pt").to(model.device)

output = model.generate(**inputs, max_new_tokens=500)
print(processor.decode(output[0]))

输入：请添加图片描述
问：如果我必须为这个图片想一首诗歌：
答：
在阳光照耀的绿色田野里，
一只小白兔在路上蹦蹦跳跳。
胡须抽动，耳朵如此明亮，
它在晨曦中翩翩起舞。
它的皮毛像刚下的雪一样白，
它欢快地跳跃着，你不知道吗？
在它的眼中，有一丝喜悦，
天真、狂野和自由的象征。
每一次跳跃，都会留下足迹，
幸福，没有一次失败。
在这宁静祥和的场景中，
白兔的喜悦是真实的。
所以，让我们珍惜这一刻的恩典，
沐浴在它美丽的脸庞上。
因为在其简单性中，我们发现，
提醒人们与大自然的设计和谐相处
…

Thu Oct 10 22:36:48 2024
+------------------------------------------------------------------------------+
| CNMON v5.10.29                                               Driver v5.10.29 |
+-------------------------------+----------------------+-----------------------+
| Card  VF  Name       Firmware |               Bus-Id | Util        Ecc-Error |
| Fan   Temp      Pwr:Usage/Cap |         Memory-Usage | Mode     Compute-Mode |
|===============================+======================+=======================|
| 0     /   MLU370-M8    v1.1.4 |         0000:49:00.0 | 0%                  0 |
|  0%   31C         64 W/ 300 W | 13578 MiB/ 42396 MiB | FULL          Default |
+-------------------------------+----------------------+-----------------------+
| 1     /   MLU370-M8    v1.1.4 |         0000:52:00.0 | 61%                 0 |
|  0%   30C         68 W/ 300 W | 11762 MiB/ 42396 MiB | FULL          Default |
+-------------------------------+----------------------+-----------------------+

+------------------------------------------------------------------------------+
| Processes:                                                                   |
|  Card  MI  PID     Command Line                             MLU Memory Usage |
|==============================================================================|
|  0     /   27318   python                                          13504 MiB |
|  1     /   27318   python                                          11673 MiB |
+------------------------------------------------------------------------------+

(pytorch) root@notebook-llama32-1010-221744-kgtrtz-notebook-0:/workspace/volume/guojunceshi2# python demo.py 
/torch/venv3/pytorch/lib/python3.10/site-packages/torch_mlu/utils/model_transfer/__init__.py:2: FutureWarning: `torch_mlu.utils.model_transfer` is deprecated. Please use `torch_mlu.utils.gpu_migration` instead.
  warnings.warn("`torch_mlu.utils.model_transfer` is deprecated. "
/torch/venv3/pytorch/lib/python3.10/site-packages/torch_mlu/mlu/__init__.py:107: UserWarning: Linear memory is not supported on this device. Falling back to common memory. (Triggered internally at /catch/torch_mlu/csrc/framework/core/caching_allocator.cpp:527.)
  torch_mlu._MLUC._mlu_init()
[WARNING][/catch/torch_mlu/csrc/utils/version.cpp:133][operator()][process:23313][thread:140636830436224]: Cambricon NEUWARE minimum version requirements not met! Require DRIVER minimum verion is 6.1.3-1, but current version is 5.10.29-1
/torch/venv3/pytorch/lib/python3.10/site-packages/accelerate/utils/modeling.py:1169: DeprecationWarning: The 'warn' method is deprecated, use 'warning' instead
  logger.warn(
The model weights are not tied. Please use the `tie_weights` method before using the `infer_auto_device` function.
Loading checkpoint shards: 100%|███████████████████████████████████████| 5/5 [00:05<00:00,  1.18s/it]
/torch/venv3/pytorch/lib/python3.10/site-packages/transformers/models/mllama/modeling_mllama.py:2237: UserWarning: Casting input of dtype int64 to int32, maybe overflow! (Triggered internally at /catch/torch_mlu/csrc/aten/utils/cnnl_util.cpp:128.)
  position_ids.masked_fill_(attention_mask == 0, 1)
[2024-10-10 22:35:37] [CNNL] [Warning]:[cnnlGetConvolutionForwardAlgorithm] is deprecated and will be removed in the future release. See cnnlFindConvolutionForwardAlgorithm() API for replacement.
[2024-10-10 22:35:37] [CNNL] [Warning]:[cnnlLayerNormForward] is deprecated and will be removed in the future release,  please use [cnnlLayerNormForward_v2] instead.
/torch/venv3/pytorch/lib/python3.10/site-packages/torch_mlu/utils/gpu_migration/migration.py:244: DeprecationWarning: at::autocast::is_privateuseone_enabled() is deprecated. Please use at::autocast::is_autocast_enabled(at::kPrivateUse1) instead. (Triggered internally at /opt/python/cp310-cp310/lib/python3.10/site-packages/torch/include/ATen/autocast_mode.h:125.)
  return fn(*args, **kwargs)
/torch/venv3/pytorch/lib/python3.10/site-packages/transformers/models/mllama/modeling_mllama.py:1946: DeprecationWarning: at::autocast::is_privateuseone_enabled() is deprecated. Please use at::autocast::is_autocast_enabled(at::kPrivateUse1) instead. (Triggered internally at /opt/python/cp310-cp310/lib/python3.10/site-packages/torch/include/ATen/autocast_mode.h:125.)
  logits = self.lm_head(hidden_states[:, -num_logits_to_keep:, :]).float()
<|begin_of_text|><|begin_of_text|><|start_header_id|>user<|end_header_id|>

<|image|>如果我必须为这个图片想一首诗歌： <|eot_id|><|start_header_id|>assistant<|end_header_id|>

In fields of green, where sunbeams play,
A little white rabbit hops on its way.
With whiskers twitching, ears so bright,
It dances through the morning light.

Its fur as white as freshly fallen snow,
It leaps and bounds with joy, don't you know?
In its eyes, a spark of glee,
A symbol of innocence, wild and free.

With every hop, it leaves a trail,
Of happiness, without a single fail.
In this serene and peaceful scene,
The white rabbit's joy is truly seen.

So let us cherish this moment's grace,
And bask in the beauty of its face.
For in its simplicity, we find,
A reminder to live life in harmony with nature's design.<|eot_id|>
/opt/py3.10/lib/python3.10/tempfile.py:860: ResourceWarning: Implicitly cleaning up <TemporaryDirectory '/tmp/tmps9dk944a'>
  _warnings.warn(warn_message, ResourceWarning)

原文地址：https://blog.csdn.net/xiaojunjun200211/article/details/142833510

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：web端使用高德地图逆地理编码
下一篇：Oracle EBS中税务报告模块的财务流程概览

C++模板初阶速成
温馨提示：本篇文章依旧是c++速成系列的文章，但和以往有所不同的是，本篇文章带大家简单了解并能够学会使用模板。
阅读更多2024-10-13
热成像人像算法呈现方式!
通过设定变化阈值，比对前后两帧的同一位置像素点的差值，若满足条件，则将该像素点标记为“warm”pixel。因此，热红外成像可以不受光照条件的影响，且在图像中，人体由于温度较高，通常会比背景显得更亮。
阅读更多2024-10-13
证件照制作工具
声明：本站大部分素材均源于互联网收集整理，若侵犯第三方权益，请及时联系我们删除。
阅读更多2024-10-13
安当数据库透明加密组件：守护您的 ClickHouse 数据安全
ClickHouse是高性能列式数据库，安当推出透明加密组件保护数据安全，支持整库、表级、列级加密，透明不影响性能，提供高效灵活的数据安全解决方案。
阅读更多2024-10-13
ClickHouse 数据保护指南：从备份到迁移的全流程攻略
clickhouse-backup 是社区开源的一个 ClickHouse 备份工具，可用于实现数据迁移。其原理是先创建一个备份，然后从备份导入数据，类似 MySQL 的 mysqldump + SO
阅读更多2024-10-13
打假！制裁不良网站
众所周知，Chrome是一个很好用的浏览器。然后今天装了虚拟机Win 10，想着下个Chrome用来开发，随即发现了：看着好像很正经，对吧？但其实，这是一个盗版网站！包括MS Defender也说了
阅读更多2024-10-13
docker入门（二）之容器命令及私有仓库的部署（本地和harbor）
所以我们大致应该能明白守护进程是给需要在后台默默执行的程序。如redis，如果不添加守护进程，ctrl+c，前台进程结束，容器就停止了，显然不能接受。通过下图可以看到redis开始在正常运行在前台，假
阅读更多2024-10-13
图论day59|并查集理论基础、107.寻找存在的路径（卡码网）
图论day59|并查集理论基础、107.寻找存在的路径（卡码网）
阅读更多2024-10-13
Solon-Boot 与 SpringBoot 的概念差别
平常我们是拿 Solon 生态与 SpringBoot 生态作比较。而非 Solon-Boot（仅是功能模块）与 SpringBoot 生态，但这两名字太容易让人误解了。
阅读更多2024-10-13
STM32—W25Q64
2的24方=16777216 / 1024 = 16384KB /1024 =16MB 所以24位地址的最大寻址空间是16MB W25Q40到Q128，使用3字节24位的地址都是足够的
阅读更多2024-10-13