文生图模型

🕗 发布于 2024-10-19 15:08 文生图

CogView3/README_zh.md at main · THUDM/CogView3 · GitHub

快速开始

提示词优化

虽然 CogView3 系列模型都是通过长篇合成图像描述进行训练的，但我们强烈建议在文本生成图像之前，基于大语言模型（LLMs）进行提示词的重写操作，这将大大提高生成质量。

我们提供了一个示例脚本。我们建议您运行这个脚本，以实现对提示词对润色

python prompt_optimize.py --api_key "智谱AI API Key" --prompt {你的提示词} --base_url "https://open.bigmodel.cn/api/paas/v4" --model "glm-4-plus"

推理模型(Diffusers)

首先，确保从源代码安装diffusers库。

pip install git+https://github.com/huggingface/diffusers.git

接着，运行以下代码：

from diffusers import CogView3PlusPipeline
import torch

pipe = CogView3PlusPipeline.from_pretrained("THUDM/CogView3-Plus-3B", torch_dtype=torch.float16).to("cuda")

# Open it for reduce GPU memory usage
pipe.enable_model_cpu_offload()
pipe.vae.enable_slicing()
pipe.vae.enable_tiling()

prompt = "A vibrant cherry red sports car sits proudly under the gleaming sun, its polished exterior smooth and flawless, casting a mirror-like reflection. The car features a low, aerodynamic body, angular headlights that gaze forward like predatory eyes, and a set of black, high-gloss racing rims that contrast starkly with the red. A subtle hint of chrome embellishes the grille and exhaust, while the tinted windows suggest a luxurious and private interior. The scene conveys a sense of speed and elegance, the car appearing as if it's about to burst into a sprint along a coastal road, with the ocean's azure waves crashing in the background."
image = pipe(
    prompt=prompt,
    guidance_scale=7.0,
    num_images_per_prompt=1,
    num_inference_steps=50,
    width=1024,
    height=1024,
).images[0]

image.save("cogview3.png")

原文地址：https://blog.csdn.net/asd8705/article/details/143069759

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：【网络】HTTP协议及fiddler抓包工具（1）
下一篇：10月18日笔记(基于系统服务的权限提升)

Vue93 vue3 watch监视ref属性的说明
监视person时，不加.value是监视person对象。加了.value是监视person内部通过reactive生成的对象。'person的值变化了'//返回一个对象（常用）'sum的值变化了'
阅读更多2024-10-20
基于python+dj+mysql的音乐推荐系统网页设计
网站动态信息以歌曲的动态为主，如热门下载、热门搜索和新歌推荐等；本章以音乐网站项目为例，介绍Django在实际项目开发中的应用，该网站共分为6个功能模块分别是：网站首页、歌曲排行榜、歌曲播放、歌曲点评
阅读更多2024-10-20
【SQL|大数据|数据清洗|过滤】where条件中 “ != “ 和 “ NOT IN() ” 对NULL的处理
对数据进行清洗过滤的时候，NULL往往是一个很特殊的存在，对NULL值的存在通常有以下三种方式1、保留NULL2、过滤掉NULL3、将NULL替换为其他符合业务需求的默认常量下面是一些常用处理NULL
阅读更多2024-10-20
Turn-it：优化线材重构雕塑制造
Tune-It: Optimizing Wire Reconfiguration for Sculpture ManufacturingQIBING WU∗, Shandong University,
阅读更多2024-10-20
【火山引擎】AIGC图像风格化 | 风格实践 | PYTHON
【火山引擎】AIGC图像风格化 | 风格实践 | PYTHON
阅读更多2024-10-20
【C++】deque（空间适配器））
deque是一种双开口的"连续"空间的数据结构双开口的含义是：可以在头尾两端进行插入和删除操作，且时间复杂度为O(1)。与vector比较，头插效率高，不需要搬移元素；与list比
阅读更多2024-10-20
机器学习课程学习周报十七
本周报主要探讨了变分推理（Variational Inference）的基本思想及其在机器学习中的应用，详细介绍了证据下界（ELBO）的推导过程。接着，讨论了变分自编码器（VAE）的原理及其在生成模型
阅读更多2024-10-20
WPF中的Style如何使用
通常在 XAML 的资源部分（）中定义样式。
阅读更多2024-10-20
DISTINCT 去重
1. 单字段去重以表 student_course 和表 student 链接为例：SELECT * FROM student_course a INNER JOIN student b ON a.
阅读更多2024-10-20
压缩SQL Server 2014 数据库日志文件
一开始没有设置好，数据库的日志文件膨胀到了3个G。以下使用Sql语句压缩日志文件的方法。
阅读更多2024-10-20

文生图模型

CogView3/README_zh.md at main · THUDM/CogView3 · GitHub

快速开始

提示词优化

推理模型(Diffusers)

相关文章