Elasticsearch Vector Search: From Semantic Search to Image-to-Image Search Is Just One Step

🕗 Published 2024-11-23 11:51 · elasticsearch, search engine, AI

Continued

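The previous installment covered semantic search. This one moves on to image-to-image search, a scenario common in e-commerce: snap a photo to find a product. Implementing it is very similar to semantic search, and the code follows much the same logic and structure…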

Let's get started

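As before, head to ModelScope and pick a Vision Transformer model. Here I use vit-base-patch16-224. If you also want to try text-to-image search, you can choose the multimodal multi-modal_clip-vit-base-patch16_zh instead.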

D:\python2023>modelscope download --model AI-ModelScope/vit-base-patch16-224

Preparing the test data

[Image: the test pictures]

Running the code

from PIL import Image
from transformers import ViTImageProcessor, ViTModel
import torch
import time
from elasticsearch import Elasticsearch

# Initialize the model and the image preprocessor
# (ViTImageProcessor replaces the deprecated ViTFeatureExtractor)
MODEL_PATH = 'C:\\Users\\Administrator\\.cache\\modelscope\\hub\\AI-ModelScope\\vit-base-patch16-224'
image_processor = ViTImageProcessor.from_pretrained(MODEL_PATH)
model = ViTModel.from_pretrained(MODEL_PATH)


def extract_features(image_path):
    # Convert to RGB so grayscale or RGBA files do not break preprocessing
    image = Image.open(image_path).convert("RGB")
    inputs = image_processor(images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
        last_hidden_states = outputs.last_hidden_state
        # Use the CLS token output as the image's feature vector
        features = last_hidden_states[:, 0].squeeze()
    return features.numpy()

def store_image_features(image_id, features):
    doc = {
        'image_id': image_id,
        'features': features.tolist()
    }
    res = es.index(index=index_name, id=image_id, body=doc)
    print(res['result'])

def search_similar_images(query_image_path, top_k=5):
    query_features = extract_features(query_image_path)
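    # Brute-force scoring: every document is scored against the query vector
    body = {
        "size": top_k,
        "query": {
            "script_score": {
                "query": {"match_all": {}},
                "script": {
                    # +1.0 shifts cosine similarity from [-1, 1] to [0, 2],
                    # because Elasticsearch does not allow negative scores
                    "source": "cosineSimilarity(params.query_vector, 'features') + 1.0",
                    "params": {"query_vector": query_features.tolist()}
                }
            }
        }
    }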

    response = es.search(index=index_name, body=body)
    similar_images = [hit['_id'] for hit in response['hits']['hits']]
    return similar_images

# Assume we have a dict mapping image IDs to file paths
images = {
    'img_dog': 'D:\\1\\dog.jpg',
    'img_wolf': 'D:\\1\\wolf.jpg',
    'img_person': 'D:\\1\\person.jpg',
    'img_montain': 'D:\\1\\montain.jpg',
}
# Create the index through the ES API
es = Elasticsearch([{'scheme':'http','host':'192.168.72.128','port':9200}])
index_name = 'image_search'
body = {
    "mappings": {
        "properties": {
            "features": {
                "type": "dense_vector",
                "dims": 768  # feature vector dimension of the ViT base model
            }
        }
    }
}
if not es.indices.exists(index=index_name):
    es.indices.create(index=index_name, body=body)
# Store the image features in ES
for img_id, img_path in images.items():
    img_features = extract_features(img_path)
    store_image_features(img_id, img_features)

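# Give ES time to refresh the index (default interval: 1s) so the new documents are searchable
time.sleep(3)
# Use ES vector search to find the images most similar to a given image
similar_ids = search_similar_images("D:\\1\\test_dog.jpg")
print(f"Similar image IDs found: {similar_ids}")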
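
The `time.sleep(3)` call works because newly indexed documents only become searchable after an index refresh. A minimal sketch of an alternative is to request the refresh explicitly once indexing is done. The helper name `index_and_refresh` and the stand-in client are assumptions made for illustration so the snippet runs without a cluster; `index(..., document=...)` and `indices.refresh(index=...)` are the corresponding calls in recent versions of the official Python client, whereas the code above passes the document as `body=`.

```python
from types import SimpleNamespace


def index_and_refresh(es_client, index_name, docs):
    """Index every document, then force a refresh so they are searchable right away."""
    for doc_id, doc in docs.items():
        es_client.index(index=index_name, id=doc_id, document=doc)
    # An explicit refresh replaces the fixed sleep
    es_client.indices.refresh(index=index_name)


# Stand-in client (hypothetical) that only records calls
calls = []
fake_es = SimpleNamespace(
    index=lambda **kw: calls.append(("index", kw["id"])),
    indices=SimpleNamespace(
        refresh=lambda **kw: calls.append(("refresh", kw["index"]))
    ),
)

index_and_refresh(fake_es, "image_search", {"img_dog": {"features": [0.1, 0.2]}})
print(calls)
```

With a real cluster you would pass the `es` client from the script instead of `fake_es`. Forcing a refresh after every small batch can be costly on large indices, so this fits demo-sized data like the four images here.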

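The results show that the most similar image is the dog. The wolf looks like a dog, so it comes second. Next is the person, whose relevance is already quite low, and the least relevant is the landscape photo.
[Image: search output]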
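
To make the ranking less of a black box, here is a small pure-Python sketch of what the `cosineSimilarity(...) + 1.0` script computes for each document. The 3-dimensional vectors and their names are made up for illustration; the real ViT features have 768 dimensions.

```python
import math


def cosine_score(query_vector, doc_vector):
    """Cosine similarity shifted by +1.0, mirroring the script in the query."""
    dot = sum(q * d for q, d in zip(query_vector, doc_vector))
    norm_q = math.sqrt(sum(q * q for q in query_vector))
    norm_d = math.sqrt(sum(d * d for d in doc_vector))
    return dot / (norm_q * norm_d) + 1.0


# Toy vectors (hypothetical): same direction, orthogonal, and opposite to the query
query = [1.0, 0.0, 0.0]
docs = {
    "same_direction": [2.0, 0.0, 0.0],
    "orthogonal": [0.0, 3.0, 0.0],
    "opposite": [-1.0, 0.0, 0.0],
}

# Higher score means more similar, which is the order ES returns hits in
ranked = sorted(docs, key=lambda name: cosine_score(query, docs[name]), reverse=True)
print(ranked)
```

Cosine similarity only looks at the angle between vectors, not their length, which is why `same_direction` gets the top score even though it is twice as long as the query.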

A look at the index

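ES memory use for the image vectors is still modest, about the same as for the text data, mainly because both use 768-dimensional feature vectors. If search results are not accurate enough, you can move to higher-dimensional features (in practice, a model that outputs larger vectors), but ES memory use will grow. Conversely, when the images contain few distracting features, smaller feature vectors can still give good search results.
[Image: index statistics]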
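
As a rough back-of-the-envelope check on the dimension versus memory trade-off, the sketch below estimates raw vector storage. It assumes each vector component is stored as a 4-byte float and ignores index structures and other overhead, so treat the numbers as a lower bound. The function name and the one-million-image figure are hypothetical.

```python
def raw_vector_bytes(num_docs, dims, bytes_per_value=4):
    """Raw size of the vectors alone, assuming 4-byte float components."""
    return num_docs * dims * bytes_per_value


# Compare a few dimensions for a hypothetical catalog of one million images
for dims in (384, 768, 1024):
    mib = raw_vector_bytes(1_000_000, dims) / (1024 ** 2)
    print(f"{dims} dims, 1M images: about {mib:.0f} MiB of raw vectors")
```

The cost grows linearly with both the number of images and the vector dimension, so halving the dimension halves the raw vector footprint.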

Original article: https://blog.csdn.net/qq_39506978/article/details/143981433
