CV图像处理小工具——语义分割json生成检测框json

🕗 发布于 2024-11-07 07:12 图像处理 json 人工智能

语义分割json生成检测框json

import json
import os
from os import listdir, getcwd
from os.path import join
import os.path


rootdir = 'F:/dataset/'# 写自己存放图片的数据地址
input_dir = 'F:/dataset/labels_json/'
output_dir = 'F:/dataset/labels_box/'
def position(pos):
    # 该函数用来找出xmin,ymin,xmax,ymax即bbox包围框
    x = []
    y = []
    nums = len(pos)
    for i in range(nums):
        x.append(pos[i][0])
        y.append(pos[i][1])
    x_max = max(x)
    x_min = min(x)
    y_max = max(y)
    y_min = min(y)
    b = (float(x_min), float(x_max), float(y_min), float(y_max))
    return b




def convert_annotation(image_ids, input_dir, output_dir):
    for image_id in image_ids:
        print(image_id)
        input_file_path = os.path.join(input_dir, f"{image_id}.json")
        output_file_path = os.path.join(output_dir, f"{image_id}.json")

        try:
            with open(input_file_path, 'r') as load_f:
                load_dict = json.load(load_f)

            w = load_dict['imageWidth']
            h = load_dict['imageHeight']
            objects = load_dict['shapes']

            annotations = []  # 创建一个空列表来存储注解

            for obj in objects:
                # 去除可能的额外空格，并设置默认标签（如果需要）
                labels = obj['label'].strip()
                # 根据标签处理形状
                if labels in [
                    "class1", "class2",
                    "class3", "class4"
                ]:
                    pos = obj['points']
                    b = position(pos)
                    cls_id_mapping = {
                        "class1": 1,
                        "class2": 2,
                        "class3": 3,
                        "class4": 4,
                        
                    }
                    cls_id = cls_id_mapping[labels]
                    annotation = {
                        'label': labels,
                        'class_id': cls_id,
                        'bbox': b
                    }
                    annotations.append(annotation)

            output_dict = {
                "version": "5.0.2",
                "flags": {},
                "shapes": [],  # shapes数组为空，因为注解信息已经放在annotations中
                "imagePath": load_dict['imagePath'],
                "imageData": None,
                "imageHeight": h,
                "imageWidth": w,
                "annotations": annotations  # 添加注解列表
            }
            for shape in load_dict['shapes']:
                # 假设每个多边形都有四个点，我们可以直接取对角线上的两个点来定义矩形
                # 这里我们取第一个点和第三个点（或者您可以根据具体情况选择其他点对）
                objects = load_dict['shapes']
                for obj in objects:
                    # 去除可能的额外空格，并设置默认标签（如果需要）
                    labels = obj['label'].strip()
                    if labels in [
                       "class1", "class2",
                    "class3", "class4"
                    ]:
                        pos = obj['points']
                        b = position(pos)
                        rect_points = [
                            [b[0], b[3]],
                            [b[1], b[2]]
                            ]

                        new_shape = {
                        "label": labels,  # 保留原多边形的标签（或者您可以根据需要生成新的标签）
                        "points": rect_points,  # 使用调整后的点来表示矩形（但这里我们实际上只使用了两个点）
                        # 注意：由于我们简化了问题，并没有真正地使用四个点来定义一个完整的矩形
                        # 在实际应用中，您可能需要更精确地处理这个问题。
                        "group_id": None,  # 您可以根据需要设置这个字段的值
                        "shape_type": "rectangle",  # 指定形状类型为矩形
                        "flags": {}  # 保留空的flags字段（或者您可以根据需要填充它）
                    }
                        output_dict['shapes'].append(new_shape)

                # 写入新的JSON文件
            with open(output_file_path, 'w') as output_f:
                json.dump(output_dict, output_f, indent=4)

        except Exception as e:
            print(f"Error processing {input_file_path}: {e}")


def image_id(rootdir):
    a = []
    for parent, dirnames, filenames in os.walk(rootdir):
        for filename in filenames:
            # print(filename)
            if filename.endswith('.jpg'):
                filename = os.path.splitext(filename)[0]
            if filename.endswith('.jpeg'):
                filename = os.path.splitext(filename)[0]
            if filename.endswith('.JPG'):
                filename = os.path.splitext(filename)[0]
            if filename.endswith('.JPEG'):
                filename = os.path.splitext(filename)[0]
            a.append(filename)
    return a


names = image_id("F:/dataset/images/"),

for image_id in names:
    convert_annotation(image_id, input_dir, output_dir)

原文地址：https://blog.csdn.net/siyumiao_hbu/article/details/143560804

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：Java+Swing可视化图像处理软件
下一篇：golang学习2

SpringBoot框架助力欢迪迈手机商城快速开发
通过对欢迪迈手机商城的开发，让我深刻明白开发一个程序软件需要经历的流程，当确定要开发一个欢迪迈手机商城的程序时，我在开发期间，对其功能进行合理的需求分析，然后才是程序软件的功能的框架设计，数据库的实体
阅读更多2024-11-28
欢迪迈手机商城：SpringBoot框架的缓存机制
通过对欢迪迈手机商城的开发，让我深刻明白开发一个程序软件需要经历的流程，当确定要开发一个欢迪迈手机商城的程序时，我在开发期间，对其功能进行合理的需求分析，然后才是程序软件的功能的框架设计，数据库的实体
阅读更多2024-11-28
数据结构 ——— 归并排序算法的实现
递归分治的思路解决归并排序的逻辑
阅读更多2024-11-28
Linux iptables 命令详解
iptables是一个在Linux中的管理防火墙规则的命令行工具，它作为Linux内核的netfilter框架的一部分运行，以控制传入和传出的网络流量。Chains。
阅读更多2024-11-28
Spring 框架中AOP（面向切面编程）和 IoC（控制反转）
换句话说，Spring 框架负责对象的创建、初始化和管理，开发者只需要关心“菜单”（定义对象的功能），而不需要手动管理对象的生命周期。它的核心思想是将程序中的某些通用功能（如日志、事务管理、安全等）抽
阅读更多2024-11-28
【ArcGIS Pro实操第11期】经纬度数据转化成平面坐标数据
经纬度数据转化成平面坐标数据
阅读更多2024-11-28
使用MATLAB读取.ply点云图
使用MATLAB读取.ply点云图
阅读更多2024-11-28
火山引擎VeDI在AI+BI领域的演进与实践
在探讨ChatBI或AI+BI的未来发展方向与趋势时，有几个关键点值得关注。首先，垂类场景的深度挖掘将成为重要趋势。随着BI厂商与大模型结合能力的不断探索，市场逐渐认识到，在部分场景下，大模型加BI的
阅读更多2024-11-28
MATLAB中Simulink的基础知识
Simulink是MATLAB中的一种可视化仿真工具，是一种基于MATLAB的框图设计环境，是实现动态系统建模、仿真和分析的一个软件包，被广泛应用于线性系统、非线性系统、数字控制及数字信号处理的建模
阅读更多2024-11-28
网络安全-安全散列函数，信息摘要SHA-1，MD5原理
单向散列函数或者安全散列函数之所以重要，不仅在于消息认证(消息摘要。数据指纹)。还有数字签名（加强版的消息认证）和验证数据的完整性。常见的单向散列函数有MD5和SHA。
阅读更多2024-11-28

CV图像处理小工具——语义分割json生成检测框json

语义分割json生成检测框json

相关文章