Yolo-将coco数据集中的json文件转为txt且解决类别不连续问题

解决问题1.将coco数据集中，annotations的json文件，读取，进行转为ID保持一致的txt文件。2.解决COCO数据集中，类别不连续的问题方法from __future__ import print_functionimport os, sys, zipfileimport jsonclass_num = 0def convert(size, box):dw = 1. / (size

莫陌莫墨

1559人浏览 · 2020-11-07 14:06:30

莫陌莫墨 · 2020-11-07 14:06:30 发布

解决问题

1.将coco数据集中，annotations的json文件，读取，进行转为ID保持一致的txt文件。
2.解决COCO数据集中，类别不连续的问题

方法

from __future__ import print_function
import os, sys, zipfile
import json
 
class_num = 0
def convert(size, box):
    dw = 1. / (size[0])
    dh = 1. / (size[1])
    x = box[0] + box[2] / 2.0
    y = box[1] + box[3] / 2.0
    w = box[2]
    h = box[3]
 
    x = x * dw
    w = w * dw
    y = y * dh
    h = h * dh
    return (x, y, w, h)
 
 
json_file = '/Data/coco/annotations/instances_val2017.json'  # # Object Instance 类型的标注
 
data = json.load(open(json_file, 'r'))
ana_txt_save_path = "/Data/coco/test2017"  # 保存的路径
if not os.path.exists(ana_txt_save_path):
    os.makedirs(ana_txt_save_path)

index = 0
cat_id_map = {}
for img in data['images']:
    filename = img["file_name"]
    img_width = img["width"]
    img_height = img["height"]
    img_id = img["id"]
    ana_txt_name = filename.split(".")[0] + ".txt"  # 对应的txt名字，与jpg一致
    index +=1
    print(str(index) +'   '+str(ana_txt_name))
    f_txt = open(os.path.join(ana_txt_save_path, ana_txt_name), 'w')# 保存的文件



    for ann in data['annotations']:
        if ann['image_id'] == img_id:
            if ann['category_id'] not in cat_id_map:
                cat_id_map[ann['category_id']] = class_num
                class_num += 1
            box = convert((img_width, img_height), ann["bbox"])
            f_txt.write("%s %s %s %s %s\n" % (cat_id_map[ann['category_id']], box[0], box[1], box[2], box[3]))
        
    f_txt.close()
fileObject = open('/Data/coco/val_cat_id_map.txt', 'w') # 类别进行重新映射
for cat_id in cat_id_map:  
    fileObject.write(str(cat_id))
    fileObject.write('\n')
fileObject.close()

魔乐社区

魔乐社区（Modelers.cn) 是一个中立、公益的人工智能社区，提供人工智能工具、模型、数据的托管、展示与应用协同服务，为人工智能开发及爱好者搭建开放的学习交流平台。社区通过理事会方式运作，由全产业链共同建设、共同运营、共同享有，推动国产AI生态繁荣发展。

更多推荐

【计算机视觉】Pixel逐像素分类&Mask掩码分类理解摘要

魔乐社区

计算机视觉（opencv）实战三十二——CascadeClassifier 人脸微笑检测（摄像头）

本文从原理到实现，详细介绍了基于 OpenCV Haar 分类器的人脸与微笑检测：讲解了 Haar 特征和级联检测原理。对代码逐行拆解并解释参数含义。画出完整流程图，帮助理解执行过程。给出了常见问题和优化建议，甚至扩展到深度学习方法。这种方法简单、轻量、实时性好，非常适合入门和小型应用项目。但如果需要更高准确率和更强鲁棒性，建议使用深度学习检测器替代 Haar 分类器。