tensorflow-lite - YOLO 的输出是什么？

问题描述

我正在尝试使用 YOLO 来检测 Android 应用程序中的车牌。

所以我在 Google Colab 中训练了一个 YOLOv3 和一个 YOLOv4 模型。我使用Hunglc007 的精彩项目将这 2 个模型转换为 TensorFlow Lite，我还验证了它们是否正常工作并得到以下结果：

但是当我尝试了解模型的输出以在我的应用程序中对其进行调整时，我使用netron得到了这个：

当模型被训练为只检测一个对象时，为什么我有 2 个输出？

而为什么输出的格式是这样的，这[1,1,4]代表什么？

编辑

bboxes的代码可以在这里找到

boxes, scores, classes, valid_detections = tf.image.combined_non_max_suppression(
            boxes=tf.reshape(boxes, (tf.shape(boxes)[0], -1, 1, 4)),
            scores=tf.reshape(
                pred_conf, (tf.shape(pred_conf)[0], -1, tf.shape(pred_conf)[-1])),
            max_output_size_per_class=50,
            max_total_size=50,
            iou_threshold=FLAGS.iou,
            score_threshold=FLAGS.score
        )
        pred_bbox = [boxes.numpy(), scores.numpy(), classes.numpy(), valid_detections.numpy()]
        image = utils.draw_bbox(original_image, pred_bbox)
        # image = utils.draw_bbox(image_data*255, pred_bbox)
        image = Image.fromarray(image.astype(np.uint8))
        image.show()
        image = cv2.cvtColor(np.array(image), cv2.COLOR_BGR2RGB)
        cv2.imwrite(FLAGS.output + 'detection' + str(count) + '.png', image)

标签： tensorflow-liteyolo

这很明显。用于检测模型。通常，它应该至少给出 2 个输出：边界框和与边界框相关的类。因此，(1,1,4) 是边界框的 4 个结果。第一个数字 1 是根据您将图像提取到模型中的。由于您只有一个对象，因此第二个数字的输出为 1。此外，边界框的 YOLO 配置为 (x_center,y_center,width,height)

(1,1,1) 将是相同的，但现在 1 是您选择的类的标签。

tensorflow-lite - YOLO 的输出是什么？

问题描述

解决方案

推荐阅读