深度学习论文: YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles及其PyTorch实现

文章目录[隐藏]

1 概述
2 数据集(Dataset)
3 YOLO-Z
4 Results

深度学习论文: YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles及其PyTorch实现
YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles
PDF: https://arxiv.org/pdf/2112.11798.pdf
PyTorch代码: https://github.com/shanglianlm0525/CvPytorch
PyTorch代码: https://github.com/shanglianlm0525/PyTorch-Networks

1 概述

本研究探索了如何对YOLOv5进行修改，以提高其在检测较小目标时的性能，并在自动赛车中进行了特殊应用。为了实现这一点，作者研究了替换模型的某些结构会如何影响性能和推理时间。在这一过程中在不同的尺度上提出一系列的模型YOLO-Z，并得到高达6.9%的改善，相比原YOLOv5推理时间检测更小的目标时的成本就增加3ms。

2 数据集(Dataset)

用于自动驾驶赛车的一个带注释的圆锥数据集。总共有4类圆锥体(黄色、蓝色、橙色和大橙色)，接近4000张图像。
在这里插入图片描述

3 YOLO-Z

3-1 Backbone

The backbone of a model is the element dedicated to taking the input image and extracting feature maps from it.
尝试用2个Backbone替换YOLOv5中现有的Backbone。

ResNet50
按比例缩小了DenseNet

**实验表明：**对于小目标 DenseNet性能更好，增加的推理时间也相对较少，resnet性能较差。

3-2 Neck

当前的PAN-Net替换为bi-FPN。

3-3 其他修改

通过扩大Neck以适应额外的特征图来实现，也可以通过替换最低分辨率的特征图以适应新的特征图来实现。
在这里插入图片描述

4 Results

在这里插入图片描述

版权声明：本文为CSDN博主「mingo_敏」的原创文章，遵循CC 4.0 BY-SA版权协议，转载请附上原文出处链接及本声明。
原文链接：https://blog.csdn.net/shanglianlm/article/details/122301921

深度学习论文: YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles及其PyTorch实现

1 概述

2 数据集(Dataset)