文章目录[隐藏]

前言

在阅读本博客时，建议先阅读AFDetV1以及cornernet和centernet

AFDetV1在我之前讲解过:【3D 目标检测】AFDet: Anchor Free One Stage 3D Object Detection_JY.Wang_China的博客-CSDN博客
centernet和cornernet的链接在下述:

cornertnet: https://arxiv.org/abs/1808.01244v2

【2D 目标检测】CornerNet: Detecting Objects as Paired Keypoints_JY.Wang_China的博客-CSDN博客

centernet: https://arxiv.org/abs/1904.07850v2

一核心思路

AFDetV2承接AFDet的一大续作，是Waymo2021的冠军。与V1版本不同的地方主要在于：精简的3D feature extractor、一种improved RPN、新增的两个detect head(分别为IoU-aware confidence score prediction和Keypoints auxiliary supervision)。具体算法流程见下图所示:

所有的步骤分为:point Cloud Voxelization、3D Feature Extractor、Region Proposal Networks 和 anchor-free detector heads。

二核心步骤

2.1 Point Cloud Voxelization

与之前第一版不同的是，第二版开始关注提取特征的细节。体素化的方法比较常规，之后生成特征的时候用到GPU加速算法。具体细则还需阅读代码。

2.2 3D Feature Extractor

论文中采用3D conv来提取特征。然后在最后一层将z-axis与特征维度进行concatenate操作，得到的feature map。文章说只在z轴(也就是维度)进行下采样操作，但是看到和维度的stride也是2，可能在代码中与文章有所出入。

2.3 Region Proposal Network

本文采用Fig.2b中的结果来代替original RPN baseline。这有助于扩大对空间位置的receptive field，并引入了channel-wise attention和spatial-wise attention机制。改进的RPN结构在参数数量和计算成本与baseline相似的情况下提高了检测精度。

2.4 Anchor-free Head

除了AFDet中的五个sub-heads外，我们在AFDetV2中设计了两个新的sub-heads，以实现更好的准确性。AFDet和AFDetV2共有的5个sub-heads是the heatmap prediction head、the local offset regression head、the z-axis location regression head、the 3D object size regression head 和 the orientation regression head。新增2个sub-heads为IoU-aware confifidence score、prediction and keypoints auxiliary supervision。

1、Differences of the 5 sub-heads

对于heatmap head，我们通过设置Gaussian半径为2（可以理解为以为中心，向四周扩散的影响力递减。在作者的实验中用到的是Gaussian kernal），扩展了positive supervision的范围。