SECOND:Sparsely Embedded Convolutional Detection

1. SECOND & History paper:SECOND: Sparsely Embedded Convolutional Detection graph TB VoxelNet --> SECOND --> PointPillar Related paper: VoxelNet: stacked VFE(voxel feature encoding module) + 3D Conv RPN This paper: 3D Conv --> Sparse Conv --> improved sparse conv angle loss regression approach We apply sparse convolution in LiDAR-based object detection, thereby greatly increasing the speeds of training and inference. We propose an improved method of sparse convolution that allows it to run faster....

January 19, 2024 · 1 min · 153 words · lvsolo

VoxelNet Paper Reading

1. VoxelNet & History paper:VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection Related works: Several methods project point clouds into a perspective view and apply image-based feature extraction techniques.Other approaches rasterize point clouds into a 3D voxel grid and encode each voxel with handcrafted feature PointNet:an end-to-end deep neural network that learns point-wise features directly from point clouds,moving from hand-crafted features to learned features. RPN: Region Proposal Network This Paper: Structure: VFE layer: Voxel Feature Encoding layer enables inter-point interaction within a voxel; Pipeline: Divides point clouds into equally spaced 3D voxels and encodes each voxel with stacked VFE layers; 3D Conv RPN Advantages:This efficient algorithm benefits both from the sparse point structure and efficient parallel processing on the voxel grid....

January 19, 2024 · 3 min · 564 words · lvsolo

PointPillar Paper Reading

1. Point Pillar & History paper:PointPillars: Fast Encoders for Object Detection from Point Clouds graph TB PointNet --> Second --> PointPillar PointNet --> VoxelNet-->CenterPoint 2. pipeline (1) A feature encoder network that converts a point cloud to a sparse pseudoimage; (2) a 2D convolutional backbone to process the pseudo-image into high-level representation; (3) a detection head that detects and regresses 3D boxes. graph TB pts(points cloud) --> A[pts to pseudo-image] A --> B[2D CNN Net Backbone] B -->C[SSD Head] 2....

January 16, 2024 · 2 min · 295 words · lvsolo