A CUDA implementation of KDTree in PyTorch

Adapted from: KdTreeGPU

This repo is especially useful when the point cloud is very large (>100,000 points).

Currently, the KD-tree is built on the GPU with CUDA, while queries run on the CPU. We are working on a query function that runs on the CUDA device, which should be faster.
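
For readers unfamiliar with the data structure, here is a minimal pure-Python sketch of a KD-tree build followed by a nearest-neighbor query. It only illustrates the general technique. It is not this repo's CUDA build or its CPU query code, and the names `build` and `nearest` are hypothetical.

```python
import math

def build(points, depth=0):
    """Recursively build a KD-tree as nested tuples (point, left, right)."""
    if not points:
        return None
    axis = depth % len(points[0])            # cycle through the dimensions
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2                   # the median splits the set in half
    return (points[mid],
            build(points[:mid], depth + 1),
            build(points[mid + 1:], depth + 1))

def nearest(node, query, depth=0, best=None):
    """Return (distance, point) for the stored point closest to `query`."""
    if node is None:
        return best
    point, left, right = node
    d = math.dist(point, query)
    if best is None or d < best[0]:
        best = (d, point)
    axis = depth % len(query)
    diff = query[axis] - point[axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, query, depth + 1, best)
    # Visit the far side only if the splitting plane is closer than the best hit.
    if abs(diff) < best[0]:
        best = nearest(far, query, depth + 1, best)
    return best

tree = build([(2.0, 3.0), (5.0, 4.0), (9.0, 6.0),
              (4.0, 7.0), (8.0, 1.0), (7.0, 2.0)])
dist, point = nearest(tree, (9.0, 2.0))
print(point, dist)
```

The pruning test in `nearest` is what makes a tree query cheaper than a linear scan: whole subtrees are skipped when they cannot contain a closer point.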

Functions currently implemented:

  • nearest search (CPU)
  • knn search (CPU)
  • radius search (CPU)
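
As a rough guide to what the three query types compute, the sketch below gives brute-force reference versions in plain Python. These are not this library's functions: the names, the choice to return indices, and the inclusive radius boundary are all assumptions made for illustration. Check the scripts in test/perf/ for the actual interface.

```python
import math

def nearest_search(points, query):
    """Index of the single closest point (hypothetical reference version)."""
    return min(range(len(points)), key=lambda i: math.dist(points[i], query))

def knn_search(points, query, k):
    """Indices of the k closest points, nearest first."""
    order = sorted(range(len(points)), key=lambda i: math.dist(points[i], query))
    return order[:k]

def radius_search(points, query, radius):
    """Indices of all points within `radius` of the query (boundary included)."""
    return [i for i, p in enumerate(points) if math.dist(p, query) <= radius]

points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (3.0, 3.0)]
query = (0.9, 0.1)

print(nearest_search(points, query))      # closest single point
print(knn_search(points, query, 2))       # two closest points
print(radius_search(points, query, 1.0))  # everything within distance 1.0
```

A brute-force version like this is also handy as a correctness check when comparing against tree-based results on small inputs.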

NOTE: this repo is still under heavy development

build

build environment (other environments should also work):

  • torch == 1.8.0
  • nvcc == 10.2

there are generally two ways to build the library.

  1. build with cmake:
mkdir build && cd build

cmake .. \
-DCMAKE_PREFIX_PATH=`python -c 'import torch;print(torch.utils.cmake_prefix_path)'` \
-DCMAKE_CUDA_ARCHITECTURES=60 \
-DCUDA_TOOLKIT_ROOT_DIR=$CU102_CUDA_TOOLKIT_DIR
  2. build with setuptools:
TORCH_CUDA_ARCH_LIST="6.0+PTX" python setup.py develop
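
Both commands above hard-code compute capability 6.0. If your GPU differs, a small helper like the one below can derive the matching values. This is a sketch: `arch_flags` is a hypothetical name, and it assumes that substituting your own device's capability into `CMAKE_CUDA_ARCHITECTURES` and `TORCH_CUDA_ARCH_LIST` is appropriate for this repo.

```python
def arch_flags(major, minor):
    """Map a CUDA compute capability to the two build settings used above."""
    cmake_arch = f"{major}{minor}"        # value for -DCMAKE_CUDA_ARCHITECTURES
    torch_arch = f"{major}.{minor}+PTX"   # value for TORCH_CUDA_ARCH_LIST
    return cmake_arch, torch_arch

if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        torch = None
    if torch is not None and torch.cuda.is_available():
        # get_device_capability() returns (major, minor) for the current device
        major, minor = torch.cuda.get_device_capability()
        cmake_arch, torch_arch = arch_flags(major, minor)
        print(f"-DCMAKE_CUDA_ARCHITECTURES={cmake_arch}")
        print(f'TORCH_CUDA_ARCH_LIST="{torch_arch}"')
    else:
        print("No CUDA device visible to PyTorch; set the architecture by hand.")
```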

usage

please check the testing script in the test/perf/ folder.

benchmarking

nearest search

TODO

  • multiple trees memory conflict
  • remove all global variables such as d_verifyKdTreeError
  • template for other cases N > 32
  • CUDA query
  • cuda-tree does not own host memory; cpu-tree does not own cuda memory
  • host memory leak testing
  • support any number of points