基于神经网络6-Dof的3D重建技术.zip资源-CSDN文库

共213个文件

h：81个

py：69个

sh：12个

版权申诉

demo

深度学习

113 浏览量 2023-07-17 21:56:22 上传评论收藏 15.03MB ZIP 举报

"基于神经网络6-Dof的3D重建技术.zip"揭示了这是一个与计算机视觉和深度学习相关的项目，重点在于使用神经网络实现三维（3D）物体的六自由度（6-DoF）重建。6-DoF指的是物体在三维空间中的位置和旋转，包括沿x、y、z轴的平移以及绕这三个轴的旋转，这在机器人导航、虚拟现实和增强现实等领域具有广泛的应用。提到这是开发者自创的Demo，意味着它是一个可以实际运行和部署的示例程序，可能包含源代码、训练数据集、预训练模型以及相关的说明文档。这个项目的目标可能是展示如何利用深度学习技术，尤其是神经网络，来从二维图像或序列中恢复出物体的完整3D信息，并且具备6个自由度的精确定位。在中，"demo"表明这是一个演示性质的项目，可能包含简化版的算法实现，适合初学者理解和学习；"深度学习"标签则提示我们，这个项目的重点在于利用深度神经网络进行模型训练和预测，可能是通过卷积神经网络（CNNs）、循环神经网络（RNNs）或者更复杂的架构如U-Net或PointNet等。【压缩包子文件的文件名称列表】只有一个条目："基于神经网络6-Dof的3D重建技术"，这可能是项目的主要代码库或者包含所有资源的文件夹。通常，这样的压缩包内可能包含以下部分： 1. **源代码**：实现3D重建算法的Python脚本或其他编程语言文件，可能包括数据预处理、模型定义、训练、评估和推理的代码。 2. **数据集**：用于训练和验证模型的2D图像或3D点云数据，可能分为训练集、验证集和测试集。 3. **预训练模型**：已经训练好的神经网络模型权重文件，可以直接部署进行3D重建。 4. **配置文件**：包含了训练参数、超参数设置的文本文件，便于调整模型性能。 5. **README**或**Instructions**：详细的项目介绍和使用指南，解释如何运行代码和解压后的文件结构。 6. **结果展示**：可能包含使用该技术重建的3D模型示例，以便用户直观地理解模型效果。 7. **依赖库**：项目所需的第三方库和框架的版本信息，如TensorFlow、PyTorch等。这个项目提供了一个实践深度学习3D重建的平台，通过神经网络解决6-DoF定位问题，为研究者和爱好者提供了一种理解、学习和应用此类技术的途径。用户可以通过运行和修改代码，进一步探索和优化模型，以适应不同的应用场景。

资源推荐

资源详情

资源评论

收起资源包目录

基于神经网络6-Dof的3D重建技术.zip （213个子文件）

format.cc 3KB

FeatureManager.cpp 100KB

Bundler.cpp 39KB

Utils.cpp 18KB

Frame.cpp 13KB

pybind_api.cpp 8KB

SIFTImageManager.cpp 5KB

CUDACache.cpp 4KB

LossGPU.cpp 4KB

bindings.cpp 758B

bindings.cpp 717B

SolverBundling.cu 55KB

CUDAImageUtil.cu 44KB

cuda_ransac.cu 43KB

CUDASolverBundling.cu 25KB

gridencoder.cu 20KB

SIFTImageManager.cu 13KB

common.cu 9KB

SBA.cu 7KB

CUDACache.cu 5KB

dockerfile 5KB

driller.gif 5.18MB

problem_setup_c.gif 2.1MB

milk_jug.gif 2.09MB

preview_results_c.gif 1.05MB

.gitignore 407B

.gitignore 356B

.gitmodules 163B

format.h 107KB

format-inl.h 101KB

format.h 101KB

core.h 96KB

cuda_SimpleMatrixUtil.h 51KB

core.h 46KB

pattern_formatter.h 38KB

cutil_math.h 38KB

format-inl.h 32KB

SolverBundlingDenseUtil.h 27KB

printf.h 25KB

color.h 21KB

SolverBundlingEquationsLie.h 16KB

SIFTImageManager.h 15KB

chrono.h 13KB

logger_impl.h 12KB

Utils.h 11KB

spdlog.h 11KB

os.h 10KB

LieDerivUtil.h 9KB

ranges.h 9KB

posix.h 9KB

FeatureManager.h 9KB

registry.h 8KB

Frame.h 7KB

tweakme.h 6KB

thread_pool.h 6KB

logger.h 6KB

common.h 6KB

ansicolor_sink.h 5KB

rotating_file_sink.h 5KB

wincolor_sink.h 5KB

ostream.h 5KB

ICPUtil.h 5KB

CUDAImageUtil.h 5KB

CUDATimer.h 4KB

bin_to_hex.h 4KB

time.h 4KB

daily_file_sink.h 4KB

file_helper.h 4KB

CUDASolverBundling.h 4KB

Bundler.h 3KB

fmt_helper.h 3KB

mpmc_blocking_q.h 3KB

android_sink.h 3KB

async.h 3KB

SBA.h 3KB

async_logger_impl.h 3KB

syslog_sink.h 3KB

common.h 3KB

SolverBundlingState.h 3KB

stdout_sinks.h 3KB

locale.h 3KB

CUDACache.h 2KB

async_logger.h 2KB

dist_sink.h 2KB

cuda_ransac.h 2KB

periodic_worker.h 2KB

base_sink.h 2KB

basic_file_sink.h 2KB

cudaUtil.h 2KB

stdout_color_sinks.h 2KB

circular_q.h 1KB

EigenDenseBaseAddons.h 1KB

CUDACacheUtil.h 1KB

sink.h 1KB

gridencoder.h 1KB

common.h 1KB

ostream_sink.h 1KB

log_msg.h 1KB

console_globals.h 1KB

null_sink.h 1KB

共 213 条

# LoFTR: Detector-Free Local Feature Matching with Transformers ### [Project Page](https://zju3dv.github.io/loftr) | [Paper](https://arxiv.org/pdf/2104.00680.pdf) > LoFTR: Detector-Free Local Feature Matching with Transformers > [Jiaming Sun](https://jiamingsun.ml)\*, [Zehong Shen](https://zehongs.github.io/)\*, [Yu'ang Wang](https://github.com/angshine)\*, [Hujun Bao](http://www.cad.zju.edu.cn/home/bao/), [Xiaowei Zhou](http://www.cad.zju.edu.cn/home/xzhou/) > CVPR 2021 ![demo_vid](assets/loftr-github-demo.gif) ## TODO List and ETA - [x] Inference code and pretrained models (DS and OT) (2021-4-7) - [x] Code for reproducing the test-set results (2021-4-7) - [x] Webcam demo to reproduce the result shown in the GIF above (2021-4-13) - [x] Training code and training data preparation (expected 2021-6-10) Discussions about the paper are welcomed in the [discussion panel](https://github.com/zju3dv/LoFTR/discussions). :triangular_flag_on_post: **Updates** - Check out [QuadTreeAttention](https://github.com/Tangshitao/QuadTreeAttention), a new attention machanism that improves the efficiency and performance of LoFTR with less demanding GPU requirements for training. - :white_check_mark: Integrated to [Huggingface Spaces](https://huggingface.co/spaces) with [Gradio](https://github.com/gradio-app/gradio). See [Gradio Web Demo](https://huggingface.co/spaces/akhaliq/Kornia-LoFTR) ## Colab demo Want to run LoFTR with custom image pairs without configuring your own GPU environment? Try the Colab demo: [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1BgNIOjFHauFoNB95LGesHBIjioX74USW?usp=sharing) ## Installation ```shell # For full pytorch-lightning trainer features (recommended) conda env create -f environment.yaml conda activate loftr # For the LoFTR matcher only pip install torch einops yacs kornia ``` We provide the [download link](https://drive.google.com/drive/folders/1DOcOPZb3-5cWxLqn256AhwUVjBPifhuf?usp=sharing) to - the scannet-1500-testset (~1GB). - the megadepth-1500-testset (~600MB). - 4 pretrained models of indoor-ds, indoor-ot, outdoor-ds and outdoor-ot (each ~45MB). By now, the environment is all set and the LoFTR-DS model is ready to go! If you want to run LoFTR-OT, some extra steps are needed: <details> <summary>[Requirements for LoFTR-OT]</summary> We use the code from [SuperGluePretrainedNetwork](https://github.com/magicleap/SuperGluePretrainedNetwork) for optimal transport. However, we can't provide the code directly due its strict LICENSE requirements. We recommend downloading it with the following command instead. ```shell cd src/loftr/utils wget https://raw.githubusercontent.com/magicleap/SuperGluePretrainedNetwork/master/models/superglue.py ``` </details> ## Run LoFTR demos ### Match image pairs with LoFTR <details> <summary>[code snippets]</summary> ```python from src.loftr import LoFTR, default_cfg # Initialize LoFTR matcher = LoFTR(config=default_cfg) matcher.load_state_dict(torch.load("weights/indoor_ds.ckpt")['state_dict']) matcher = matcher.eval().cuda() # Inference with torch.no_grad(): matcher(batch) # batch = {'image0': img0, 'image1': img1} mkpts0 = batch['mkpts0_f'].cpu().numpy() mkpts1 = batch['mkpts1_f'].cpu().numpy() ``` </details> An example is given in `notebooks/demo_single_pair.ipynb`. ### Online demo Run the online demo with a webcam or video to reproduce the result shown in the GIF above. ```bash cd demo ./run_demo.sh ``` <details> <summary>[run_demo.sh]</summary> ```bash #!/bin/bash set -e # set -x if [ ! -f utils.py ]; then echo "Downloading utils.py from the SuperGlue repo." echo "We cannot provide this file directly due to its strict licence." wget https://raw.githubusercontent.com/magicleap/SuperGluePretrainedNetwork/master/models/utils.py fi # Use webcam 0 as input source. input=0 # or use a pre-recorded video given the path. # input=/home/sunjiaming/Downloads/scannet_test/$scene_name.mp4 # Toggle indoor/outdoor model here. model_ckpt=../weights/indoor_ds.ckpt # model_ckpt=../weights/outdoor_ds.ckpt # Optionally assign the GPU ID. # export CUDA_VISIBLE_DEVICES=0 echo "Running LoFTR demo.." eval "$(conda shell.bash hook)" conda activate loftr python demo_loftr.py --weight $model_ckpt --input $input # To save the input video and output match visualizations. # python demo_loftr.py --weight $model_ckpt --input $input --save_video --save_input # Running on remote GPU servers with no GUI. # Save images first. # python demo_loftr.py --weight $model_ckpt --input $input --no_display --output_dir="./demo_images/" # Then convert them to a video. # ffmpeg -framerate 15 -pattern_type glob -i '*.png' -c:v libx264 -r 30 -pix_fmt yuv420p out.mp4 ``` </details> ### Reproduce the testing results with pytorch-lightning You need to setup the testing subsets of ScanNet and MegaDepth first. We create symlinks from the previously downloaded datasets to `data/{{dataset}}/test`. ```shell # set up symlinks ln -s /path/to/scannet-1500-testset/* /path/to/LoFTR/data/scannet/test ln -s /path/to/megadepth-1500-testset/* /path/to/LoFTR/data/megadepth/test ``` ```shell conda activate loftr # with shell script bash ./scripts/reproduce_test/indoor_ds.sh # or python test.py configs/data/scannet_test_1500.py configs/loftr/loftr_ds.py --ckpt_path weights/indoor_ds.ckpt --profiler_name inference --gpus=1 --accelerator="ddp" ``` For visualizing the results, please refer to `notebooks/visualize_dump_results.ipynb`.  ## Training See [Training LoFTR](./docs/TRAINING.md) for more details. ## Citation If you find this code useful for your research, please use the following BibTeX entry. ```bibtex @article{sun2021loftr, title={{LoFTR}: Detector-Free Local Feature Matching with Transformers}, author={Sun, Jiaming and Shen, Zehong and Wang, Yuang and Bao, Hujun and Zhou, Xiaowei}, journal={{CVPR}}, year={2021} } ``` ## Copyright This work is affiliated with ZJU-SenseTime Joint Lab of 3D Vision, and its intellectual property belongs to SenseTime Group Ltd. ``` Copyright SenseTime. All Rights Reserved. Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. ```

评论收藏

内容反馈

版权申诉