Skip to content

Conversation

@kvuong2711
Copy link

@kvuong2711 kvuong2711 commented Sep 26, 2025

Summary:

AerialMegaDepth (CVPR'25) is a hybrid varying-altitude 3D dataset combining MegaDepth images with geospatial mesh renderings, featuring 132K images across 137 scenes with camera intrinsics, poses, and depths in a unified coordinate frame. It has two potential benefits:

  • Improve 3D estimation in aerial-ground scenarios (initial observation shown in paper: DUSt3R finetuned on AerialMegaDepth improves aerial-ground reconstruction)
  • Metric scale: by co-registering MegaDepth images with geo-referenced mesh renderings, the provided depths + cameras are in metric scale, thus could potentially help improving metric scale estimation

What's included:

  • Data processing scripts: downloading + conversion to WAI-formatted data + aggregation

  • WAI-formatted dataloader

Not included (deferred to follow-ups):

  • Pre-computed metadata (e.g., covisibility matrix + MoGe depthmaps)

  • Selection of covisibility threshold for best performance during training/finetuning

Screenshots

See below some visualizations of the cameras + point clouds as sanity checks.

Screenshot from 2025-10-01 15-07-26 Screenshot from 2025-10-01 15-06-23

cc: @Nik-V9

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 26, 2025
@kvuong2711 kvuong2711 marked this pull request as draft September 26, 2025 21:37
@kvuong2711 kvuong2711 marked this pull request as ready for review September 26, 2025 21:38
@ethanweber ethanweber requested a review from Nik-V9 September 29, 2025 10:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant