We ran several experiments on large-scale scene datasets using COLMAP’s camera intrinsics and poses. However, we found a clear scale mismatch between the point cloud generated by MapAnything and the original COLMAP point cloud. In the figure below, the full sparse COLMAP point cloud is shown, with the pink box indicating MapAnything’s output.
Additionally, the estimated poses differ from those produced by COLMAP.
