A newer version of the Gradio SDK is available:
6.2.0
MoGe-2 Normal Estimation
Qualitative comparison of normal estimation with Marigold and Metric3D V2
NOTE: Normal estimation was implemented after the submission of the MoGe-2 paper and is therefore not included in the original publication. This feature required minimal additional effort, and we do not claim any novel technical contribution.
We added a lightweight convolutional head and trained the normal output using a squared angular loss:
where $\hat{\mathbf{n}}_i$ is the predicted normal, $\mathbf{n}_i$ is the ground-truth normal, and $\mathcal{M}$ denotes the set of valid pixels. For convenience, we did not collect ground-truth normal maps for training. Instead, we derived surface normals from the depth map and camera intrinsics. The resulting estimates are visually and numerically satisfactory.