arnoldland's picture
Initial commit
aae3ba1

A newer version of the Gradio SDK is available: 6.2.0

Upgrade

MoGe-2 Normal Estimation

Qualitative comparison of normal estimation with Marigold and Metric3D V2

NOTE: Normal estimation was implemented after the submission of the MoGe-2 paper and is therefore not included in the original publication. This feature required minimal additional effort, and we do not claim any novel technical contribution.

We added a lightweight convolutional head and trained the normal output using a squared angular loss:

Lnormal=1∣Mβˆ£βˆ‘i∈M∠(n^i,ni)2 \mathcal L_{\rm normal} = {1\over |\mathcal M|}\sum_{i\in\mathcal M} \angle (\hat{\mathbf n}_i,\mathbf n_i)^2

where $\hat{\mathbf{n}}_i$ is the predicted normal, $\mathbf{n}_i$ is the ground-truth normal, and $\mathcal{M}$ denotes the set of valid pixels. For convenience, we did not collect ground-truth normal maps for training. Instead, we derived surface normals from the depth map and camera intrinsics. The resulting estimates are visually and numerically satisfactory.