How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare Oct 28 • 18
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks Oct 28 • 16
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard Oct 21 • 14
📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models Aug 18 • 5
NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual Aug 18 • 3
NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks Aug 11 • 75
Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval Jul 9 • 4
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions Jun 10 • 21
view post Post 104 ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689) See translation 👀 1 1 + Reply
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 50
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 50
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 50
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation Paper • 2502.03860 • Published Feb 6 • 25
AutoTemplate: A Simple Recipe for Lexically Constrained Text Generation Paper • 2211.08387 • Published Nov 15, 2022
Noisy Pairing and Partial Supervision for Opinion Summarization Paper • 2211.08723 • Published Nov 16, 2022
Less is More for Long Document Summary Evaluation by LLMs Paper • 2309.07382 • Published Sep 14, 2023 • 1
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG Paper • 2402.17717 • Published Feb 27, 2024
XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates Paper • 2309.11063 • Published Sep 20, 2023
Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models Paper • 2312.06149 • Published Dec 11, 2023 • 3
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 5
A Long Way to Go: Investigating Length Correlations in RLHF Paper • 2310.03716 • Published Oct 5, 2023 • 10
Neuralangelo: High-Fidelity Neural Surface Reconstruction Paper • 2306.03092 • Published Jun 5, 2023 • 3
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models Paper • 2305.10474 • Published May 17, 2023 • 1