Community Blog & Articles

Community Articles

Introducing Falcon H1R 7B

about 23 hours ago

The Optimal Architecture for Small Language Models

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Continuity as a First-Class System Property in Artificial Intelligence

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

about 10 hours ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

about 9 hours ago

Uncensor any LLM with abliteration

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Deriving the DPO Loss from First Principles

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

about 14 hours ago

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

about 10 hours ago

Code a simple RAG from scratch

Small Language Models (SLM): A Comprehensive Overview

Deriving the PPO Loss from First Principles

Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models

Mastering Tensor Dimensions in Transformers

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

transformerspytorchoptimization

Continuous batching from first principles

November 25, 2025

Building Deep Research: How we Achieved State of the Art

November 24, 2025

OVHcloud on Hugging Face Inference Providers 🔥

November 24, 2025

llmexperimentationfine-tuning

20x Faster TRL Fine-tuning with RapidFire AI

November 21, 2025

audiospeechleaderboard

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

November 21, 2025

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

November 20, 2025

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

November 19, 2025

Easily Build and Share ROCm Kernels with Hugging Face

November 17, 2025

Join the AMD Open Robotics Hackathon

November 13, 2025

partnershipsgoogleannouncement

Building for an Open Future - our new partnership with Google Cloud

November 13, 2025

Aligning to What? Rethinking Agent Generalization in MiniMax M2

October 30, 2025

On the Shifting Global Compute Landscape

October 29, 2025

lerobotrobotics

Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac

October 29, 2025

How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare

October 28, 2025

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Introducing Falcon H1R 7B

about 23 hours ago

The Optimal Architecture for Small Language Models

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Continuity as a First-Class System Property in Artificial Intelligence

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

about 10 hours ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

about 9 hours ago

Uncensor any LLM with abliteration

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Deriving the DPO Loss from First Principles

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

about 14 hours ago

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

about 10 hours ago

Code a simple RAG from scratch

Small Language Models (SLM): A Comprehensive Overview

Deriving the PPO Loss from First Principles

Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models

Mastering Tensor Dimensions in Transformers

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

View all articles