Community Blog & Articles

Community Articles

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

Building Harvey-style tabular review from scratch, but better

Mastering Tensor Dimensions in Transformers

How I contributed a new model to the Transformers library using Codex

YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt?

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Small Language Models (SLM): A Comprehensive Overview

From GRPO to DAPO and GSPO: What, Why, and How

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

NEO-unify: Building Native Multimodal Unified Models End to End

From doctest to runnable Markdown

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

Magpie Speech — Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset

announcementdiffusionworld-model

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

+1

multimodalnlpcommunity

Multimodal Embedding & Reranker Models with Sentence Transformers

ALTK‑Evolve: On‑the‑Job Learning for AI Agents

open-source-collabpartnershipsopen-source

Safetensors is Joining the PyTorch Foundation

multimodalon-devicegemma4

Welcome Gemma 4: Frontier multimodal intelligence on device

+3

Holo3: Breaking the Computer Use Frontier

Falcon Perception

gradioserveropen-source

Any Custom Frontend with Gradio's Backend

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

Training mRNA Language Models Across 25 Species for $165

trlreinforcement-learningannouncement

TRL v1.0: Post-Training Library Built to Move with the Field

guideagentsinference-providers

Liberate your OpenClaw

+4

A New Framework for Evaluating Voice Agents (EVA)

Build a Domain-Specific Embedding Model in Under a Day

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

Building Harvey-style tabular review from scratch, but better

Mastering Tensor Dimensions in Transformers

How I contributed a new model to the Transformers library using Codex

YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt?

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Small Language Models (SLM): A Comprehensive Overview

From GRPO to DAPO and GSPO: What, Why, and How

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

NEO-unify: Building Native Multimodal Unified Models End to End

From doctest to runnable Markdown

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

Magpie Speech — Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset

View all articles