NVIDIA
Articles (19)

Reconstructing FP64: How Supercomputing's Establishment Is Adapting Science to AI Silicon
Two papers, Matsuoka's FP8-emulation preprints and the Dongarra 'Ride the Wave' paper, point to a field adapting scientific computing to AI silicon it no longer controls.

Inside Meta's 83,000-GPU AI Supercomputer: Why It Runs the Silicon at 80% Power on Purpose
Meta's first end-to-end account of running a 150 MW, 83,000-GB200 cluster - when power is the ceiling, the cluster, not the chip, is what you optimize.

2 Million Trajectories Before First Contact: How GPU Simulation Became Robotics' Proving Ground
NVIDIA is bringing receipts to ICRA 2026 - eight accepted papers it says show the sim-to-real gap closing on real hardware. More and more, a robot policy is the output of large-scale GPU simulation, with the heavy compute done long before a gripper touches an object.

MRC Gives Open Ethernet Its First 75,000-GPU Production Proof Point
The 50-author MRC paper gives Ethernet its first multi-vendor, open-spec, production-trace answer to the one argument InfiniBand had left at frontier-training scale.

Apple's Mac Shortage Signals Memory Supply Chain Has Reorganized Around Data Center AI
Apple cut Mac memory ceilings and delayed M5 Ultra by four months as HBM production for data center AI consumes edge LPDDR5X allocation.

IBM and Dallara Enter the AI-CFD Surrogate Race, Eighteen Months In
IBM and Dallara published GIST, an AI surrogate for motorsport CFD. Neural Concept, Ansys SimAI, and NVIDIA reached production deployment first.

DeepSeek V4-Pro on Ascend 950PR: The Two-Stack AI Reality
DeepSeek V4-Pro runs on Huawei Ascend 950PR as the State Department pivots export controls from chip access to model IP, describing two parallel AI stacks.
-800x600.png%3Fprefix%3Dmedia&w=3840&q=75)
Japan's Next Flagship Machine Abandons the Top500 Chase
FugakuNEXT pairs Fujitsu MONAKA-X CPUs with NVIDIA GPUs, ending Japan's all-Arm sovereign architecture and betting on throughput over benchmarks.

Vera Rubin's Memory Stack Is Korean. How Three Vendors Got There Tells You Why It Will Stay That Way.
Samsung, SK hynix, and Micron converged on SOCAMM2 mass production within six weeks for NVIDIA's Vera Rubin. Korean suppliers now control both memory tiers.

Slingshot Held Performance Under AI Traffic Patterns That Collapsed InfiniBand by 5x on Production Exascale
ISC 2026 research on LUMI, Leonardo, CRESCO8: Slingshot held performance; InfiniBand collapsed 5x under Incast, the AI gradient-sync traffic pattern.

HBM Allocation, Not HBM Supply, Is the 2026 AI Infrastructure Story
HBM scarcity has moved beyond semiconductor supply into system planning. Accelerator availability, server bill-of-materials, cluster economics, and 2026 data center buildouts are all being rewritten around memory - not compute.

NVIDIA’s Ising Pitch Is Really About Quantum’s Classical Control Plane
The World Quantum Day launch adds open models for calibration and QEC decoding, but the bigger move is NVIDIA tying AI inference to CUDA-Q, CUDAQ-Realtime, and NVQLink in the path to fault tolerance.

UCCL-EP vs. NCCL EP: Portability or Consolidation for MoE Communication?
Two new expert-parallel efforts point to different futures for MoE systems: one built for heterogeneous fleets, the other folded into NVIDIA’s stack.
NVIDIA's $4 Billion Photonics Bet Is an Admission: The AI Buildout Has a Materials Problem
NVIDIA's $4B investment in Lumentum and Coherent signals indium phosphide scarcity and power equipment lead times are gating $2.52T AI spending forecast.

Nvidia's $1 Trillion Backlog Hits the Grid Capacity Wall
Nvidia projects $1T in Blackwell orders through 2027, but 72% of operators cite grid capacity as the primary constraint. Power now limits AI.

FlatAttention Claims 4× Speedup Over FlashAttention-3 — But on What Hardware?
FlatAttention claims 4× speedup over FlashAttention-3 on unnamed tile-based accelerators. No code, no hardware vendor, no deployment path yet.

NVIDIA's $4 billion photonics bet tells you where copper dies
Investments in Coherent and Lumentum, OFC 2026 timing, and the conspicuous absence of NVLink CPO. NVIDIA knows optical interconnects are the next bottleneck, and it's moving to own the solution.

Physical AI Is NVIDIA's Quiet Second Act at GTC 2026
Two dedicated "Physical AI Days," Isaac GR00T N1.6, and a robotics stack that mirrors the CUDA playbook. NVIDIA is building the operating system for the physical world - and most of the GTC coverage is ignoring it.

NVIDIA's Vera Rubin Is a Capex Grenade - and Every Hyperscaler's 2027 Budget Knows It
The Blackwell-to-Rubin transition is a forcing function for the entire data center industry.