NVIDIA

Articles (19)

Monumental numeral 64 under reconstruction from thousands of small blocks, one digit complete and the other half-built behind scaffolding, illustrating FP64 precision being emulated from 8-bit arithmetic.

Reconstructing FP64: How Supercomputing's Establishment Is Adapting Science to AI Silicon

Two papers, Matsuoka's FP8-emulation preprints and the Dongarra 'Ride the Wave' paper, point to a field adapting scientific computing to AI silicon it no longer controls.

Top500ISC

Matt Walters·Jun 15, 2026

Isometric illustration of a 72-GPU Catalina pod. A small front section glows red at full 1200W power; the rest of the pod, packed more densely, glows in a calm blue-green at 960W.

AInews

Inside Meta's 83,000-GPU AI Supercomputer: Why It Runs the Silicon at 80% Power on Purpose

Meta's first end-to-end account of running a 150 MW, 83,000-GB200 cluster - when power is the ceiling, the cluster, not the chip, is what you optimize.

AI InfrastructureNVIDIA

SCN Staff·May 30, 2026

A single robotic gripper stands in a data-center aisle, emitting a glowing purple swarm of translucent ghost-grippers — a visual metaphor for millions of GPU-simulated grasp attempts run before one real grasp.

AInews

2 Million Trajectories Before First Contact: How GPU Simulation Became Robotics' Proving Ground

NVIDIA is bringing receipts to ICRA 2026 - eight accepted papers it says show the sim-to-real gap closing on real hardware. More and more, a robot policy is the output of large-scale GPU simulation, with the heavy compute done long before a gripper touches an object.

RoboticsNVIDIA

SCN Staff·May 28, 2026

Abstract three-quarter rendering of a two-tier network fabric - a row of leaf switches beneath a shorter row of spine switches, connected by hundreds of fine indigo parallel lines.

AInews

MRC Gives Open Ethernet Its First 75,000-GPU Production Proof Point

The 50-author MRC paper gives Ethernet its first multi-vendor, open-spec, production-trace answer to the one argument InfiniBand had left at frontier-training scale.

AI InfrastructureNVIDIA

SCN Staff·May 12, 2026

Three-tier memory hierarchy with HBM3E stacks at top, server DRAM DIMM modules at middle, and sparse LPDDR5X packages at bottom, visualizing memory manufacturer allocation priority that has reduced consumer and edge AI memory availability.

AIanalysis

Apple's Mac Shortage Signals Memory Supply Chain Has Reorganized Around Data Center AI

Apple cut Mac memory ceilings and delayed M5 Ultra by four months as HBM production for data center AI consumes edge LPDDR5X allocation.

AI InfrastructureSupply Chain & Critical Materials

SCN Staff·May 12, 2026

A matte carbon-fiber Le Mans Prototype race car with its rear half dissolving into a triangular wireframe mesh overlaid with cyan and amber CFD pressure-field contours, representing the boundary between physical aerodynamics and AI surrogate prediction that defines the IBM and Dallara research collaboration.

Emergingnews

IBM and Dallara Enter the AI-CFD Surrogate Race, Eighteen Months In

IBM and Dallara published GIST, an AI surrogate for motorsport CFD. Neural Concept, Ansys SimAI, and NVIDIA reached production deployment first.

NVIDIAAI Surrogate Models

SCN Staff·May 6, 2026

Two parallel rows of AI server infrastructure diverging from a central fault line, representing frontier AI compute splitting between established and emerging hardware ecosystems.

AIanalysis

DeepSeek V4-Pro on Ascend 950PR: The Two-Stack AI Reality

DeepSeek V4-Pro runs on Huawei Ascend 950PR as the State Department pivots export controls from chip access to model IP, describing two parallel AI stacks.

Export Controls & Trade PolicyAI Infrastructure

SCN Staff·Apr 28, 2026

Interior of a Japanese supercomputing facility with open server rack showing GPU accelerators alongside a Fujitsu CPU module, representing FugakuNEXT's hybrid architecture pairing MONAKA-X CPUs with NVIDIA GPUs over NVLink Fusion.

HPCanalysis

Japan's Next Flagship Machine Abandons the Top500 Chase

FugakuNEXT pairs Fujitsu MONAKA-X CPUs with NVIDIA GPUs, ending Japan's all-Arm sovereign architecture and betting on throughput over benchmarks.

Exascale ComputingTop500

SCN Staff·Apr 27, 2026

A SOCAMM2 LPDDR5X server memory module showing dense chip array architecture, illuminated against a black background, designed for next-generation AI server platforms.

AIanalysis

Vera Rubin's Memory Stack Is Korean. How Three Vendors Got There Tells You Why It Will Stay That Way.

Samsung, SK hynix, and Micron converged on SOCAMM2 mass production within six weeks for NVIDIA's Vera Rubin. Korean suppliers now control both memory tiers.

NVIDIASemiconductor Manufacturing

SCN Staff·Apr 22, 2026

Network flows converging at a switch node, with orderly blue streams on the inbound side becoming chaotic amber tangles on the outbound, visualizing the Incast congestion pattern where InfiniBand collapsed while Slingshot maintained performance.

HPCanalysis

Slingshot Held Performance Under AI Traffic Patterns That Collapsed InfiniBand by 5x on Production Exascale

ISC 2026 research on LUMI, Leonardo, CRESCO8: Slingshot held performance; InfiniBand collapsed 5x under Incast, the AI gradient-sync traffic pattern.

AI InfrastructureData Center Infrastructure

SCN Staff·Apr 21, 2026

Macro render of a single HBM memory stack beside a GPU die on a silicon interposer, with adjacent memory sockets sitting empty.

AIanalysis

HBM Allocation, Not HBM Supply, Is the 2026 AI Infrastructure Story

HBM scarcity has moved beyond semiconductor supply into system planning. Accelerator availability, server bill-of-materials, cluster economics, and 2026 data center buildouts are all being rewritten around memory - not compute.

AI InfrastructureHBM

SCN Staff·Apr 21, 2026

A glass-walled quantum computing lab shows a cryogenic processor assembly on the left, workstation monitors with data plots in the center, and tall server racks with dense cabling on the right.

Quantumnews

NVIDIA’s Ising Pitch Is Really About Quantum’s Classical Control Plane

The World Quantum Day launch adds open models for calibration and QEC decoding, but the bigger move is NVIDIA tying AI inference to CUDA-Q, CUDAQ-Realtime, and NVQLink in the path to fault tolerance.

Quantum Classical Control PlaneNVIDIA

SCN Staff·Apr 15, 2026

Illustration of token streams routing through a central AI communication layer to a small set of active compute nodes inside a larger data center, representing sparse activation and expert-parallel communication.

AIanalysis

UCCL-EP vs. NCCL EP: Portability or Consolidation for MoE Communication?

Two new expert-parallel efforts point to different futures for MoE systems: one built for heterogeneous fleets, the other folded into NVIDIA’s stack.

AI InfrastructureInference Economics

SCN Staff·Apr 13, 2026

Semiconductor wafer substrate reflecting iridescent light -- the indium phosphide supply chain constraining AI optical interconnect production

AIanalysis

NVIDIA's $4 Billion Photonics Bet Is an Admission: The AI Buildout Has a Materials Problem

NVIDIA's $4B investment in Lumentum and Coherent signals indium phosphide scarcity and power equipment lead times are gating $2.52T AI spending forecast.

Supply Chain & Critical MaterialsExport Controls & Trade Policy

SCN Staff·Apr 9, 2026

Aerial view of the Eemshaven data center campus in the Netherlands

AIanalysis

Nvidia's $1 Trillion Backlog Hits the Grid Capacity Wall

Nvidia projects $1T in Blackwell orders through 2027, but 72% of operators cite grid capacity as the primary constraint. Power now limits AI.

NVIDIAPower & Energy

SCN Staff·Apr 8, 2026

SambaNova Systems CEO Rodrigo Liang holds the SN40L Reconfigurable Dataflow Unit (RDU), the company's fourth-generation AI inference chip. SambaNova's dataflow architecture makes it one of the most likely candidates to demonstrate whether FlatAttention's collective-primitive approach generalizes beyond the unnamed hardware tested in the April 2026 paper.

AIanalysis

FlatAttention Claims 4× Speedup Over FlashAttention-3 — But on What Hardware?

FlatAttention claims 4× speedup over FlashAttention-3 on unnamed tile-based accelerators. No code, no hardware vendor, no deployment path yet.

AI InfrastructureInference Economics

SCN Staff·Apr 7, 2026

Emergingnews

NVIDIA's $4 billion photonics bet tells you where copper dies

Investments in Coherent and Lumentum, OFC 2026 timing, and the conspicuous absence of NVLink CPO. NVIDIA knows optical interconnects are the next bottleneck, and it's moving to own the solution.

NVIDIAOptical Interconnects

SCN Staff·Mar 16, 2026

AInews

Physical AI Is NVIDIA's Quiet Second Act at GTC 2026

Two dedicated "Physical AI Days," Isaac GR00T N1.6, and a robotics stack that mirrors the CUDA playbook. NVIDIA is building the operating system for the physical world - and most of the GTC coverage is ignoring it.

NVIDIARobotics

SCN Staff·Mar 16, 2026

AIanalysis

NVIDIA's Vera Rubin Is a Capex Grenade - and Every Hyperscaler's 2027 Budget Knows It

The Blackwell-to-Rubin transition is a forcing function for the entire data center industry.

NVIDIAAI Infrastructure

SCN Staff·Mar 12, 2026