Guide › Electrical & Energy Infrastructure › 4.10

Chapter 4.10

Grid-Interactive Behavior: Ride-Through, Reactive/Voltage Support & Frequency Response Toward the POI

A gigawatt of AI load that drops itself to protect its own electronics during a routine grid fault is no longer a customer the grid tolerates — it is a contingency the grid plans against, and the 2026 ride-through mandate turned that distinction into a binding design requirement at the point of interconnection.

POWER-BOUND

What you'll decide here

Whether your facility rides through a normally-cleared transmission fault — a sub-100 ms voltage sag to 0.2–0.4 pu — or trips offline to protect equipment; this is now a grid-code obligation, not a UPS-vendor default, and it is set by relay, UPS-transfer, and undervoltage-load-retention settings you must specify deliberately.
How much dynamic reactive support (Mvar / power-factor / STATCOM / dynamic-VAR) you provision toward the POI, and whether you meet a unity-to-0.95 lagging obligation with switched banks, a STATCOM, or grid-forming inverter headroom in your BESS.
Whether you engineer any load-side frequency response — fast load curtailment on under-frequency — or stay a passive constant-power load that the balancing authority must cover with its own reserves.
How your as-built electrical behavior is represented in the utility's dynamic model (MOD-032/026/027 submittals), because a wrong model is now a reportable reliability gap, not a paperwork miss.
Whether you are scoped as a grid asset (rides through, supports voltage, sheds on under-frequency, exports flexibility) or a grid risk (a voltage-sensitive megawatt block that self-trips) — the same building, two opposite reliability postures.

A data center's relationship to the grid was long one-directional: the grid delivered clean power, and when it did not, the facility protected itself. Double-conversion UPS, fast static-transfer switches, and conservative undervoltage relays existed precisely to disconnect the IT load from a disturbed grid and ride it on batteries. That instinct — protect my electronics first, let the grid sort itself out — was correct when a data center was a 10–30 MW rounding error on a feeder. It is catastrophic when a single campus is 500 MW to a gigawatt of voltage-sensitive load sitting on a 230 kV or 345 kV bus.

The fork this chapter forces is the inversion of that instinct. The question is no longer "how do I isolate my load from the grid during a fault?" but "how do I keep a gigawatt of load online through a fault the grid expects me to survive?" The downstream cost of getting it wrong is no longer your own outage — it is a cascading reliability event that the North American grid operator now plans against, names you as the cause of, and writes mandatory actions to prevent. In 2026 that shift stopped being a thesis and became an enforceable obligation. This chapter is the engineering of ride-through, reactive/voltage support, and frequency response toward the point of interconnection (POI) — the three behaviors that decide whether your facility is modeled as a grid asset or a grid risk. The canonical NERC obligation framing lives in Chapter 4.3; the transient physics in Chapter 4.5; this chapter is where those obligations become relay settings, Mvar, and droop curves at the fence line.

The motivating case: 1.5 GW gone in 82 seconds

On a summer day in 2024 in the Eastern Interconnection, a lightning arrestor failed on a 230 kV transmission line. The fault was normally cleared — protection operated as designed, the line was isolated in tens of milliseconds, the grid did exactly what it was built to do. What was not anticipated is that the resulting voltage sag caused approximately 1,500 MW of data-center load to disconnect simultaneously, across multiple facilities, in response to a disturbance the rest of the grid shrugged off. The configuration produced six successive system faults over an 82-second window; per-event voltage excursions reached 0.24–0.4 pu deviation lasting 42–66 ms, and the post-event system voltage rebounded to 1.07 pu as the suddenly-unloaded grid over-swung high (NERC Incident Review: Large Load Loss; arXiv 2510.03867, 2025).

That over-voltage is the part operators lose sleep over. When a gigawatt of load vanishes in milliseconds, the reactive balance that load was absorbing snaps back as a voltage rise that can trip other equipment — generators, capacitor banks, neighboring loads — and propagate. A voltage-sensitive load block that protects itself is not neutral; it converts a contained transmission fault into a system-wide swing. NERC's response was its rare Level 3 "Essential Actions" Alert, issued May 4, 2026 — the highest of its three alert tiers, reserved for immediate risks to the bulk power system. Registered entities had to acknowledge by May 11 and respond by August 3, 2026. The seven mandated actions reframe large computational load from an emerging-risk footnote into a hard planning obligation: transmission planners must now study stability margins annually in areas with computational load and implement system-side corrective actions ensuring no loss of firm load from normally-cleared faults (NERC Level 3 Alert; Utility Dive; Morgan Lewis, 2026).

Why your UPS makes the problem worse, not better

The irony of the 2024 event is that the equipment that caused the load drop was the equipment installed to improve reliability. A double-conversion UPS or a fast static-transfer switch sees a deep voltage sag and does its job: it disconnects the downstream load from the grid and rides on stored energy. From the IT load's perspective, nothing happened — the servers never blinked. But from the grid's perspective, hundreds of megawatts just dropped off in milliseconds. The data center experienced a successful ride-through of its own load while inflicting a contingency on everyone else. The fix is counterintuitive: the UPS and transfer logic must be tuned to keep drawing from the grid through a shallow, short sag — to behave as a constant, well-mannered load — rather than reflexively islanding the moment voltage dips below a conservative threshold. Ride-through is as much a control-philosophy decision as a hardware one.

Engineering fault-ride-through at the POI

Fault-ride-through (FRT), or low-voltage ride-through (LVRT), is the obligation to remain connected and continue drawing through a defined voltage-versus-time envelope. The envelope is a curve: the deeper the sag, the shorter the time the load is required to tolerate before it is permitted to trip. For a normally-cleared transmission fault — typically 0.1–0.4 pu retained voltage for 80–150 ms while protection operates — the load must stay on. SPP's published demand-facility curve, for example, holds 0.90–1.10 pu continuously and tolerates excursions down to roughly 0.50 pu for ~0.15 s; ERCOT's NOGRR 282 / NPRR 1308 (filed late 2025) codify frequency and voltage ride-through performance expectations for "Large Electronic Loads" — facilities where ≥50% of demand is power-electronic computational load, at sites ≥75 MW (ERCOT NPRR1234 / NOGRR 282, 2025–26).

The deep engineering problem is that AI load is a near-ideal constant-power load (CPL). The front-end rectifiers regulate output power against voltage, so when grid voltage sags, current rises to hold power constant — exactly the wrong-signed behavior, because it drags voltage down further and looks to the grid like negative incremental resistance. Combined with limited DC-link energy, a CPL is prone to tripping on deep, short sags unless it is buffered by storage or coordinated controls. Ride-through is therefore not one setting but a coordinated chain of them: the protective relays (SEL or equivalent) at the POI and on the MV distribution; the UPS transfer logic (when it goes to battery vs holds on grid); and the undervoltage load-retention settings at the rack and rectifier. Each must be set to keep the load on through the curve, and they must be coordinated so that no link trips early and orphans the rest.

The ride-through decision: trip-to-protect vs hold-and-ride

Behavior	Trigger / setting	Grid consequence	What it requires	2026 status
Trip-to-protect (legacy default)	UVR / STS transfer at ~0.85–0.9 pu, <1 cycle	Simultaneous multi-MW load loss; post-fault over-voltage swing	Conservative UPS + fast transfer (already installed)	Now a reportable reliability risk; disallowed for new large loads
Hold-and-ride (mandated)	Stay connected through ~0.2–0.5 pu sag for 80–150 ms	Load behaves as a stable contingency; no spurious trip	Relay/UVR retune, UPS hold-on-grid logic, DC-link/BBU buffering	Required to remain online through normally-cleared faults
Ride + dynamic VAR (grid-asset)	Hold-and-ride plus reactive injection during sag	Supports local voltage; reduces neighbor trip risk	STATCOM or grid-forming BESS headroom at the POI	Emerging best practice; rewarded in some tariffs

The same fault, two facility postures. Column values are 2026-current practitioner ranges; the regulatory column reflects the post-NERC-Level-3 obligation in transmission-connected ISOs (ERCOT NOGRR 282, SPP demand-facility curve).

The fork: where does ride-through energy come from?

To ride a sub-150 ms sag without tripping, something must hold the DC bus up while grid voltage is depressed. There are three sources, and the choice cascades into cost and architecture. (1) On-package and rack capacitance rides the shortest events for free — microseconds to a few milliseconds — but cannot span a fault. (2) Rack-level BBUs / supercapacitors (OCP ORV3-class, or supercap ride-through modules) span the 80–150 ms fault window cheaply and are the natural FRT energy source; this is the same spine treated as the transient-absorption layer in Chapter 4.5. (3) A facility-scale BESS at the medium-voltage bus rides through and can inject reactive support — but only if it is sized and grid-forming. The decision is whether to push ride-through energy down to distributed rack BBUs (cheap, no reactive capability) or up to a central BESS (costly, but doubles as voltage support and primary frequency response). Most 2026 designs do both: BBUs for the fault window, BESS for the reactive and frequency role. Get this wrong and you either over-trip (too little stored energy at the right layer) or over-spend (a gigawatt BESS bought to do a job rack supercaps could have done).

Reactive power and voltage support as a tariff obligation

Ride-through keeps you online; reactive support keeps the bus healthy. A large AI load is a substantial consumer of reactive power — the active-front-end rectifiers and the transformer/cable reactance between the POI and the rack draw Mvar that the grid must supply, depressing voltage at the interconnection. Grid codes and large-load tariffs increasingly require the facility to manage its own power factor toward a target (commonly unity to 0.95 lagging at the POI) and, in the strictest jurisdictions, to provide dynamic voltage support — fast reactive injection during disturbances, not just steady-state correction.

The provision ladder runs from cheap-and-static to expensive-and-fast. Switched capacitor and reactor banks correct steady-state power factor at low cost but respond in cycles-to-seconds and cannot help a millisecond sag. STATCOMs (static synchronous compensators) inject or absorb Mvar continuously and respond in milliseconds — the workhorse for dynamic-VAR obligations, and increasingly specified at the on-site substation of gigawatt campuses. A grid-forming BESS can do the reactive job and the ride-through job and primary frequency response from one asset, which is why facility BESS is migrating from a pure ride-through device to a multi-role grid-interface asset (NVIDIA Production-Ready BESS for AI Factories, 2025). The decision is a classic fork: meet a steady-state PF target with banks (cheap, passive) or commit to dynamic support with a STATCOM or grid-forming BESS (costly, but turns you from a voltage problem into a voltage solution and may earn grid-services revenue — see Chapter 15.8).

Frequency response: the load that pushes back

The third grid-interactive behavior is frequency response — and here the data center is mostly an under-developed asset. A passive constant-power load contributes nothing to frequency regulation; when system frequency falls (generation lost), the load keeps drawing its full demand and the balancing authority must cover the deficit from spinning reserves and primary frequency response on the generation side. A grid-interactive load does the opposite: it sheds or curtails demand on under-frequency, providing fast, load-side primary frequency response that is, megawatt-for-megawatt, faster than spinning up a turbine.

AI load is unusually well-suited to this. Training is checkpoint-tolerant and batch inference is interruptible (Chapter 1.1), so a fraction of campus load can be curtailed in seconds without losing work — a controllable, fast, dispatchable resource the grid would pay for. The engineering is a droop characteristic measured at the facility: define a frequency deadband and a slope (MW shed per 0.1 Hz below nominal), wire it to a fast load-control layer that throttles GPU power caps or pauses curtailable jobs, and represent that behavior in the utility's model. The fork is strategic: stay a passive load (simplest, but contributes to the reserve burden you are increasingly asked to fund) or engineer load-side frequency response (more control complexity, but converts the facility into a grid resource with a revenue line). The islanded-microgrid version of this — where the facility's own inertia and grid-forming inverters set frequency — is treated in Chapter 4.8.

~1,500 MW

data-center load lost simultaneously on a single normally-cleared 230 kV fault (Eastern Interconnection, 2024)

2026NERC Incident Review / Utility Dive

6 faults / 82 s

successive system faults from the lightning-arrestor failure; per-event sag 0.24-0.4 pu deviation for 42-66 ms

2025NERC / arXiv 2510.03867

1.07 pu

post-event over-voltage as ~1.5 GW of load dropped and the grid over-swung high

2025arXiv 2510.03867 (DC LVRT study)

May 4, 2026

NERC Level 3 "Essential Actions" Alert issued; 7 mandated actions, responses due Aug 3, 2026

2026NERC / Morgan Lewis; Utility Dive

≥75 MW / ≥50%

ERCOT Large Load site threshold; "Large Electronic Load" = ≥50% power-electronic computational demand (NPRR1234 / NPRR1308)

2025ERCOT NPRR1234, NOGRR 282

~0.50 pu, ~0.15 s

SPP demand-facility ride-through floor (0.90-1.10 pu continuous; short excursions to ~0.5 pu)

2025SPP demand-facility VRT curve

unity-0.95 lag

typical power-factor obligation at the POI for large transmission-connected loads

2026ISO/utility large-load tariffs (synthesis)

3-7+ yr

large-load grid interconnection lead time, ISO-dependent; ride-through now a study deliverable

2025ERCOT / PJM filings synthesis

Deep dive: the constant-power-load instability and why ride-through is hard

The reason a gigawatt of AI load is genuinely dangerous to ride through — rather than trivially "just keep the breaker closed" — is the physics of the constant-power load. A resistive load draws current proportional to voltage: when voltage sags, current falls, and the load helps the grid recover. An AI rectifier front-end does the opposite. It regulates output power, so as terminal voltage V sags, it raises input current I to hold P = VI constant. The incremental relationship dI/dV is negative — the load presents negative incremental resistance at the bus. During a sag this pulls voltage down further; during recovery it can drive oscillation. Aggregate hundreds of megawatts of this behavior at one POI and you have a destabilizing element, not a passive one.

This is why ride-through cannot be solved by relays alone. To hold the load on through a 0.2–0.4 pu sag, the DC-link energy buffer (capacitance + BBU/supercap) must supply the difference between what the depressed grid can deliver and what the load demands — for the full 80–150 ms fault window — without the rectifier current runaway tripping over-current protection. Recent work (arXiv 2510.03867, 2025) proposes decentralized controllers that damp the CPL behavior with bounded proportional gains, letting the load ride through and contribute to voltage stability rather than fight it. The practical 2026 design pattern: size rack BBUs/supercaps to span the fault window, tune rectifier controls to limit current rise during sag, set undervoltage retention below the deepest required ride-through point, and back the whole thing with a grid-forming BESS at the MV bus for the reactive injection that flattens the local voltage during the event. The cooling-side and on-die twins of this transient problem are in Chapter 4.5.

Deep dive: coordinating with the utility's dynamic model (MOD-032/026/027)

Ride-through and reactive support only protect the grid if the utility's planning model knows how your facility behaves. Under NERC's MOD standards, a transmission-connected entity must submit steady-state and dynamic model data to its Transmission Planner and Planning Coordinator — MOD-032 (data for modeling), MOD-026 (verification of reactive/voltage control model), MOD-027 (verification of frequency/active-power control model). For decades these standards were written with generators in mind. The 2026 NERC alert's most consequential effect is that they now bite on large loads: planners must collect computational-load model data (min/max MW, IT vs non-IT split, ride-through and frequency-response characteristics) and study annual stability margins in load-dense areas.

The stakes are sharp. If you submit a model that says "constant-power load, trips at 0.85 pu" — the legacy default — the planner is obligated to treat your gigawatt as a credible loss-of-load contingency and may require system-side reinforcement (or deny/delay your interconnection) to ensure no firm-load loss. If you submit a verified ride-through-and-support model and your as-built behavior matches it, you become a benign, planned element. The trap is a wrong model: as-built behavior that diverges from the submitted dynamic model is now a reportable reliability gap, discoverable by the dynamic fault recorders NERC mandates you install. Model fidelity moved from a compliance checkbox to an engineering deliverable that determines whether you energize. The canonical obligation framing — registration, applicability, and the standards cluster — is in Chapter 4.3.

Grid asset vs grid risk: the same building, two postures

Every decision in this chapter resolves to one strategic question: is your facility a grid asset or a grid risk? The risk posture is the legacy default and now the dangerous one — a voltage-sensitive constant-power megawatt block that protects its own electronics by tripping, modeled as a contingency, reinforced against at your cost, and increasingly disallowed at scale. The asset posture is the engineered alternative: a load that rides through faults, holds local voltage with dynamic VARs, sheds on under-frequency, and exports flexibility the balancing authority will pay for. The capital delta between the two — STATCOM or grid-forming BESS, retuned protection, a fast load-control layer, a verified dynamic model — is real but bounded, and against a 3–7+ year interconnection wait it is often the difference between energizing on schedule and being told to come back after the system is reinforced.

The 2026 inflection is that the grid stopped asking and started mandating. A facility scoped as a passive load is no longer just leaving grid-services revenue on the table (Chapter 15.8) — it is carrying a reliability liability that shows up as interconnection delay, study cost, and reportable exposure. The facility-as-grid-risk framing, and how it interacts with cluster goodput when the grid asks your load to flex, is developed in Chapter 12.2. Design the building as an asset from the first single-line, and the grid-interactive behaviors fall out of decisions you were making anyway about UPS topology, BESS sizing, and substation reactive plant. Bolt them on after the fact and every one of them is a retrofit.

The NERC obligation framing — registration, MOD/TPL/PRC applicability, and the transmission-connected compliance program — is canonical in Chapter 4.3. The transient physics behind ride-through (synchronized GPU load steps, the chip→BBU→BESS mitigation spine) lives in Chapter 4.5, with the on-site-substation and POI engineering in Chapter 4.2. Islanded inertia, grid-forming vs grid-following inverters, and synchronous condensers are in Chapter 4.8; the DC-bus and ride-through energy architecture connect to Chapter 4.7. The facility-as-grid-risk reliability reframing is in Chapter 12.2; the grid-services revenue these behaviors unlock is quantified in Chapter 15.8; and the workload flexibility that makes load-side frequency response cheap traces back to the archetypes in Chapter 1.1.