Guide › Site Selection, Power Procurement & Permitting › 3.3

Chapter 3.3

Power Availability & Power-Cost Structure

Energizing a site tells you whether you can run the machine; the power-cost structure tells you whether you should — because at 25–60% of opex, the price, shape, and curtailment terms of those electrons are the single largest controllable lever on lifetime return, and the cheapest headline rate routinely hides the most expensive bill.

POWER-BOUNDGOODPUT

What you'll decide here

Whether the candidate site clears the physical power-availability screen — transmission proximity, voltage class, and existing substation headroom — before you spend diligence dollars on anything else, because a site that needs a greenfield substation and a new line is a different project on a different schedule than one with bays to spare.
Which price denominator you are actually underwriting against: an all-in delivered ¢/kWh, or its decomposition into energy (nodal/LMP), congestion/basis, capacity, and demand charges — because the same headline tariff produces a 2–3x swing in effective cost depending on which components dominate your load shape.
How much curtailment exposure you will accept in exchange for queue speed and a lower energy charge — and whether your workload's goodput economics survive the expected curtailment-hours, because flexible/non-firm service is the cheapest fast electron and the easiest one to mis-price.
Who pays for the network upgrades your load triggers — you, via a large-load tariff and minimum-take, or the ratepayer base — because the cost-allocation regime in your jurisdiction can add single-digit-to-double-digit ¢/kWh of effective cost and is increasingly the deciding political variable on whether you get to build at all.
Which power-cost components are fixed at contract signing (and therefore irreversible for the tenor) versus which float with the market (and therefore must be hedged) — because a 15-year asset financed against a merchant energy curve is a fundamentally different risk than one behind a fixed-price PPA.

The previous chapter answered the binary question: can you get power to this site, and when. Chapter 3.2 treated interconnection as a yes/no gate against a schedule. This chapter answers the question that decides the economics once the answer is yes: what does that power actually cost, in what structure, and who bears the cost of the grid you are about to stress? The two questions are separable on purpose. A site can clear the speed-to-power gate and still be uneconomic, because the megawatts arrive at a price, a shape, and a set of curtailment and cost-allocation terms that the GPU fleet cannot earn against.

The reason this chapter carries so much weight is a single number that recurs across every credible cost model: power is 25–60% of operating cost over the life of an AI data center, and at scale it is the largest single opex line by a wide margin (domain synthesis; Epoch AI 1 GW model, 2026). Silicon dominates capex; electricity dominates opex. That makes the power-cost structure the dominant TCO lever you can still pull after the building exists — you cannot re-spec the chips, but you can, with enough foresight at siting, change the delivered cost of energy by a factor of two or three. This chapter works through each fork in the power-cost stack and the dollars that ride on getting it wrong.

Screen one: power availability is a physical question before it is a price

Before any tariff analysis, a site faces a physical screen that has nothing to do with price and everything to do with what steel is already in the ground. Three variables decide whether a site is powerable on a reasonable schedule at all: transmission proximity, voltage class, and substation headroom. Get these wrong and the cheapest energy market in the country is irrelevant, because you will spend three-to-five years and tens of millions building the wires before a single electron arrives.

Transmission proximity is the first cut. A gigawatt-class campus needs a high-voltage transmission tap — typically 230 kV or 345 kV, and increasingly 500/765 kV as backbone projects (ERCOT's ~2,500-mile 765 kV plan, PJM and SPP build-outs) reach the load. A site adjacent to an existing high-voltage line with spare thermal capacity is a fundamentally different proposition than one that needs miles of new line, every mile of which carries its own permitting, right-of-way acquisition, and 100+-week conductor and structure lead times. The distance to a viable tap is, in practice, a schedule variable, not a geography variable.

Voltage class sets both the equipment you must buy and the losses you will eat. The higher the delivery voltage, the more power you can move per ampere and the lower the I²R losses, but the more expensive and longer-lead the step-down transformers and the larger the customer substation footprint. A 1 GW load taken at 345 kV is an entirely different electrical interface — and a different substation cost — than 200 MW taken at 138 kV. This is an irreversible decision: the voltage class is baked into the substation, the protection scheme, and the interconnection agreement, and you cannot re-pick it mid-life without rebuilding the interface.

Substation headroom is the variable that separates a 12-month energization from a 5-year one. A utility substation with spare transformer capacity and an open bay can absorb a new large load by adding a feeder; one that is already at its firm rating requires a new transformer bank (128–160+ week lead time; up to ~60 months in constrained markets) or an entirely new substation. The highest-value pre-diligence a developer can do is to map, for every candidate site, the nameplate and firm rating of the serving substation, its current loading, and the queue of other large loads ahead of you for the same headroom. Headroom is a depletable shared resource, and you are rarely the only one shopping for it.

The fork that precedes price: brownfield headroom vs greenfield substation

The single most consequential availability decision is whether your load fits inside existing substation and transmission headroom or forces a greenfield substation plus network upgrades. The first path is measured in months and existing-asset cost; the second is measured in years, 100+-week long-lead transformers, new right-of-way, and a network-upgrade bill that may run into the hundreds of millions. The trap is that the greenfield site often has the cheaper energy — stranded generation, a low-LMP node, a hungry rural utility — which is exactly why it has no headroom: the wires were never built for gigawatt load. A lower ¢/kWh on energy is worthless if you pay for it with a three-year delay and a nine-figure upgrade. Screen for headroom first, then price the energy inside the sites that survive. → long-lead equipment in Chapter 3.2; electrical interface engineering in Chapter 4.8.

The power-cost stack: decomposing the headline rate

Operators routinely shop on a single number — a delivered ¢/kWh — and that number is almost always a trap, because it averages together components that behave very differently under an AI load shape. A defensible power-cost model decomposes the headline rate into four parts, each with its own driver, its own volatility, and its own hedge:

Energy (the commodity). In organized markets this is the nodal price / Locational Marginal Price (LMP) — the marginal cost of energy at the specific bus where you take power. LMP is the part everyone quotes, and it is the part that varies most by location and by hour. The same MWh can cost a few dollars at a wind-rich West Texas node and a hundred-plus dollars at a congested load center in the same interval.
Congestion / basis. LMP decomposes into a system energy price plus a congestion component plus a small loss component. Congestion is the value of transmission scarcity between generation and your node, and it is the most under-modeled line in a merchant power plan. A node downstream of a transmission constraint can settle persistently above the hub; a node co-located with stranded generation can settle below it, occasionally negative.
Capacity. In capacity markets (PJM, ISO-NE, MISO) you pay a separate charge for the resource adequacy your load obligates the system to procure. PJM's capacity auction clearing prices spiked roughly an order of magnitude in 2025–2026 as data-center load tightened the supply-demand balance, turning capacity from a rounding error into a material line item.
Demand charges. Regulated retail tariffs bill the largest line not on energy consumed but on peak demand (¢/kW of the monthly maximum), often with a ratchet that holds the charge near the annual peak for months. For a high-load-factor data center this is usually favorable — a flat 90%+ load factor amortizes the demand charge over enormous consumption — but for a spiky or curtailed load it can dominate.

The decomposition matters because two sites with the same delivered ¢/kWh can have completely different risk profiles. One may be 90% energy and 10% congestion (a merchant-exposed, basis-risk-heavy site); the other may be 50% capacity and demand charges (a regulated, schedule-stable site with little hourly exposure but a high floor). You cannot hedge what you have not decomposed. → PPA structures, basis risk, and the hedging toolkit (CRRs/FTRs, heat-rate hedges) live in Chapter 3.4.

The power-cost stack → driver, volatility, and how to manage it

Cost component	What it prices	Primary driver	Volatility	How it's managed
Energy (LMP)	Marginal energy at your bus, per MWh	Fuel + dispatch + your nodal location	High — hourly, can swing 10x+ intraday	Fixed-price or indexed PPA; load-shifting to off-peak (batch)
Congestion / basis	Transmission scarcity between generation and your node	Line constraints; queue of generation upstream	High and location-specific; can be negative	Nodal siting; CRRs/FTRs; co-location with generation
Capacity	Resource adequacy your load obligates	Reserve margin; capacity-auction clearing	Stepwise — auctions; PJM spiked ~10x in 2025-26	Demand response participation; curtailable interconnection
Demand charge	Monthly peak kW (often ratcheted)	Your peak demand vs your average	Low if load factor is high and flat	High, flat load factor; transient smoothing; peak shaving
Transmission / delivery	Use of the grid to deliver to you	Utility tariff; large-load rate class	Low-moderate; set by regulator	Voltage class (higher = lower per-MWh delivery); tariff election

Component decomposition of a delivered electricity price for a large-load AI campus in an organized market. Manageability is the practitioner's lever, not a guarantee. Percentages are illustrative of how mix varies by site, not fixed shares.

The main reason an AI data center is a good power customer — and the reason it can sometimes secure rates a flickering industrial load cannot — is its load factor. A facility running training or steady inference at a flat 80–95% of contract demand around the clock is the dream customer for a utility recovering the fixed cost of a substation and a line: it amortizes those fixed charges over the maximum possible consumption, driving the effective ¢/kWh of the delivery and demand components down toward their floor. This is why a high-load-factor data center can clear an economic deal on a tariff that would crush a peaky factory. The corollary is a warning: anything that lowers your load factor — curtailment, demand-response events, training-job idle gaps, a half-full hall during ramp — raises your effective per-MWh cost on the fixed components, because the same demand and delivery charges now spread over fewer megawatt-hours. The economics of curtailment are not just about the energy you forgo; they are about the fixed charges you keep paying on power you are not drawing.

25-60%

power as a share of AI data center operating cost; the largest single opex line at scale

2026Domain synthesis; Epoch AI 1 GW TCO model

~$0.6B/yr

energy cost in the canonical 1 GW model — the single largest opex line

2026Epoch AI / domain finance synthesis

~14.4-14.5¢/kWh

average commercial rate in the top-10 US data center states (≈ identical to other states)

2025-26Electric Choice / RealClearEnergy synthesis

~8.7¢/kWh

Texas/ERCOT commercial rate — among the lowest in the US on the deregulated market

2026Electric Choice

$0.011/kWh

new Virginia data-center electricity consumption tax (effective Jul 1 2026, sunsets Jul 1 2028) — an added cost line

2026Virginia HB30 (2026 budget, Item 3-5.24); Data Center Knowledge

~$10-12B/GW/yr

revenue per GW of AI capacity — the prize that power cost is netted against (contested — single-source)

2025SemiAnalysis (onsite gas)

~3-7+ yr

large-load interconnection lead time; up to ~10 yr in worst queues — the availability gate

2025ERCOT / PJM filings synthesis

~98-100 GW

US grid headroom integratable at 0.5% annual curtailment — the basis for flexible/non-firm deals

2025Duke Nicholas Institute

Nodal pricing, congestion, and the basis-risk trap

In an organized market — ERCOT, PJM, MISO, SPP, CAISO, ISO-NE — energy does not have one price; it has a price at every node, recomputed every five minutes. The Locational Marginal Price at your specific bus is what you actually pay (or, behind a PPA, what your settlement is referenced against). The temptation is to site at the cheapest-LMP node you can find. The trap is that the cheapest nodes are cheap for a reason: they sit downstream of stranded generation or behind a transmission constraint that depresses the local price — and that same constraint can reverse violently when the wind dies or the constraint binds the other way.

West Texas is the canonical illustration. Wind- and solar-rich nodes in ERCOT's West and Panhandle zones see frequent negative congestion — the local price drops below the hub because generation is bottled up behind transmission limits. A load co-located there can capture remarkably cheap energy. But the same congestion that gives you cheap power when the wind blows gives you expensive power, or curtailment, when it does not — and the basis between your node and the trading hub you likely hedged against can blow out, leaving you paying the difference. This is nodal basis risk, and it is the most expensive lesson in merchant power: a hedge struck at the hub does not protect a load settling at a node, and the gap between them is exactly where a low-LMP siting decision turns into a loss. → the full basis-risk and VPPA-settlement treatment, including negative-settlement stress tests, is in Chapter 3.4; the lender's view of that exposure in Chapter 2.5.

Deep dive: why the cheapest node is rarely the cheapest power

The instinct to chase the lowest LMP node treats nodal price as if it were a fixed property of a location, like land cost. It is not — it is the output of a dispatch optimization that depends on what generation is online, what transmission is constrained, and what every other load is doing in that interval. A node sits below the hub for one of two reasons, and they have opposite consequences.

Generation-pocket nodes are cheap because abundant local generation cannot all export — the classic West Texas wind case. Site a load here and you absorb the surplus, often at near-zero or negative prices. But this is precisely where curtailment risk concentrates: when the local generation drops or a line trips, the node can spike, and a flexible-interconnection agreement may force you to shed load exactly when you most want to run. You captured cheap energy by accepting correlated curtailment and price risk.

Load-pocket nodes are the inverse — expensive because demand exceeds what local lines can import. A load center behind a constraint settles persistently above the hub, and a new gigawatt of data-center load makes the constraint worse, lifting your own price and everyone else's. Adding load to a load pocket is self-defeating: you raise the price you pay by the act of consuming.

The practitioner's move is to model the distribution of hourly nodal price at the candidate bus across at least a full weather year, not the annual average — because the average flatters a node whose risk lives in the tails. Then decide whether to take that exposure raw (merchant), hedge it with a financial instrument and accept the residual basis (CRRs/FTRs against the hub), or eliminate it with a co-located or behind-the-meter supply that bypasses the nodal market entirely. → instruments and structures in Chapter 3.4; co-located and on-site generation in Chapter 3.5.

Curtailment exposure as a priced lever

The defining power decision of the 2026 era is no longer just "firm grid or not" — it is how much curtailment you will accept in exchange for queue speed and a lower energy charge. Every major RTO is building a faster interconnection lane for loads that agree to be curtailed: ERCOT's mandatory curtailment / 'kill switch' for large loads interconnecting after the end of 2025, SPP's price-responsive curtailment and non-firm tracks, PJM's non-capacity-backed and interim arrangements, and the FERC flexible-load study concept. The empirical prize is large: a Duke study found roughly 98–100 GW of US grid headroom integratable at just 0.5% annual curtailment — meaning a load willing to be shed for the equivalent of about two days a year unlocks capacity that firm service simply cannot get for years.

Curtailment is therefore a priced lever, not a binary. The question is whether your workload's goodput economics survive the expected curtailment-hours. A batch-inference or checkpoint-tolerant training fleet can absorb a few hundred curtailment-hours a year by rescheduling work or riding through on storage, paying for the privilege of a faster, cheaper interconnection. An always-on inference business serving an SLA cannot — every curtailed hour is breached revenue, and the cheap non-firm energy is a false economy. The honest underwriting question is not 'what is the curtailment cap' but 'what is the cost of the expected curtailment-hours given my workload, netted against the queue-time and energy savings I am buying.' → the load-flexibility and grid-services revenue framing lives in Chapter 15.8; curtailable/non-firm interconnection mechanics in Chapter 3.2.

Curtailment is cheap exactly when goodput is cheap to lose

The reason flexible interconnection is the biggest single schedule-and-cost lever of 2026 is a happy alignment: the workloads that can most tolerate curtailment — batch inference, embeddings, eval sweeps, checkpoint-tolerant pre-training — are the same workloads that have the lowest cost of a lost hour. A curtailed batch job reschedules; a curtailed checkpointed training run resumes from its last checkpoint with a bounded loss measured in minutes of recompute. So the load that accepts 0.5% curtailment to skip three years of queue and shave its energy charge is often giving up almost nothing it values. The mismatch — and the expensive mistake — is signing a curtailable agreement to win the queue, then filling the hall with latency-bound inference that cannot survive a single shed event. Match the curtailment terms to the workload's interruption tolerance, not to the queue you wanted to jump. → the goodput-vs-availability reframing in Chapter 1.8 applies directly here.

Network upgrade cost allocation: the 'who pays' question

A gigawatt of new load does not arrive on the existing grid for free — it triggers transmission and distribution upgrades, sometimes hundreds of millions of dollars of them. The defining regulatory and political question of 2025–2026 is who pays: the data center that caused the upgrade, or the ratepayer base that did not. This was a footnote five years ago. It is now frequently the deciding variable on whether a project gets approved at all, because rising residential bills in data-center-heavy regions have made cost allocation a live political fight.

The mechanism through which the answer is set is the large-load tariff — a new customer class, now approved in 23+ states, written specifically to push the cost of the buildout onto the loads that cause it. The Oregon POWER Act template is representative and worth memorizing because variants of it are spreading: a 20 MW+ threshold defines the class; the large load pays 100% of the distribution-upgrade cost it triggers; a 90% minimum demand charge (a take-or-pay floor — you pay for 90% of contracted demand whether you draw it or not) protects the utility against stranded investment if you under-build or leave; 10–30 year contract terms match the cost recovery to the asset life; and a per-kWh surcharge (≈1¢/kWh above 100 MW in the template) layers on top. ERCOT's SB6 regime is the Texas analogue, with a 75 MW large-load definition, mandatory curtailment, and a ≥$100,000 screening study fee. Virginia, the largest US market, layered on a flat $0.011/kWh data-center consumption tax effective July 2026.

The consequence for the model is direct and large: these allocation mechanisms can add anywhere from a fraction of a cent to several cents per kWh of effective cost, and the minimum-take floor converts a variable energy bill into a fixed obligation that behaves like debt. A take-or-pay floor means the curtailment savings discussed above are partially illusory — you keep paying the demand floor on power you are not drawing. The 'who pays' regime in your jurisdiction is therefore not a compliance detail; it is a first-order input to the power-cost stack and, increasingly, to whether the project clears its social-license gate at all. → the regulatory mechanics and the federal-vs-state collision over large-load interconnection are owned by Chapter 3.2; the macro load-growth and cost-shift narrative in Chapter 16.1.

Network-upgrade cost allocation → who bears the cost, and the consequence

Allocation regime	Who pays the upgrade	Typical terms	Effect on power cost	Project consequence
Legacy / socialized	Ratepayer base (everyone)	Standard tariff; no special class	Lowest direct cost to the operator	Politically unsustainable; driving the backlash and new tariffs
Large-load tariff (Oregon-style)	The large load (cost-causer)	20 MW+ class; 100% distribution upgrade; 90% min demand; 10-30 yr	Adds fraction-to-several ¢/kWh; take-or-pay floor	Defensible and durable; converts energy bill into debt-like obligation
ERCOT SB6 (Texas)	The large load (75 MW+)	Mandatory curtailment; ≥$100k study fee; 765 kV backbone	Lower energy; curtailment risk priced in	Faster queue if flexible; SLA-incompatible for firm inference
Consumption tax (Virginia)	The data center (flat per-kWh)	$0.011/kWh on all DC electricity from Jul 2026	Adds ~1.1¢/kWh flat to every MWh	Predictable but unavoidable; erodes the low-rate advantage

Large-load tariff and cost-allocation regimes vary by jurisdiction; the Oregon POWER Act template is the most-copied. Figures are template/representative terms, not universal.

Power as the dominant TCO lever

Pull the threads together and the chapter's central claim follows: after the chips are bought, power is the largest controllable variable in the lifetime cost of the asset. Silicon dominates capex and is essentially fixed at procurement — you buy the GPUs at market and depreciate them on a contested schedule (→ Chapter 1.8). The building shell is a small and slow-moving line. But energy — at 25–60% of opex, the single largest line at scale — is the cost you can still move by a factor of two or three through decisions made at siting: which node, which voltage class, which tariff, how much curtailment, behind-the-meter or grid.

The leverage is starkest when you net it against the revenue the asset earns. At roughly $10–12B of revenue per GW per year, a campus is netting an enormous topline against an energy bill that, mis-structured, can run to many hundreds of millions a year. A 2¢/kWh difference in delivered power cost — entirely achievable between a well-sited, well-hedged location and a poorly-structured one — is on the order of $175M/yr on a 1 GW facility at 100% load factor. That single siting-and-structuring decision dwarfs nearly every engineering efficiency in the rest of this guide. It is the reason power-cost structure earns its own chapter, and the reason the screen runs availability first, structure second, and only then everything else.

The final discipline is to sort the power-cost components by reversibility, the same way Chapter 1.1 sorted the scoping decisions. The irreversible components are fixed at contract: the voltage class and substation interface, the large-load tariff election and its take-or-pay floor, the contract tenor, and a fixed-price PPA's strike. The reversible components float and must be hedged or actively managed: the merchant energy and congestion exposure, the capacity-auction outcome, and the curtailment-hours you actually incur. A 15-year asset financed against a merchant energy curve is a different — and far riskier — instrument than one behind a matched-tenor fixed-price supply. Which of those two you are is a power-cost-structure decision, made here, that the lenders in Chapter 2.5 will price ruthlessly.

Deep dive: the power-tenor-vs-GPU-life matching problem, previewed

The most subtle power-cost decision is one of duration matching, and it sits at the seam between this chapter and the next. A large-load tariff or a fixed-price PPA often runs 10–30 years — the term the utility or generator needs to recover the substation, line, or plant it built for you. The GPU fleet inside the building obeys a 2–3 year frontier-economic life and a 5–6 year book life (→ Chapter 1.8). The mismatch is structural: you are signing a multi-decade fixed power obligation against a revenue stream generated by hardware that turns over four-to-ten times within that obligation.

The consequence is a real risk, not an accounting curiosity. If token prices deflate, if your workload mix shifts, or if a refresh strands part of the hall, the power obligation does not shrink — the take-or-pay floor keeps billing whether or not the GPUs behind it are earning. Conversely, a too-short power contract leaves you re-exposed to a merchant market that may have moved against you precisely when your fleet is most valuable. The matching problem has no free answer; it is a deliberate choice about which tenor of risk you would rather carry — fixed-price certainty that may outlive the workload, or merchant flexibility that may spike. The structures that manage this seam — physical vs virtual PPAs, indexed vs fixed, the firming and co-location options — are the subject of Chapter 3.4, and the downside stress tests of getting it wrong are run in Chapter 1.8.

This chapter owns the cost and structure of grid power once a site is energizable; it deliberately stops at the meter and at the PPA boundary. The interconnection process, queues, and speed-to-power that decide whether and when the power arrives are in Chapter 3.2; the reordered siting hierarchy that makes power the new #1 screen is in Chapter 3.1. The energy-supply strategy that this chapter sets up — grid PPA vs behind-the-meter vs co-location, physical vs virtual PPAs, nodal basis and VPPA-settlement risk, CRRs/FTRs, and the power-tenor-vs-GPU-life match — is the whole of Chapter 3.4, with on-site generation in Chapter 3.5. The electrical interface (substation, voltage class, transformers, protection) is engineered in Chapter 4.8. Curtailment and flexibility reframed as a revenue source live in Chapter 15.8; the macro load-growth and cost-shift narrative in Chapter 16.1; and the financing view of merchant-vs-contracted power exposure in Chapter 2.5 and Chapter 1.8. Metric definitions (load factor, LMP, $/MWh) are pinned in Chapter 0.3.