Guide › Strategy, Workload Archetypes & Economics › 1.6

Chapter 1.6

Procurement Archetypes: Build vs Buy vs Rent

How you acquire capacity is a bet on time, control, and the durability of your demand — and in a power-bound, fast-depreciating market the right answer is almost never the one that minimizes unit cost, because the expensive mistakes are made in the time and optionality dimensions, not the per-GPU-hour one.

POWER-BOUNDDENSITY-RAMP

What you'll decide here

Which procurement archetype — greenfield self-build, brownfield retrofit, wholesale/retail colocation, or neocloud/GPU rental — matches your time-to-power need, capital posture, control requirement, and (above all) the confidence you have in your own demand forecast.
Whether your demand is durable enough to underwrite an irreversible build, or uncertain enough that you should pay the option premium of leasing or renting to preserve the right to exit.
Which hybrid pattern (build-core-rent-edge, colo-anchor-plus-cloud-overflow, burst-to-neocloud) lets you anchor a base load you are sure of while keeping the volatile tail of demand on someone else's balance sheet.
Whether your power gate is the grid interconnection queue or a behind-the-meter generation path — because that single fact can compress or blow up the time-to-power of every other procurement choice.
Which decisions in the procurement stack are reversible (the rental contract, the overflow provider, the colo expansion option) and which are irreversible (the self-build slab, the take-or-pay power contract, the multi-year BTS lease) — and therefore where to spend your hedging budget.

Chapter 1.1 established the first orthogonal question — what runs here. This chapter is the second — how you acquire the capacity to run it. The two are independent: a frontier pre-training run can be served from a self-build, a colo, or a neocloud, and an online-inference business can be any of the four. But the procurement decision has a character all its own. The workload question is answered once and is mostly about physics. The procurement question is answered repeatedly, is mostly about finance and time, and is the place where most operators in 2026 are actually getting hurt — not because they pick the wrong cooling, but because they commit capital to a demand forecast they could not honestly defend, or because they rent at a premium for years a load they should have owned.

The fork is build vs buy vs rent, across four paths: greenfield self-build, brownfield retrofit, colocation (wholesale or retail), and neocloud / GPU rental. We trace each against the four levers that actually decide it — time-to-power, capital intensity, control, and the workload-duration / demand-certainty axis that quietly dominates the other three. We then treat the two structural realities that reshape the whole decision in 2026: the power gate (grid interconnection vs behind-the-meter generation), and demand uncertainty, which converts the build-vs-lease choice from a cost comparison into an options problem. The quantitative NPV that scores these paths lives in Chapter 1.8; the deal mechanics and accounting in Chapter 2.5. This chapter is the strategy that sits above both.

The four levers, and which one actually decides

Practitioners list four decision drivers — time-to-power, capital intensity, control, workload duration — as if they were peers. They are not. The first three are real constraints, but they are usually satisfiable in more than one way. The fourth — how long and how certainly the workload will run — is the lever that breaks ties and, more often than not, makes the decision outright. The reason is depreciation. An AI cluster is the rare capital asset whose economic life (2–3 years at the frontier) is shorter than the financing tenor used to buy it (5–6 year book life, longer debt). Commit to a self-build for a workload that evaporates in eighteen months and you own a stranded asset against a loan that outlives the revenue. Rent that same workload and you simply stop paying.

Time-to-power is the lever that has moved most since 2020, because the binding constraint is no longer chips or capital but megawatts. A neocloud delivers GPUs in minutes; a colo delivers a live cluster in 6–12 months; a self-build delivers in 24–36 months — and that self-build number is now gated less by construction than by the interconnection queue behind it, which can run 4–7 years in the densest US hubs. When time-to-power is the binding constraint, the procurement decision collapses toward whoever already holds energized capacity, almost regardless of unit cost. This is why neoclouds exist as a category and why colocation pricing hit record highs in 2025 at roughly ~1% vacancy: the scarce good is not space, it is power that is already on.

Capital intensity is the lever that decides who can play at all. A 1 GW self-build runs on the order of tens of billions up front; a colo lease converts most of that into a per-kW-month operating expense; a neocloud converts all of it into a per-GPU-hour rate. For a startup, an enterprise dabbling, or a sovereign program racing a political clock, the capex wall alone forecloses self-build. Control is the lever that pulls the other way: only a self-build (and to a bounded degree a retrofit) gives you authority over density, power architecture, cooling modality, fabric topology, and reliability posture — the very levers Chapter 1.1's cascade showed are load-bearing for a training-shaped facility. Rent, and you inherit the provider's fabric, oversubscription, and uptime — fine for inference, often disqualifying for a frontier training run that needs a 1:1 non-blocking back-end.

The master procurement fork: is your demand durable, or is it a bet?

The build-vs-rent fork is, at bottom, a question about your own forecast. If the load is durable, large, and well-forecast — your own first-party demand, a signed multi-year take-or-pay, a sovereign mandate — then owning it (self-build or long BTS lease) wins on long-run unit cost and you should accept the capex and the lead time. If the load is spiky, short, experimental, or genuinely uncertain, then renting wins not because it is cheaper per hour (it is not — it carries the highest unit cost) but because it preserves the option to walk away when the bet doesn't pay. The cardinal error of the 2026 era is treating a bet like a certainty: financing an irreversible build against a demand curve you assumed rather than contracted. The second error is its mirror: renting a load you are certain of for so many years that you pay the provider's margin several times over what owning would have cost. Decide which side of this fork you are on before you sign anything — it is the least reversible financial decision in the project.

The four archetypes, decision by decision

Greenfield self-build is the maximal-control, lowest-long-run-unit-cost, slowest, most capital-intensive path. You design the slab, the power chain, the cooling plant, and the fabric to your workload's exact cascade, and at scale and high utilization you reach the lowest cost per GPU-hour available — on the order of $0.74/GPU-hr for an H100-class cluster at 2,048-GPU scale and 90% utilization, against a neocloud median several times that. The price is 24–36 months to a live cluster, the deepest capital commitment, and an interconnection slot that is itself the scarcest asset in the project. Self-build is correct only when the demand is durable enough to amortize all of that — and disciplined enough to keep utilization high, because the entire unit-cost advantage evaporates below roughly 70% utilization, the breakeven where a debt-financed cluster flips from cash-positive to cash-negative.

Brownfield retrofit trades capital and schedule for a hard physics ceiling. Converting an existing air-cooled hall to host dense AI racks is fast and cheap relative to greenfield — roughly $2–3M/MW for the cooling upgrade alone, $5–10M/MW for a full AI retrofit — but the existing slab, plenum, electrical headroom, and (critically) the absence of facility water cap how far you can push density. A hall scoped for 40 kW air-cooled racks cannot absorb a 132 kW liquid-cooled NVL72 generation; an estimated two-thirds of pre-2015 data centers are simply unsuitable for frontier AI density without effectively rebuilding them. Retrofit is the right call for bridge capacity and for modest-density inference that lives comfortably under the air-cooling cliff — and the wrong call the moment your roadmap crosses into training or next-generation dense inference. → Chapter 5.4.

Colocation — wholesale (you lease a powered hall and own the IT) or retail (you rent cabinets and power by the rack) — buys time-to-power by renting someone else's energized shell. A live 50k+ GPU cluster can stand up in a wholesale hall in 6–12 months, against the 24–36 of a self-build, because the long-lead power and building work is already done. The cost is paid in a per-kW-month lease (global wholesale averaged roughly $217/kW-month in 2025, a record, ranging from ~$120 in Atlanta to ~$450 in Singapore) and in a shared shell whose redundancy tier and power envelope you accept rather than design. Colo is the natural home for medium-term, scaling, demand-uncertain workloads — and the anchor of most hybrid strategies.

Neocloud / GPU rental compresses time-to-first-job to days or weeks and converts capex entirely into opex, at the highest per-GPU-hour rate and the least control over the underlying fabric and reliability. Neocloud rates run 40–85% below the hyperscalers (an 8-GPU node averaging ~$34/hr on a neocloud vs ~$98/hr on a hyperscaler in 2026), which is what makes rental viable for spiky, short, experimental, or burst-overflow demand. But the category carries two 2026-specific cautions: pricing is volatile (the one-year H100 contract index rose ~40% between October 2025 and March 2026 as capacity tightened), and on-demand capacity has been effectively sold out across most GPU types with reserved capacity booked months ahead — so the "rent it instantly" promise is conditional on a market that periodically has nothing to rent.

Procurement archetype — the build vs buy vs rent decision

Archetype	Time-to-power	Capital	Control	Unit cost (per GPU-hr)	Best-fit demand profile
Greenfield self-build	24–36 mo (queue-gated; 4–7 yr in dense hubs)	Highest — full capex	Maximal — power, cooling, fabric, tier all yours	Lowest at scale (~$0.74 @ 2,048 GPU, 90% util)	Durable, large, well-forecast, multi-year first-party load
Brownfield retrofit	6–18 mo	Moderate — $2–3M/MW cooling; $5–10M/MW full AI	Bounded by existing slab, power, plenum, water	Low–moderate; stranded-capacity risk if density caps out	Bridge capacity; modest-density inference under the air cliff
Colocation (wholesale/retail)	6–12 mo to a live cluster	Capex-light — lease (~$217/kW-mo avg) + your IT	Shared shell; you own the IT, accept the envelope	Moderate; lease + IT depreciation	Medium-term, scaling, demand-uncertain; hybrid anchor
Neocloud / GPU rental	Days to weeks (when capacity exists)	Opex only	Least — provider owns fabric & reliability	Highest (~$2.3–3.5/hr neocloud median; volatile)	Spiky, short, experimental, or burst-overflow load

Lead times, retrofit costs, and pricing are 2026 practitioner ranges (SemiAnalysis, JLL/CBRE, Introl, McKinsey). Time-to-power for self-build is gated by the interconnection queue behind it, not construction. Unit-cost column is directional; see keynumbers for sources and vintages.

Down the unit-cost column the temptation is to self-build everything. Down the time-to-power and demand-profile columns the discipline reasserts itself: the cheapest-per-hour option is also the slowest and the one that punishes a wrong demand forecast most severely. The correct reading is that these archetypes are not competitors for a single decision — they are complements layered against the shape of your demand curve. That is what the hybrid strategies operationalize.

Hybrid procurement: layering certainty against volatility

In practice almost no serious operator picks one cell. The demand curve has a stable base and a volatile tail, and the right strategy matches each layer to the archetype whose cost structure fits it. Three patterns recur:

Build-core-rent-edge. Own the durable base load where you are most certain of demand and most need control (a self-built training core), and rent the geographically distributed inference tail from colos and neoclouds near users, where proximity matters more than unit cost and demand is harder to forecast site by site. This is the canonical hyperscaler and large-neocloud posture, and it maps cleanly onto Chapter 1.1's training-shaped vs inference-shaped fork: own the training-shaped, rent the inference-shaped.

Colo-anchor-plus-cloud-overflow. Anchor a committed, predictable load in a wholesale colo on a multi-year lease (capex-light, fast, controllable enough), and absorb peaks — the bursty 30%-to-90% swings an online-inference business sees in minutes — by bursting to a neocloud at the margin. You pay the rental premium only on the volatile fraction, not the whole fleet, which is the point: rental's high unit cost is acceptable precisely when it is applied to the part of demand you could not have committed to in advance.

Burst-to-neocloud (bridge and experiment). Run steady state on owned or colo capacity, and use neocloud rental as a bridge while a self-build energizes, as elastic capacity for a one-off training run or evaluation sweep, or as the proving ground for a workload whose durability you do not yet trust. The moment a rented workload proves durable and large, the hybrid logic says re-home it to owned capacity — the crossover point where owning beats renting is a function of utilization, depreciation schedule, and resale liquidity, and is treated quantitatively in Chapter 1.8.

Rental's real product is optionality, not GPUs

The instinct is to read neocloud rental as overpriced compute — and on a pure per-GPU-hour basis, it is, by a multiple. But that framing misses what you are actually buying. Rental's product is the option to be wrong: to discover the workload was a dead end and stop paying, to discover it was a hit and re-home it, to wait out an interconnection queue without idling, to learn your real demand curve before you pour concrete against an assumed one. Priced as compute, neocloud looks like a bad deal you tolerate for speed. Priced as an option on an uncertain demand forecast, it is frequently the cheapest line in the plan — the premium you pay to avoid committing tens of billions to a curve you cannot yet defend. The operators who get this wrong in 2026 are not the ones who rent too much; they are the ones who treat a bet as a certainty and build against it.

24–36 mo / 6–12 mo / days–weeks

time-to-power: greenfield self-build vs wholesale colo (live 50k+ GPU cluster) vs neocloud

the speed-vs-control trade — self-build costs you two years before a dollar comes in

2026SemiAnalysis; JLL 2026 Outlook; Introl

$2–3M / $5–10M per MW

brownfield retrofit cost: cooling-only vs full AI retrofit; ~2/3 of pre-2015 DCs unsuitable for frontier density

most existing real estate can't be cheaply upgraded — don't assume your old shells fit AI

2025Introl / Tetra Tech / Schneider synthesis

~$217/kW-month

global wholesale colo average 2025 (record); ~$120 Atlanta to ~$450 Singapore; ~1% vacancy

near-zero vacancy means you take what's offered at the price offered — a seller's market

2025JLL / CBRE synthesis

~$0.74/GPU-hr

self-build TCO at 2,048-GPU scale, 90% utilization (~$1.03 small clusters) vs neocloud median ~$2.3–3.5/hr (contested — single-source)

the cost floor that decides whether owning beats renting at your utilization

2025SemiAnalysis cost / H100 rental analyses

40–85% below

neocloud GPU rental vs hyperscaler pricing (8-GPU node ~$34/hr neocloud vs ~$98/hr hyperscaler)

a hyperscaler can cost several times a neocloud — the convenience premium is huge

2026SemiAnalysis H100 Index / AM Compute

~+40%

rise in the 1-year H100 rental contract index, Oct 2025 to Mar 2026, as capacity tightened; on-demand largely sold out

rental prices are climbing and supply is gone — lock capacity now or pay more later

2026SemiAnalysis H100 Rental Index

~70%

breakeven utilization for a debt-financed cluster; swings -$330k to +$340k/mo (55% vs 85%) on a 1,024-GPU H100 build (contested — single-source)

below this the same hardware that prints money bleeds cash

2025AM Compute / McKinsey

~3–7+ yr

US large-load grid interconnection lead time end-to-end; up to ~10 yr in worst queues — the gate behind self-build

the power wait is why self-build is slow — capital and chips aren't the bottleneck

2026LBNL Queued Up; ERCOT / PJM filings

The power gate: grid interconnection vs behind-the-meter

Every procurement choice above assumes power exists to be procured. In 2026 that assumption is the project's most dangerous one. Time-to-power is set less by how fast you can build than by how fast you can energize, and energization runs through one of two gates — grid interconnection or behind-the-meter generation — whose lead times can dominate every other line in the schedule. A self-build that takes 24 months to construct but sits behind a 5-year interconnection queue is, functionally, a 5-year project. This is why the power gate belongs in the procurement chapter and not only in the power chapters: it can invert the entire build-vs-rent ranking.

Grid interconnection is the default and, increasingly, the bottleneck. US large-load interconnection runs roughly 3–7 years end-to-end and up to ~10 in the worst queues, against a national queue measured in thousands of gigawatts of requested capacity. The regulatory ground is moving fast: FERC's December 2025 PJM colocation order forced standardized service terms for loads co-located with generation, and DOE's late-2025 directive sought to federalize interconnection for loads above 20 MW — both aimed at the queue, both unresolved enough as of mid-2026 to be a planning risk rather than a planning certainty. The consequence for procurement is direct: if your candidate site's queue position is bad, colo or neocloud is not a fallback, it is the only path to revenue inside your depreciation window.

Behind-the-meter (BTM) generation — gas turbines, on-site generation, increasingly nuclear and SMR PPAs — is the escape hatch, and it has become a primary procurement gate in its own right. On-site gas can bring power in 18–36 months, faster than many grid queues, which is why announced BTM gas reached on the order of 82–101 GW cumulatively by 2026 (though only a few GW were actually online by mid-year — an announcement-stage figure, not a built one). Aeroderivative turbines, the fast option, now carry 18–36 month lead times themselves as orders surge. Choosing BTM buys speed-to-power and revenue — getting 200 MW online six months early is worth ~$1B at ~$10–12B/GW/yr of AI revenue — but trades grid economics, exposes you to fuel and emissions risk, and locks in generation assets with their own depreciation profile. → Chapter 3.2 (speed-to-power) and Chapter 3.1 (the reordered siting hierarchy).

Deep dive: powered shell vs build-to-suit vs colo lease — buying optionality under demand uncertainty

Within the build-and-lease spectrum sit three structures that look similar on a balance sheet and behave very differently under demand uncertainty. Understanding the difference is how a developer or operator keeps the irreversible decisions reversible.

Build-to-suit (BTS) is a developer constructing a facility to your specification, which you then occupy on a long lease — commonly 15 years at roughly $150–220/kW-month for a credit tenant. It is real-estate-like: you get a purpose-built hall without the construction capex, but you commit to the term, and the term is the risk. A 15-year lease against a workload whose economic life is measured in single-digit years is a demand bet dressed as a real-estate deal. BTS is right when your base load is genuinely durable and your credit can command favorable terms.

Powered shell decouples the slow part (the building, the power, the cooling-ready envelope) from the fast part (the IT fit-out). A developer delivers an energized, cooling-ready shell — quoted in $/SF rather than $/kW of critical load — and you complete the interior to your own density and cooling spec, in roughly half the time of a full turnkey build. The strategic value is optionality: the powered shell preserves your right to fit out for the generation and density you actually face when the time comes, rather than freezing that decision at lease signing. It is the structure that most directly converts Chapter 1.1's density-ramp trap into a hedge — reserve the irreversible substrate (power, water, floor loading, cooling-ready envelope), defer the reversible IT spend.

Colo lease (occupying an existing wholesale hall) preserves the most optionality of the three — the option to exit at lease end, to expand into adjacent capacity, or to walk — at the cost of accepting the shell's existing redundancy tier and power envelope. The ranking by optionality is therefore colo lease > powered shell > BTS > self-build, and the ranking by control is the exact inverse. Where you sit on that spectrum should be set by how confident you are in your demand: the less certain the forecast, the more you should pay for the right to change your mind. The quantitative NPV of leasing optionality under demand uncertainty is built out in Chapter 1.8; the structuring and accounting in Chapter 2.5.

Reversible vs irreversible in the procurement stack

Chapter 1.1 sorted the physical decisions by reversibility. The procurement stack has its own register, and it is where the demand-uncertainty hedge actually gets spent.

Irreversible (decide once, hedge now): the self-build slab and its interconnection slot; the take-or-pay power contract (a multi-year firm commitment to buy energy whether or not you use it); the long BTS lease; and the behind-the-meter generation assets, which carry their own depreciation and cannot be unwound cheaply. These are the decisions where a wrong demand forecast compounds — you pay for capacity you do not use against a contract you cannot exit.

Reversible (defer, keep cheap): the neocloud provider and the rental volume (switch or stop monthly); the colo expansion option (exercise it only when demand materializes); the burst-overflow relationship; and the IT fit-out generation within a powered shell. The strategic move, identical in spirit to Chapter 1.1's, is to convert irreversible commitments into reversible ones wherever the option premium is cheap: a powered shell instead of a BTS, a colo lease with expansion rights instead of a self-build, a burst-to-neocloud bridge instead of energizing owned capacity ahead of demand. You pay the rental and lease premiums precisely as insurance against the demand curve you assumed being wrong. → refresh and re-home execution in Chapter 14.9.

The take-or-pay and over-rental traps

Two symmetric mistakes recur. The first is the take-or-pay trap: signing a multi-year firm power contract (or a 15-year BTS lease) sized to a demand forecast that does not materialize, so you pay for megawatts you cannot fill against a depreciation clock that runs regardless. This is how a self-build's unit-cost advantage inverts — below ~70% utilization the owned cluster is cash-negative, and a take-or-pay floor removes your ability to scale the cost down with the shortfall. The second is the over-rental trap: keeping a large, durable, high-utilization base load on neocloud rental for years because rental felt safe, paying the provider's 40–85% margin again and again on a workload you should have re-homed to owned capacity the moment its durability was proven. The first error over-commits to certainty you do not have; the second under-commits despite certainty you do. The register that prevents both is a periodic re-test of every rented load against the own-vs-rent crossover, and every owned commitment against realized utilization.

Procurement decision sequence

The archetype choice is not a single pick but a short ordered set of questions, each of which can foreclose later options:

1. How certain is the demand? Durable and contracted → own (self-build or BTS). Spiky, short, or a genuine bet → rent. Somewhere between → colo-anchor with rental overflow. This is the master fork and it gates everything below.
2. What is the time-to-power need? Revenue needed in weeks → neocloud. In months → colo. In years, and demand justifies the wait → self-build. If the need is fast but the demand is durable, bridge with rental while a build energizes.
3. What does the workload demand in control? Frontier training needing a 1:1 non-blocking fabric and a specific density/cooling cascade → own or wholesale colo. Inference that fits a node → any path, including rental.
4. What is the power gate? A bad interconnection queue can convert "self-build" into "colo or neocloud by necessity," or push you toward a behind-the-meter path with its own lead time and economics. Resolve this before committing to any owned path.
5. Which commitments are you about to make irreversible? Tag every take-or-pay, lease term, slab, and generation asset, and ask whether a powered shell, an expansion option, or a rental bridge could keep it reversible at an acceptable premium.

The output of this sequence is the procurement layer of the design-basis document from Chapter 1.1: which load is owned, which is leased, which is rented, what the power gate is, and which commitments are hedged versus fixed — signed before any long-lead equipment or power contract is ordered.

Anti-patterns

Each recurring procurement mistake comes from optimizing one lever in isolation and ignoring the demand-certainty axis that should have governed it:

Building against an assumed demand curve. Self-building (or signing a long take-or-pay) for a workload whose durability was assumed, not contracted. When the curve disappoints, you own a stranded asset that runs cash-negative below ~70% utilization against debt that outlives the revenue. The fix is to rent or colo the uncertain fraction until it proves durable.
Renting a certainty. Keeping a large, durable, high-utilization base load on neocloud for years out of inertia, paying a 40–85% margin repeatedly on compute you should own. The fix is the own-vs-rent crossover re-test in Chapter 1.8.
Ignoring the power gate. Selecting a self-build site on land and construction economics while the interconnection queue behind it quietly makes the project a 5–7 year one — missing the depreciation window entirely. The fix is to treat queue position and the BTM alternative as a first-order procurement input, not a later power-engineering detail.
Retrofitting past the air-cooling cliff. The procurement-flavored version of Chapter 1.1's cooling-cliff anti-pattern: choosing a cheap, fast brownfield path for a roadmap that will cross into training-class density, then stranding capacity at a cost that would have funded a purpose-built liquid hall. → Chapter 5.4.

This chapter sits between the workload framing and the economics. The five workload archetypes that set the control requirements behind each procurement choice are in Chapter 1.2 (training), Chapter 1.3 (inference), Chapter 1.4 (post-training/RL), and Chapter 1.5 (edge); the master scoping frame is Chapter 1.1 and the per-subsystem requirements matrix is Chapter 1.7. The quantitative own-vs-lease-vs-rent NPV, the leasing-optionality valuation, and the own-vs-rent crossover live in Chapter 1.8; the deal structuring, financing, and accounting in Chapter 2.5. The power gate that can invert the whole ranking is engineered in Chapter 3.1 (siting hierarchy) and Chapter 3.2 (speed-to-power); the retrofit physics in Chapter 5.4; and the refresh and re-homing execution in Chapter 14.9.