
Meta CEO Mark Zuckerberg attends a dinner hosted by US President Donald Trump with tech leaders for a dinner in the State Dining Room of the White House in Washington, DC, on September 4, 2025. SAUL LOEB/Getty images
Meta Platforms is developing plans to sell its surplus AI computing capacity to outside customers, Bloomberg News reported Wednesday, a move that would make the social media company a direct competitor to Amazon Web Services, Microsoft Azure, and Google Cloud — while simultaneously threatening two of its own largest infrastructure suppliers. Meta shares surged more than 10% on the news, adding roughly $149 billion in market value to a stock that had fallen nearly 15% against the S&P 500 this year. For developers, cloud buyers, and anyone tracking where AI infrastructure is headed, the report delivers a concrete answer to a question Wall Street has been asking all year: what is Meta planning to do with $125 billion to $145 billion in capital expenditure?
Read more: AI Compute Shortage Forces Google to Ration Gemini for Meta Despite $460B Backlog
The effort sits inside an internal initiative called Meta Compute, which is responsible for building and managing the company's AI infrastructure. Three executives lead it: Santosh Janardhan, Meta's head of infrastructure; Daniel Gross, who oversees Meta Superintelligence Labs; and Dina Powell McCormick, the company's president. That combination — an infrastructure chief, an AI research head, and a senior business executive — signals an organized push into the enterprise market, not a speculative side project.
Two distinct offerings are under consideration, according to people familiar with the matter who asked not to be named because the plans remain in development.
The first is a hosted AI model service. Developers would pay to run queries against AI models — including Muse Spark, Meta's proprietary closed-weight foundation model launched on April 8 — hosted on Meta's infrastructure. The model is accessible only through Meta's platform, through a mechanism similar to how Amazon Web Services structures its Bedrock product, which gives developers API access to foundation models without requiring them to provision or manage any hardware. The second is raw compute capacity: leasing bare GPU cycles to third parties, following the neocloud model pioneered by companies like CoreWeave.
Meta declined to comment. The plans are still in development and could change.
The two product offerings map to two fundamentally different cloud business models — and understanding the difference matters for any developer evaluating Meta as a future vendor.
Hosted AI model services work through an abstraction layer. When a developer sends a prompt, the request hits an API endpoint; the cloud provider handles GPU allocation, load balancing, model weight management, inference optimization, and failover. Under the hood, AWS Bedrock uses PyTorch compilation, CUDA graph optimizations, and kernel fusion to reduce inference latency while keeping the hardware invisible to the customer. Billing is per-token or per-invocation. The customer's engineering team writes application code and never touches a GPU.
Raw compute rental — the neocloud model — is the opposite. The customer gets bare-metal access to GPU hardware, typically NVIDIA H100s or Blackwell-generation chips, with no virtualization overhead. Bandwidth between GPUs inside a cluster uses NVLink-4, which delivers 900 gigabytes per second of intra-node throughput, and InfiniBand for ultra-low-latency communication across racks. Customers deploy their own software stacks directly on the hardware. Pricing is per GPU-hour — CoreWeave has charged roughly $1.39 per hour for an H100, compared to approximately $3.67 per hour on Azure for equivalent performance — and commercial terms typically run as multi-year take-or-pay contracts, meaning the customer commits to paying for a fixed number of GPU-hours at a fixed rate regardless of whether they use the capacity.
Meta's structural advantage, if it enters this market, is scale. The company is spending $125 billion to $145 billion on AI infrastructure in 2026 alone, including a 2,250-acre hyperscale campus in Louisiana and a one-gigawatt data center under construction in the American Midwest. At that size, Meta can enter the GPU rental market with a fleet that rivals or exceeds what CoreWeave or Nebius currently operate, without having taken on the GPU-collateralized debt that neoclouds rely on to finance their hardware.
Meta's surplus capacity is structural, not accidental. AI training workloads are inherently bursty: running a frontier model training run consumes GPUs at close to 100% utilization for days or weeks, then the cluster sits at 30% to 50% utilization during the inference serving period between training runs. At Meta's capex scale — a buildout sized to train future generations of Muse Spark and its successors — the company has been constructing for peak training demand. The result is a large block of GPU capacity that goes underutilized whenever active training runs are not in progress. That surplus is what Meta Compute would monetize.
The precedent is SpaceX's xAI unit, which built its Colossus 1 data center in Memphis to train the Grok AI model but found itself with more capacity than it could fill internally. In May, Anthropic signed a deal to lease all of Colossus 1's compute at $1.25 billion per month through May 2029; in June, Google agreed to pay $920 million per month for capacity at Colossus 2. The SpaceX parallel matters because it established a market price for hyperscale GPU capacity leased to major AI developers — and Meta is watching.
The sharpest market reaction Wednesday was not Meta's gain — it was the damage inflicted on neocloud providers. CoreWeave shares fell roughly 13% to 15%; Nebius fell by a similar margin. IREN declined about 6.5%.
The concern is structural. Neocloud companies built their businesses on a scarcity condition: AI demand moved faster than the traditional cloud providers could expand GPU capacity, creating a window for specialized GPU-rental shops. CoreWeave filled that window partly by signing a $21 billion capacity agreement with Meta — a deal announced April 9 and running through December 2032. Nebius signed a deal worth up to $27 billion with Meta over five years, with capacity commitments beginning in 2027. Those contracts are now double-edged. Meta has been one of the neoclouds' most important customers. If Meta builds out enough internal capacity to supply itself — and then sells the rest externally — it simultaneously removes a source of demand and adds a source of supply to the same market.
D.A. Davidson managing director Gil Luria put it directly: "The impact of adding Meta's capacity to the market is more likely to be on neoclouds than the big hyperscalers. Those companies like CoreWeave and Nebius rely on Meta for their growth and Meta may not need them anymore."
The financing implication is real. The neocloud business model depends on multi-year take-or-pay contracts to secure collateral for GPU-backed debt. CoreWeave's $8.5 billion debt facility — the first investment-grade rated GPU-backed financing, priced in March 2026 — borrowed against exactly this kind of contracted cash flow. If Meta's contracts with CoreWeave and Nebius face renegotiation or non-renewal in coming years, the collateral underpinning that debt structure weakens.
Read more: Meta's $14.3B AI Bet Hits a Training Data Wall: Zuckerberg Admits Mistakes
Meta's push into cloud infrastructure also reflects a competitive lesson from earlier this year. Google restricted Meta's access to its Gemini AI models around March 2026 because it could not supply the full computing capacity Meta was requesting, according to Financial Times reporting. The shortfall forced Meta to tell employees to ration AI token consumption and pushed the company to accelerate its reliance on Muse Spark — its own closed-weight model — in place of Gemini for content moderation, scam detection, and internal coding workflows.
That episode illustrates why a company at Meta's scale cannot simply purchase its way to sufficient AI compute: even Google, one of the world's largest data center operators, ran short. Owning the infrastructure — and controlling who uses it — is the only reliable hedge. A cloud business turns that hedge into a revenue line.
The incumbent cloud providers — AWS, Azure, and Google Cloud — have spent decades building the software platforms, enterprise sales capabilities, developer ecosystems, and customer support infrastructure that sit above the hardware. Meta has data centers and GPUs. It does not yet have a published pricing model, an announced launch date, a disclosed customer pipeline, or an enterprise sales organization oriented toward external cloud customers.
Analysts note that the competitive threat to established cloud providers is limited in the near term. AWS, Azure, and Google Cloud are deeply embedded in enterprise software stacks through identity management, database services, networking, and developer tooling that Meta cannot replicate quickly. The nearer-term disruption is at the infrastructure layer — where CoreWeave, Nebius, and other GPU-specialist providers compete primarily on hardware availability and price, not on software depth.
Zuckerberg framed the opportunity carefully at Meta's May shareholder meeting. "It's definitely on the table," he told investors, adding that companies were approaching Meta "almost every week" asking to buy AI model access or spare computing power at a premium. "We haven't done that yet, because we think that we have a use for the compute. But obviously, if we get to a point where we feel that we have overbuilt, then that is an option that we have."
That framing — not a commitment, but a credible signal — was enough to move the market sharply. Whether Meta follows through with a commercially viable cloud product, or treats compute sales as an opportunistic pressure valve during periods of genuine excess, will determine whether today's neocloud selloff reflects the beginning of a structural shift or a one-day repricing on an unconfirmed report.
What is Meta Compute, and how would it work?
Meta Compute is an internal initiative at Meta Platforms to commercialize the company's AI computing infrastructure. Under plans reported by Bloomberg on July 1, 2026, it would offer two services: developer access to AI models hosted on Meta's own GPU infrastructure — similar to how AWS Bedrock lets developers query third-party AI models through a single API — and raw GPU capacity rented to third parties by the hour, similar to what CoreWeave and other neocloud providers offer. No pricing, launch date, or customer pipeline has been officially announced.
Why are CoreWeave and Nebius stock falling on this news?
Both companies are major suppliers of AI computing capacity to Meta — CoreWeave holds a $21 billion agreement through December 2032, and Nebius holds a deal worth up to $27 billion over five years. If Meta brings enough of its own infrastructure online to supply its internal needs and then sells the rest externally, it simultaneously loses the neoclouds as a customer while adding itself as a competitor in the same market. The double impact threatens both the revenue and the contract-renewal outlook that neocloud businesses use to secure GPU-collateralized debt financing.
What makes a neocloud different from AWS or Google Cloud?
Neoclouds like CoreWeave and Nebius specialize in bare-metal GPU access for AI training and inference workloads, with pricing that has typically run 50% to 66% below what hyperscalers charge for equivalent performance. Where AWS and Google Cloud offer broad software ecosystems, databases, and enterprise tooling on top of their hardware, neoclouds offer raw GPU cycles under multi-year take-or-pay contracts — competing on hardware availability and price rather than software depth. Meta's entry would put it in that price-and-availability race before it has built the software layer.
What is the largest implication for developers looking to buy AI compute?
A major new entrant at hyperscaler scale — with no need to take on GPU-collateralized debt and no legacy software platform to protect — could meaningfully compress GPU rental pricing over the next one to two years, especially if other hyperscalers follow Meta's lead. Developers currently locked into multi-year neocloud contracts at today's rates may find those contracts are priced above market if Meta and similar players absorb the structural surplus the current overbuild is creating.
