The Ultimate Guide to GPU cloud
A curated American edition of TechDay news, analysis, interviews, reviews, job moves, and related resources for GPU cloud.
What to know about GPU cloud
GPU cloud brings the power of advanced graphics processing units to on-demand infrastructure, unlocking large-scale AI, high‑performance computing, and rich media workloads without owning any hardware. From deep learning and generative AI to video processing and cloud gaming, this tag explores how GPU‑accelerated clouds are reshaping what developers, data scientists, and enterprises can build and deploy.
Stories here follow the fast-growing ecosystem around NVIDIA GPU Cloud and other GPU platforms, including new integrations with hyperscale providers, colocation and sovereign AI data centres, and edge deployments for 5G and telco networks. You’ll find coverage of partnerships, benchmarks such as MLPerf results, and validation programmes that show which clouds and software stacks deliver the most efficient GPU‑based training and inference.
These articles also dive into the tooling, orchestration, and security needed to run production AI at scale: vendor‑agnostic GPU control planes, AI factory architectures, liquid cooling for dense clusters, and guidance on governance and isolation. Whether you’re evaluating AI cloud providers, planning multi‑cloud or hybrid GPU strategies, or looking to understand how companies are commercialising GPU capacity, this tag offers a concise guide to the latest developments in GPU cloud infrastructure.
American GPU cloud News
Regional stories with direct local relevance
Hivemind & Berkeley launch darkmatter lab for AI research
Selected AI and blockchain projects at Berkeley will each receive at least USD $1 million in support before they form companies.
Portal26 launches free Claude governance for firms
Firms using Anthropic's Claude can now track usage and costs more closely as Portal26 rolls out a free governance tier.
Opaque hires Microsoft veteran as Chief Platform Officer
The appointment signals a push to help regulated firms deploy AI agents without risking data leaks or unauthorised actions in sensitive systems.
Analyst Insights
Research and market analysis connected to GPU cloudFeatured News
Expert Columns
Interviews
Interviews and video coverage from the networkRecent GPU cloud News
Agentic AI Foundation adds agentgateway as hosted project
The addition gives companies a shared layer for securing and routing AI traffic as agentic systems move into production.
PEAK:AIO & Los Alamos launch Lattice for AI storage
The open-source system is designed to ease storage bottlenecks that can leave costly GPUs underused in AI and high-performance computing clusters.
NetApp and Cisco expand FlexPod with enterprise AI systems
Enterprises could cut integration work and security risk as pre-tested FlexPod systems are aimed at production AI deployments and edge use cases.
CIQ expands Fuzzball to span five clouds & on-prem
The update lets AI and HPC teams move workloads across five clouds and on-premises, cutting duplication and simplifying GPU access.
Microsoft unveils AI agents, models & security tools
Developers and enterprise customers will get more AI controls as Microsoft adds agents, in-house models and security tools across its software stack.
OpenSpace tops 1,000 data centre projects worldwide
Rising demand for AI infrastructure is driving faster uptake of digital site monitoring, with OpenSpace now used on more than 1,000 projects.
Wallarm launches AI control platform on AWS Marketplace
Firms racing to deploy generative AI are exposing themselves to data incidents and compliance gaps, Wallarm says, as oversight lags.
Microsoft AI launches seven new models across tasks
The models are aimed at developers and enterprises, with Microsoft saying internal training could cut costs and improve control in regulated industries.
Linux Foundation launches Tokenomics Foundation for AI costs
Rising AI bills are pushing enterprises to seek neutral benchmarks, as token costs are now a CEO-level concern and newer model prices climb.
Delta launches modular AI data centre to speed build
AI operators could bring new capacity online faster, as Delta says its prefabricated system may cut data centre deployment time by 60%.
Lightmatter joins NVIDIA NVLink Fusion AI ecosystem
The move could help hyperscalers cut cabling in dense AI clusters by half as optical links become central to NVIDIA's custom-chip strategy.
CrowdStrike lifts guidance & announces four-for-one split
Investors got stronger sales, record free cash flow and higher full-year forecasts as the cybersecurity group also unveiled a four-for-one stock split.
AIONOS & Black Box form AI infrastructure alliance
Enterprises in India and beyond stand to gain a single vendor for AI infrastructure and software as the firms target GCC demand and global expansion.
Yondr secures dual facilities to fund Europe growth
The data centre developer gains extra funding headroom as tightening power access makes new sites harder to secure across Europe and North America.
Intel unveils Xeon 6+ & widens AI push at Computex
Rising AI inference demand is reshaping server and device design, prompting Intel to push new processors, edge systems and rackscale infrastructure.
Vultr named NVIDIA Exemplar Cloud after Blackwell tests
The benchmark win could help enterprises compare AI cloud performance more clearly as demand grows for reliable large-scale model training.
Nvidia & OpenNebula deepen integration for AI factories
Nvidia deepens its OpenNebula tie-up to automate multi-tenant 'AI factories', unifying GPUs, DPUs and networking under one control plane.
Zadara aligns AI cloud platform with NVIDIA security guide
Zadara aligns its sovereign multi-tenant AI cloud with NVIDIA's security guide to boost isolation, governance and shared GPU utilisation.