From Supercomputing 2023,
WekaIO (WEKA) announced that it is working with Applied Digital Corporation, a designer, builder, and operator of next-generation digital
infrastructure for high-performance computing (HPC) applications and workloads,
to provide an underlying accelerated software architecture underpinning its
recently launched AI Cloud Service. Applied Digital is developing a turnkey
offering that allows enterprise customers to purchase and use GPU resources
on-demand, at scale, across its data centers for AI model training and
inference in virtualized, bare-metal, and containerization environments.
Artificial intelligence (AI) holds significant
business potential that many enterprises are eager to capitalize on, like
supporting the creation of new value propositions, increasing operational
efficiencies, and accelerating research and development or time to insight.
However, the high cost and complexity of owning and
operating thousands of GPUs for AI is prohibitive. Customers turn to providers
like Applied Digital for cloud access to support AI and HPC workloads. Applied
Digital's customers use some of the world's largest GPU clusters, with plans to
scale to 10,000 GPUs or more per cluster, requiring data access of one terabyte
per second or more.
The WEKA® Data Platform optimizes data pipelines for
generative AI and HPC workloads, boosting GPU data center efficiency. WEKA
customers can achieve 1.8 terabytes per second of bandwidth in a single rack,
lowering infrastructure costs and energy usage, improving GPU utilization by up
to 20 times, and reducing training time by 10-100 times.
Applied Digital has partnered with WEKA to improve
GPU utilization and efficiency, deliver high-performance data storage and
management capabilities, simplify data onboarding, and allow customers to
seamlessly scale deployments ranging from a fraction of a GPU to tens of
thousands of GPU servers.
"The rapid rise of generative AI has intensified
demand for the GPUs required to build AI models and applications, but many
businesses don't want to deal with the complexity or expense of owning and
housing them," said Mike Maniscalco, Chief Technology Officer at Applied
Digital. "Customers looking for a facility to provide access to large-scale GPU
servers on-demand are hard-pressed to find it. Applied Digital's AI Cloud
Service offerings were created to improve that access, and the WEKA Data
Platform has been key to helping our GPUs run more efficiently and
cost-effectively at scale with exceptional performance - and pass those
benefits on to our customers."
"Visionary companies like Applied Digital are wisely
responding to skyrocketing demand for GPU-accelerated computing resources at
scale by creating innovative "GPU-as-a-service" offerings to support the
development of generative AI applications on demand. But running large-scale,
multi-GPU clusters is hugely performance and resource-intensive - if you can't
fully maximize the potential of your GPUs or keep their energy consumption in
check, you can't deliver optimal productivity, cost optimization, and value to
your customers," said Jonathan Martin, president at WEKA. "This is quite
literally what the WEKA Data Platform was built for. WEKA is proud to be on the
bleeding edge of this rapidly emerging market, helping to bring maximum
performance, flexibility, efficiency, and sustainability to the world's leading
GPU cloud providers."