Covering Scientific & Technical AI | Wednesday, March 19, 2025

Together AI Taps Dell Technologies to Scale AI Acceleration Cloud for the Enterprise 

March 13, 2025 -- Together AI is teaming up with Dell Technologies to scale its AI cloud platform. This collaboration will integrate an end-to-end high-performance system with advanced compute and networking to drive scalability and efficiency to enterprises, startups and researchers building the future of artificial intelligence.

This collaboration leverages the Dell AI Factory with NVIDIA, delivered as liquid-cooled, scalable units optimized for complex AI and HPC tasks. Fully integrated Dell IR7000 racks with NVIDIA Blackwell accelerated computing at the core, with Dell Professional Services supporting each step of the deployment to ensure seamless and fast delivery.[I]

“Our teams worked hand-in-hand with Together AI and NVIDIA to quickly design and deliver AI systems that will provide the capabilities needed to drive AI innovation,” said Arthur Lewis, president, Infrastructure Solutions Group, Dell Technologies. “Together AI’s cloud platform, underpinned by Dell infrastructure, sets the standard for premium, scalable and efficient AI.”

“The work we’re doing with Dell Technologies and NVIDIA underscores our commitment to accelerating scalable, AI-driven advancements worldwide,” said Vipul Ved Prakash, CEO, Together AI. “Our cloud platform, combined with the performance and reliability of Dell infrastructure featuring NVIDIA technology, will provide customers with the speed and flexibility needed for the new wave of open source reasoning models and the coming era of superintelligence.”

High Performance AI Infrastructure

Dell and Together AI’s engineers worked closely to build an optimized AI cluster capable of performing complex AI and HPC tasks. The fully integrated Dell IR7000 racks feature Dell PowerEdge XE9712 servers, equipped with NVIDIA GB200 NVL72 platform, and PowerEdge XE9680 servers, featuring NVIDIA HGX B200 platform.

The cluster also leverages NVIDIA NVLink and NVIDIA Quantum-2 InfiniBand to enable high-speed, low-latency communication between GPUs, ensuring efficient data transfer for large-scale AI training and inference workloads. This architecture minimizes bottlenecks, accelerates distributed model execution and allows seamless scaling across multiple GPU nodes.

Fast Training and Inference Performance

Together AI’s infrastructure, accelerated by the Dell AI Factory with NVIDIA, underpins Together GPU Clusters and the Together Inference & Fine-Tuning Platform for fast training and inference performance. Together AI helps enterprises to deploy advanced reasoning models that leverage the NVIDIA Blackwell accelerated platform’s high-bandwidth memory, NVLink for multi-GPU efficiency and mixed-precision Tensor Cores—key to optimizing Mixture of Experts architectures commonly used by reasoning models. With Together AI’s research-backed optimizations, including FlashAttention-3, Speculative Decoding, and Mixture of Agents, enterprises can accelerate time-to-market for AI applications while maintaining high operational efficiency and reliability.

About Dell Technologies
Dell Technologies (NYSE: DELL) helps organizations and individuals build their digital future and transform how they work, live and play. The company provides customers with the industry’s broadest and most innovative technology and services portfolio for the AI era.


Source: Dell Technologies

AIwire