Keysight Applied sciences has launched Keysight AI (KAI) Data Centre Builder, a software program suite designed to emulate real-world workloads to guage how new algorithms, elements and protocols influence the efficiency of synthetic intelligence (AI) coaching.
Basically, the workload emulation capabilities of the KAI Knowledge Centre Builder are attributed with enabling AI operators, graphics processing unit (GPU) cloud suppliers and infrastructure suppliers to deliver sensible AI workloads into their lab setups to validate the evolving designs of AI clusters and new elements. They’re stated to have the ability to experiment to fine-tune mannequin partitioning schemas, parameters and algorithms to optimise the infrastructure and enhance AI workload efficiency.
Behind the KAI Knowledge Centre Builder’s operation is the truth that AI operators use numerous parallel processing methods, equivalent to mannequin partitioning, to speed up AI mannequin coaching. Aligning mannequin partitioning with AI cluster topology and configuration enhances coaching efficiency. Keysight Applied sciences stated that throughout the AI cluster design section, vital questions are greatest answered by way of experimentation, with many targeted on information motion effectivity between the GPUs.
Key concerns embody scale-up design of GPU interconnects inside an AI host or rack; scale-out community design, together with bandwidth per GPU and topology; configuration of community load balancing and congestion management; and tuning of the coaching framework parameters.
KAI Knowledge Centre Builder’s workload emulation functionality is constructed to combine giant language mannequin (LLM) and different AI mannequin coaching workloads into the design and validation of AI infrastructure elements, particularly networks. It’s meant to allow tighter synergy between {hardware} design, protocols, architectures and AI coaching algorithms, boosting system efficiency.
The workload emulation service reproduces community communication patterns of real-world AI coaching jobs to speed up experimentation, scale back the educational curve essential for proficiency and supply deeper insights into the reason for efficiency degradation, which isn’t simply attainable by experimenting with actual AI coaching jobs.
The end result, stated Keysight, is that its prospects can entry a library of LLM workloads equivalent to GPT and Llama, with a number of standard mannequin partitioning schemas like information parallel (DP), totally sharded information parallel (FSDP) and three-dimensional (3D) parallelism.
The corporate added that utilizing the workload emulation software within the KAI Knowledge Centre Builder enabled AI operators to experiment with parallelism parameters, together with partition sizes and their distribution over the accessible AI infrastructure (scheduling) and perceive the influence of communications in and amongst partitions on general job completion time (JCT). It’s additionally attributed with permitting customers to determine low-performing collective operation, and drill all the way down to determine bottlenecks and analyse community utilisation, tail latency, and congestion to grasp the influence they’ve on JCT.
“As AI infrastructure grows in scale and complexity, the necessity for full-stack validation and optimisation turns into essential,” stated Ram Periakaruppan, vice-president and common supervisor of community take a look at and safety options at Keysight.
“To keep away from expensive delays and rework, it’s important to shift validation to earlier phases of the design and manufacturing cycle. KAI Knowledge Middle Builder’s workload emulation brings a brand new degree of realism to AI part and system design, optimising workloads for peak efficiency.”
…………………………………………
Sourcing from TechTarget.com & computerweekly.com
DYNAMIC ONLINE STORE
Subscribe Now
Leave a Reply