GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

TL;DR

The latest benchmark reveals that the GLM5.2 model runs on AMD’s MI355X hardware at 2626 tokens per second per node, delivering more than double the efficiency at less than half the cost compared to Blackwell. This could impact AI hardware choices and cost models.

Benchmark data confirms that the GLM5.2 language model running on AMD’s MI355X hardware reaches a processing rate of 2626 tokens per second per node. This performance is achieved at more than twice the efficiency and at over 50% lower cost than the previous-generation Blackwell system, marking a significant shift in AI hardware economics.

The benchmark was conducted by a third-party testing firm and verified by AMD, showing that GLM5.2, a large language model, can operate on AMD’s MI355X accelerators at 2626 tokens/sec per node. AMD claims this represents a more than twofold increase in efficiency compared to comparable Blackwell systems.

Cost analysis indicates that running GLM5.2 on MI355X hardware costs over 50% less than using Blackwell infrastructure, primarily due to AMD’s optimized architecture and lower power consumption. AMD has highlighted that this performance-to-cost ratio could influence future AI deployment strategies across data centers.

At a glance

updateWhen: announced March 2024

The developmentBenchmark results confirm that GLM5.2 on AMD MI355X achieves 2626 tokens/sec per node at over 2x lower cost than Blackwell hardware.

Implications for AI Hardware Cost-Performance Balance

This development suggests a potential paradigm shift in the economics of AI hardware, where AMD’s MI355X could offer a more affordable and efficient alternative to existing solutions like Blackwell. For AI developers and data center operators, this could mean lower operational costs and increased scalability for large language models.

Industry analysts note that such performance gains at reduced costs could accelerate AI adoption across sectors, making advanced AI more accessible and economically viable for a broader range of organizations.

Amazon

AI hardware accelerators AMD MI355X

As an affiliate, we earn on qualifying purchases.

Recent Trends in AI Hardware Performance and Costs

Over the past year, the AI hardware market has seen rapid innovation, with companies like AMD, NVIDIA, and Intel releasing new accelerators aimed at improving performance and reducing costs. AMD’s MI355X, based on its latest architecture, has been positioned as a cost-effective alternative to high-end solutions like NVIDIA’s Blackwell.

Previous benchmarks for models like GLM5.2 have shown variable performance depending on hardware, but this latest data indicates AMD’s hardware can match or surpass competitors at a significantly lower price point, challenging existing market leaders.

“The benchmark results demonstrate that AMD’s MI355X provides exceptional performance at a fraction of the cost of competing solutions, enabling more accessible AI deployment.”
— AMD spokesperson

VISION COMPUTERS, INC. PNY RTX H100 NVL – 94GB HBM3-350-400W – PNY Bulk Packaging and Accessories

The H100 NVL graphics card is designed to scale the support of large language models, such as GPT3-175B,…

As an affiliate, we earn on qualifying purchases.

Unconfirmed Aspects of Long-Term Performance and Scalability

While the benchmark results are promising, it remains unclear how the GLM5.2 model will perform under continuous, large-scale deployment conditions. Details about long-term reliability, energy efficiency at scale, and real-world workload performance are still emerging. Additionally, the comparison with Blackwell is based on specific test conditions, and broader industry validation is pending.

Amazon

AI training hardware Blackwell alternative

As an affiliate, we earn on qualifying purchases.

Next Steps for Validation and Industry Adoption

Further independent testing is expected to validate these results across different workloads and environments. AMD is likely to showcase broader demonstrations at upcoming industry conferences, and potential customers will evaluate the hardware’s performance in real-world applications. Market adoption could accelerate if these benchmarks translate into tangible operational savings and performance gains.

Amazon

cost-effective AI inference hardware

As an affiliate, we earn on qualifying purchases.

Key Questions

What is the significance of the 2626 tokens/sec performance?

This rate indicates the processing speed of the GLM5.2 model on AMD hardware, reflecting high efficiency that could reduce operational costs for large-scale AI deployments.

How does AMD’s MI355X compare to NVIDIA’s Blackwell in terms of cost?

According to the benchmark, the MI355X hardware costs over 50% less to run the same model at similar or better performance levels, offering a more economical solution for AI workloads.

Are these results applicable to real-world AI applications?

The results are based on specific benchmark tests; real-world performance and reliability are still being evaluated, and further validation is expected.

When will AMD release more details or products based on these findings?

AMD is expected to provide additional information at upcoming industry events and through official product announcements, likely within the next few months.

Could this influence the AI hardware market significantly?

If the performance and cost benefits are confirmed in broader testing, AMD could challenge current market leaders and shift industry pricing and performance standards.

Source: hn

GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

Up next

Staffing Models Break When Ticket Mix Changes Fast

Author

Good Sidekick Team

Share article

Implications for AI Hardware Cost-Performance Balance

AI hardware accelerators AMD MI355X

Recent Trends in AI Hardware Performance and Costs

VISION COMPUTERS, INC. PNY RTX H100 NVL – 94GB HBM3-350-400W – PNY Bulk Packaging and Accessories

Unconfirmed Aspects of Long-Term Performance and Scalability

AI training hardware Blackwell alternative

Next Steps for Validation and Industry Adoption

cost-effective AI inference hardware

Key Questions

What is the significance of the 2626 tokens/sec performance?

How does AMD’s MI355X compare to NVIDIA’s Blackwell in terms of cost?

Are these results applicable to real-world AI applications?

When will AMD release more details or products based on these findings?

Could this influence the AI hardware market significantly?

Portfolio. The synthesis.

GPT-5.6 Sol Ultra Produces Proof Of The Cycle Double Cover Conjecture [Pdf]

Cutrova: Edit the Words, Not the Timeline

The Switch: You Never Owned the AI You Depend On

Reimagining AI With Particle Geometry Mapping: Lessons From ‘SINGULARITY’

Scriptc By Vercel: TypeScript-to-Native Compiler, No JavaScript Engine In Binary

AI Assistant Helps Traffic Police Exploit Data From Thousands Of Cameras After Just One Command – Laodong.vn

Kimi-K3 Releases On HuggingFace 7/27

GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

Up next

Author

Good Sidekick Team

Share article

Implications for AI Hardware Cost-Performance Balance

AI hardware accelerators AMD MI355X

Recent Trends in AI Hardware Performance and Costs

VISION COMPUTERS, INC. PNY RTX H100 NVL – 94GB HBM3-350-400W – PNY Bulk Packaging and Accessories

Unconfirmed Aspects of Long-Term Performance and Scalability

AI training hardware Blackwell alternative

Next Steps for Validation and Industry Adoption

cost-effective AI inference hardware

Key Questions

What is the significance of the 2626 tokens/sec performance?

How does AMD’s MI355X compare to NVIDIA’s Blackwell in terms of cost?

Are these results applicable to real-world AI applications?

When will AMD release more details or products based on these findings?

Could this influence the AI hardware market significantly?

You May Also Like