FuriosaAI's RNGD: A Game Changer in AI Inference Chips

August 28, 2024, 11:32 pm
AMD
AMD
CenterDataDevelopmentHardwareMediaProductResearchSoftwareTechnologyWireless
Location: United States, California, Santa Clara
Employees: 10001+
Founded date: 1969
퓨리오사에이아이
퓨리오사에이아이
DeepTechLearn
Qualcomm
Qualcomm
B2CDesignDevelopmentHardwareITManagementMessangerMobileTimeWireless
Location: United States, California, Sorrento Valley
Employees: 10001+
Founded date: 1985
Supermicro
Supermicro
BuildingCenterCloudDataEnterpriseITProductProviderStorageTechnology
Location: United States, California, San Jose
Employees: 1001-5000
Founded date: 1993
In the fast-paced world of artificial intelligence, innovation is the lifeblood. FuriosaAI has just injected a potent dose of creativity into the AI semiconductor landscape with the unveiling of its latest chip, RNGD. This new AI inference accelerator is not just another player in the game; it’s a potential game changer.

Unveiled at Hot Chips 2024, RNGD (pronounced "Renegade") is designed to redefine efficiency in data centers. For years, the AI hardware arena has been dominated by legacy chipmakers and flashy startups. FuriosaAI, founded in 2017 by engineers with roots in tech giants like AMD, Qualcomm, and Samsung, is here to disrupt that status quo. Their mission? To deliver cutting-edge technology at breakneck speed.

RNGD is not just a chip; it’s a promise of performance. The company has successfully brought this silicon marvel to life, thanks to a partnership with TSMC. This collaboration has allowed FuriosaAI to achieve a seamless technology development process. Their first-generation chip, launched in 2021, set the stage for this rapid evolution. Within weeks of receiving silicon, they showcased impressive MLPerf benchmark results, boasting a staggering 113% performance increase in subsequent tests.

Early tests of RNGD have shown remarkable results with large language models (LLMs) like GPT-J and Llama 3.1. Imagine a single RNGD PCIe card delivering between 2,000 to 3,000 tokens per second. That’s the kind of throughput that turns heads and opens doors. With models boasting around 10 billion parameters, RNGD is poised to handle the heavy lifting of modern AI workloads.

The architecture of RNGD is a feat of engineering. It employs a non-matmul Tensor Contraction Processor (TCP) architecture, striking a perfect balance between efficiency, programmability, and performance. This is not just about raw power; it’s about smart design. The robust compiler co-designed for TCP treats entire models as single-fused operations, optimizing performance while minimizing energy consumption.

Efficiency is the name of the game. With a thermal design power (TDP) of just 150W, RNGD stands in stark contrast to the 1000W+ requirements of leading GPUs. This is green computing at its finest. The chip’s 48GB of HBM3 memory allows it to run demanding models like Llama 3.1 8B efficiently on a single card. In a world increasingly focused on sustainability, RNGD is a beacon of hope.

Industry partners are already singing its praises. Supermicro, a key player in the tech landscape, recognizes the potential of RNGD to drive green computing. By integrating this technology, they can reduce power consumption while maintaining exceptional inference performance. This is a win-win scenario for both the environment and the bottom line.

The collaboration between GUC and FuriosaAI is another testament to the chip’s potential. Achieving such high performance and power efficiency requires meticulous planning and execution. FuriosaAI has demonstrated excellence from design to delivery, creating chips that stand out in a crowded market.

As RNGD begins sampling to early access customers, the excitement is palpable. Broader availability is expected in early 2025, and the anticipation is building. This chip is not just a product; it’s a glimpse into the future of AI computing.

FuriosaAI’s journey is a testament to the power of innovation. The company has carved a niche for itself in a competitive landscape, focusing on rapid product delivery and continuous advancement. The launch of RNGD is a culmination of years of hard work and dedication. It’s a one-shot silicon success that showcases the company’s commitment to pushing boundaries.

The industry is watching closely. June Paik, Co-Founder and CEO of FuriosaAI, is set to present performance benchmarks at Hot Chips. His presentation, titled "Furiosa RNGD: A Tensor Contraction Processor for Sustainable AI Computing," promises to shed light on the chip’s exceptional capabilities. A live demo at the Furiosa booth will give attendees a firsthand look at this groundbreaking technology.

In a world where AI is becoming increasingly integral to our lives, the demand for efficient, powerful hardware is skyrocketing. RNGD is positioned to meet that demand head-on. It’s not just about keeping pace; it’s about setting the pace.

As we look to the future, RNGD represents a shift in the AI hardware landscape. It’s a reminder that innovation is not just about creating new products; it’s about rethinking how we approach technology. FuriosaAI is leading the charge, and the industry is eager to see what comes next.

In conclusion, RNGD is more than just a chip; it’s a revolution in AI inference. With its focus on efficiency, performance, and sustainability, FuriosaAI is not just participating in the AI race; it’s redefining the finish line. The future of AI computing is bright, and RNGD is at the forefront of that future.