Cerebras Systems Unveils World’s Fastest AI Chip with Whopping 4 Trillion Transistors

Revolutionary Compute for Generative AI

Introducing the Cerebras CS-3: the world’s fastest AI accelerator. Built to scale out to 2048 systems, the CS-3 trains trillion-parameter models at record speed. Exascale performance, single-device simplicity.

Cerebras AI Day

Watch keynotes from our historic event

Opening Keynote by CEO, Andrew Feldman

Hardware Keynote by CTO, Sean Lie

ML and Product Keynote by VP of Product, Jessica Liu

AI Model Services

You bring the data, we'll train the model

Whether you want to build a multi-lingual chatbot or predict DNA sequences, our team of AI scientists and engineers will work with you and your data to build state-of-the-art models leveraging the latest AI techniques.

High Performance Computing

The fastest HPC accelerator on earth

With 900,000 cores and 44 GB of on-chip memory, the CS-3 completely redefines the performance envelope of HPC systems. From Monte Carlo Particle Transport to Seismic Processing, the CS-3 routinely outperforms entire supercomputing installations.

Models on Cerebras

The Cerebras platform has trained a huge assortment of models from multi-lingual LLMs to healthcare chatbots. We help customers train their own foundation models or fine-tune open source models like Llama 2. Best of all, the majority of our work is open source.

Llama 2

Foundation language model
7B-70B, 2T tokens
4K context

OPEN WEIGHTS

Mistral

7B Foundation model that leverages
grouped-query attention,
coupled with sliding window attention

TRAINED ON CEREBRAS

JAIS

Bilingual Arabic + English model
13B, 30B Parameters
Available on Azure, G42 Cloud

OPEN WEIGHTS TRAINED ON CEREBRAS

MED42

Medical Q&A LLM
Fine-tuned from Llama2-70B
Scores 72% on USMLE

TRAINED ON CEREBRAS

BLOOM

Massive multi-lingual LLM
176B parameters, 366B tokens
2K context

OPEN SOURCE TRAINED ON CEREBRAS

FALCON

Foundation language model
40B parameters, 1T tokens
Uses Flash Attention and multi-query attention

CEREBRAS IMPLEMENTATION

MPT

Foundation model trained
on 1T tokens of English
that uses ALiBi positioning method

OPEN SOURCE TRAINED ON CEREBRAS

StarCoder

Coding LLM
15.5B parameters, 1T tokens
8K context

OPEN WEIGHTS TRAINED ON CEREBRAS

Diffusion Transformer

Image generation model
33M-2B parameters
Adaptive layer norm

CEREBRAS IMPLEMENTATION

T5

For NLP applications
Encoder-decoder model
60M-11B parameters

CEREBRAS IMPLEMENTATION

CRYSTALCODER

Trained for English + Code
7B Parameters, 1.3T Tokens
LLM360 Release

OPEN SOURCE TRAINED ON CEREBRAS

CEREBRAS-GPT

Foundational Language Model
100M-13B parameters
NLP

OPEN SOURCE TRAINED ON CEREBRAS

BTLM-chat

BTLM-3B-8K fine-tuned for chat
3B parameters, 8K context
Direct Preference Optimization

CEREBRAS IMPLEMENTATION

gigaGPT

Implements nanoGPT on Cerebras
Trains 175B+ models
565 lines of code

CEREBRAS IMPLEMENTATION

Latest blog posts

April 12, 2024

Cerebras CS-3 vs. Nvidia B200: 2024 AI Accelerators Compared

March 12, 2024

Cerebras CS-3: the world’s fastest and most scalable AI accelerator

March 11, 2024

Cerebras and Qualcomm Unleash ~10X Inference Performance Boost with Hardware-Aware LLM Training

Customer Spotlight

"Mayo Clinic selected Cerebras as its first generative AI collaborator for its large-scale, domain-specific AI expertise to accelerate breakthrough insights for the benefit of patients."

Matthew Callstrom, MD, PhD

Medical Director for Strategy, Chair - Department of Radiology

"The Cerebras CS-2 is a critical component that allows GSK to train language models using biological datasets at a scale and size previously unattainable. These foundational models form the basis of many of our AI systems and play a vital role in the discovery of transformational medicines."

Kim Branson

SVP Global Head of AI and ML, GlaxoSmithKline

"Training which historically took over 2 weeks to run on a large cluster of GPUs was accomplished in just over 2 days — 52hrs to be exact — on a single CS-1. This could allow us to iterate more frequently and get much more accurate answers, orders of magnitude faster."

Nick Brown

Head of AI & Data Science, AstraZeneca

"Working with the Cerebras ML team we were able to train a new state-of-the-art large language model that outperforms models twice its size in a matter of weeks. Their AI expertise is second to none."

The Opentensor Foundation

"TotalEnergies’ roadmap is crystal clear: more energy, less emissions. To achieve this, we need to combine our strengths with those who enable us to go faster, higher, and stronger… We count on the CS-2 system to boost our multi-energy research and give our research ‘athletes’ that extra competitive advantage."

Vincent Saubestre

CEO & President, TotalEnergies Research & Technology USA

"Cerebras allowed us to reduce the experiment turnaround time on our cancer prediction models by 300x, ultimately enabling us to explore questions that previously would have taken years, in mere months."

Dr. Rick Stevens

Associate Laboratory Director of Computing, Environment and Life Sciences, Argonne National Laboratory

In the News

March 11, 2024

Cerebras Systems Unveils World’s Fastest AI Chip with Whopping 4 Trillion Transistors

Cerebras and G42 Break Ground on Condor Galaxy 3, an 8 exaFLOPs AI Supercomputer

Cerebras Selects Qualcomm to Deliver Unprecedented Performance in AI Inference
