Machine Learning Software Engineer

Do you want to be in the forefront of Deep Learning innovation? Training extremely large models at a fraction of time compared to other solutions? Helping companies and labs around the world solving a real impactful problem? Working with the latest Deep Learning Architecture (i.e. Transformer, GNN…etc)? If so, then we need you!!!

Cerebras is developing a radically new chip and system to dramatically accelerate deep learning applications. Our system runs training and inference workloads orders of magnitude faster than contemporary machines, fundamentally changing the way ML researchers work and pursue AI innovation.

We are innovating at every level of the stack – from chip, to microcode, to power delivery and cooling, to new algorithms and network architectures at the cutting edge of ML research. Our fully-integrated system delivers unprecedented performance because it is built from the ground up for deep learning workloads.

Cerebras is building a team of exceptional people to work together on big problems. They aren’t afraid of taking risk and thinking outside of the box to solve fun and challenging problems.

The Team

As an ML Software Engineer on our team, you will work with leaders from industry and academia at the intersection of hardware and software, to develop state-of-the-art solutions for emerging problems in AI compute.

The Cerebras software platform is designed to be targeted by today’s most relevant machine learning frameworks such as TensorFlow, PyTorch, JAX, and MXNet.  Our ML software engineers are responsible for the backend of these frameworks and the integration with our own highly optimized software stack.

Fundamentally, you will be enabling ML researchers to use the software tools and workflows of today to unlock the advanced hardware capabilities of tomorrow.

The Role

This role includes our ML framework backend and frontend stack, you will be involved in the frontend workflow for development, training and inference on our new hardware system. And the backend runtime that map the abstract computation expressed via third-party ML frameworks computation graph into our own representations that can then be compiled into highly optimized executables that target Cerebras’s system.

The role includes cross team collaboration with the applied science and ML application team in one hand, and the compiler and hardware team on the other hand.

Skills & Qualifications

  • Bachelor’s / Master’s degree or foreign equivalent in Computer Science, Engineering, or related.
  • 5+ years software development experience.
  • Understanding of state-of-the-art deep learning model architectures and training protocols.
  • Strong Python and C++ development skills.
  • Experience with at least one deep learning framework internals (i.e. TensorFlow, PyTorch, JAX, Caffe 2, MXNet, PaddlePaddle, CNTK, Caffe, Theano, Chainer…etc) is strongly preferred.
  • Experience with GPU programing such as CUDA, shading language…etc.
  • Experience with deep learning distributed training.
  • Familiar with compiler IR stack such as LLVM and MLIR.


Los Altos, CA or San Diego, CA or Toronto, Canada


  • ML Frameworks:


  • Headquarters/Los Altos Office
  • Remote Office
  • San Diego Office
  • Toronto Office

Apply for this position.


Cover Letter