The job below is no longer available.

Hours: Full-time, Part-time
Location: Mountain View, California

About this job

Job Description

We are looking for an experienced and talented Performance Modeling Software Engineer to work in a fast-paced engineering organization designing cutting-edge technologies in the AI hardware acceleration space. In this role you will drive the development, architecture, analysis, qualification, and quantification of our system-level performance models, simulation software, and supporting tooling and infrastructure. The tools and processes developed by the team will be used to validate, verify, and explore cross-cutting, system-level design decisions across the various engineering domains at Luminous Computing. You will be a foundational member of a small team tasked with building a platform to model and explore the hardware/software design space of Luminous Computing's first series of products.



Responsibilities:

  • Build analytical performance models for deep-learning workloads executing on novel hardware system designs and integrate them into a sophisticated system-level simulation platform
  • Model the performance of Luminous hardware systems with sufficient precision to influence and/or validate critical system architectural design decisions
  • Work closely with HW design teams to implement behavioral performance models of AI HW accelerator system components (compute, memory, & interconnects) within our internal system-level architecture modeling and simulation framework
  • Participate in identifying system- and component-level bottlenecks that exist within and/or across the network, memory, and compute subsystem boundaries
  • Propose new architectural hardware and software features to improve runtime of ML models of interest on current and future architectures
  • Inform automatic parallelization techniques and optimized kernel generation within the compiler through performance modeling and iterative parameter sweeping over the parallelization strategy and network topology domains

Requirements

  • 3+ years of experience co-designing and implementing hardware and software systems to accelerate machine learning and deep learning workloads
  • BS/MS/PhD in Computer Science, Computer Engineering, or a related technical field
  • Extensive knowledge of computer systems architecture, particularly high-performance, parallel systems such as GPU clusters, heterogeneous scale-out systems, and machine learning accelerator systems
  • Familiarity with scale-out performance and MPI collective primitives such as AllReduce, AllGather, and other common parallel-computing communication primitives
  • Understanding of modern deep neural network architectures (e.g. CNNs, Transformers, GNNs, Mixture-of-Experts, DLRM) and the algorithms that drive them (e.g. convolutions, attention, back-propagation)
  • Experience optimizing software routines (i.e. kernels) using knowledge of the underlying low-level details of the target hardware architecture
  • Experience developing and deploying machine learning models in the datacenter (TensorFlow, PyTorch, Mesh TensorFlow, DeepSpeed, etc.)
  • Proficiency in modern programming languages (e.g. C++, Python, Haskell)
  • Experience with standard software development tools including code reviews, task management (e.g., GitLab, GitHub, Jira), version control (e.g. git), etc.
  • Strong independent problem-solving skills coupled with the ability to work collaboratively with a small software-engineering team working on mission-critical projects

Benefits

  • Our benefits are here to support you!
    • Competitive salary and benefits
    • 401(k) program
    • Medical / dental / vision, AD&D and Life
    • Flexible vacation and company paid holidays
    • Healthy meals and snacks provided for non-remote employees
    • Paid parental leave
    • Flexible Spending Account
  • We're remote-friendly
    • We believe that your value to the team can be provided at a Luminous Computing office or from the comfort of your home. Depending on your function, enjoy the flexibility of remote work and join our growing community of remote employees in the continental United States.
  • We're vested
    • You won't just own your work here; you'll have the potential to own equity in Luminous Computing. We are competing in a market that is projected to grow exponentially, which gives our company valuation room to grow at tremendous rates.