Back to feed

[Remote] System Modeling (Performance Models)

Remote Full-time Live

Note: The job is a remote job and is open to candidates in USA. Unconventional AI is focused on redefining computing to address the energy limitations of AI on a global scale. They are seeking a Member of Technical Staff for System Modeling (Performance Models) to develop physics-based system models and simulation frameworks for machine learning workloads, supporting innovative AI acceleration architectures.

Responsibilities

  • Building extensible and composable high-fidelity power, performance and area estimation tools for novel AI acceleration system architectures to enable rapid design space exploration
  • Define and create comparative analyses across candidate architectures and existing state-of-art implementations
  • Working with other teams to understand their needs for such modeling and simulation to support high level system design as well as lower level verification of hardware

Skills

  • MS/PhD in a quantitative field (AI/ML, Computer Science, Physics, Electrical Engineering, Applied Math), or BS with substantial, clear evidence of equivalent research/engineering depth
  • Experience with tools and development for power profiling, modeling and simulation for AI workloads
  • Deep understanding of spatial architectures and data orchestration mechanisms
  • Deep understanding of different dataflow strategies and their tradeoffs, e.g. Weight-Stationary (WS), Output-Stationary (OS), Input-Stationary (IS) and Row-Stationary (RS)
  • Familiar with (OSS) tools for hardware accelerator design: TimLoop, Accelergy, NeuroSim, CIMLoop, CACTI, etc
  • Familiar with different existing systolic array accelerator architectures for AI/ML workloads
  • Solid understanding of modern AI/ML architectures and training/inference workflows
  • Strong experience implementing and debugging ML models in PyTorch (preferred) or similar, with practical experience profiling, optimizing, and stabilizing non-trivial large-scale ML systems
  • Basic familiarity of analog dynamic systems, including transient responses, nonidealities such as nonlinearity, quantization, random noise, and feedback/stability
  • Strong Python engineering skills: modular design, testing, packaging, CI
  • Experience with PyTorch internals: autograd, custom modules, low-level ops; familiarity with torch.compile or similar graph capture/compile flows
  • Experience with CUDA, Triton, or other GPU programming approaches (writing custom kernels, understanding memory hierarchy, basic performance tuning)
  • Comfort with at least some of: JAX, NumPy, TensorFlow, Modal, HPC patterns (MPI, NCCL, distributed training), SciPy
  • Demonstrated ability to reason across multiple layers of the stack: algorithm, software, runtime, hardware
  • Able to connect model architecture choices to system performance implications: memory bandwidth, communication patterns, latency, energy, and numerical issues
  • Experience applying at least some efficiency techniques (quantization, sparsity, pruning, distillation, kernel fusion, etc.)
  • Prior experience building or extending a serious simulation or modeling framework (could be ML systems, physics, circuits, or other technical domains)
  • Comfort with approximations and tradeoffs: you know when to use a simple model and when you need something closer to the physics

Benefits

  • Best-in-class health benefits
  • 401k matching
  • Truly unlimited PTO
  • Complimentary meals when working from our Palo Alto office

Company Overview

  • Unconventional AI rethinks computer foundations to optimize energy efficiency for AI. It was founded in 2025, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is https://unconv.ai.
  • Apply To This Job

    On the same wavelength