Research Engineer – AI Systems
Posted 7hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
AI Systems Research Engineer at Yotta Labs focusing on optimizing AI systems and GPU architecture for multi-silicon computing. Join a visionary team rethinking AI infrastructure through innovation and collaboration.
Responsibilities:
- Design and implement high-performance kernels for Attention, MoE, GEMM, collective communication, and quantization.
- Optimize kernels for NVIDIA, AMD, and AWS Trainium.
- Develop custom operators and graph optimizations using Neuron SDK, PyTorch/XLA, Torch Dynamo, and Neuron Compiler.
- Improve performance of vLLM, SGLang, TensorRT-LLM, and custom inference runtimes.
- Design scalable distributed training and inference solutions across thousands of accelerators.
- Contribute to open-source projects, publish technical findings and engage with the developer community.
Requirements:
- Proficiency in AI programming languages such as Python and C++
- Deep understanding of GPU architecture and performance optimization
- Experience with CUDA, Triton, ROCm/HIP, or AWS Neuron
- Strong understanding of AI frameworks (e.g., PyTorch, Dynamo, LMCache), model architectures and profiling tools (e.g. Nsight, ROCm Profiler, or Neuron Profiler)
- Strong problem-solving skills and the ability to work in a collaborative, remote environment
- A background in computer science, engineering, or a related field is preferred
Benefits:
- Competitive compensation with equity
- Flexible, remote work environment that values innovation and autonomy




















