AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

MTQ4NGt2bk5UM1J2V1hFVGZlRENqQXV6NGc9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

The Perico Group

Hygienist (Full Time or Part Time) Job at The Perico Group

 ...care to their patients and, if interested, get the opportunity to assist in surgical care. We are looking for a fun, hardworking team...  ...and accurately assess patient oral health condition Acquire dental imaging, including radiographs, CBCTs and 2D photographs Assemble... 

Kumon North America, Inc.

Production Associate Job at Kumon North America, Inc.

 ...leadership in the education sector. Creative Impact : Contribute to the design and production of materials that shape the learning experience for thousands of students. About the Role: Production Associate As a Production Associate at Kumon North America, you... 

J & J Staffing Resources

Mechanical Designer Job at J & J Staffing Resources

 ...pays $33-$37 per hour, based on experience. The schedule is Monday through Friday, 8:00am-5:00pm. Will work in the office Mondays through Wednesdays and work from home Thursdays and Fridays each week! Responsibilities and Requirements: Develop detailed 2D and 3D... 

New York Junior Tennis & Learning

Social Worker Job at New York Junior Tennis & Learning

 ...Full-Time Social Worker (Afterschool Program - Bronx Locations) About New York Junior Tennis & Learning For over fifty years, New York Junior Tennis & Learning (NYJTL) has honored its founder Arthur Ashes legacy by celebrating the diversity of children, encouraging... 

Watco

Trainmaster Job at Watco

 ...opportunities General Purpose The Trainmaster supervises and coordinates activities of train crew engaged in switching railroad cars within yard of railroad, industrial plant, or similar location to facilitate loading or unloading of cars or making up and breaking...