WELCOME TO AIRA MATRIX CAREERS

Principal Engineer - HPC

Expectations and Tasks

To accelerate the work of all new upcoming projects and aligning with the management objective of completion as per timeline so that the development plan is not affected.

Job duties will include

  • Developing code for one or more of the DNN training frameworks (such as Caffe, TensorFlow or Torch): Numerical analysis, Performance analysis, Model compression and Optimization & Computer architecture.
  • Will provide leadership in designing and implementing ground-breaking GPU Compute that run demanding deep learning, high-performance computing, and computationally intensive workloads.
  • Identify architectural changes and/or completely new approaches for accelerating our deep learning models
  • You will help us with the strategic challenges we encounter, including compute, networking, and storage design for large scale, high-performance workloads, effective resource utilization in a heterogeneous computing environment, evolving our private/public cloud strategy, capacity modelling, and growth planning across our products and services.
  • Responsible for converting business needs associated with AI-ML algorithms in to a set of product goals covering workload scenarios, end user expectations, compute infrastructure and time of execution; this should lead to a plan for making the algorithms production ready
  • Benchmark and optimise the Computer Vision Algorithms and the Hardware Accelerators for performance and quality KPIs.
  • Optimize algorithms for optimal performance on the GPU tensor cores.
  • Collaborate with various teams to drive an end to end workflow from data curation and training to performance optimization and deployment.
  • Provide technical leadership and expertise for project deliverables
  • Leading, mentoring and managing the technical team.
  • Direct a team of engineers, providing mentorship in technical and soft skills, defining career paths, conducting one-on-one sessions, conducting performance reviews, and managing daily work to ensure the success of the company.

Education and Qualifications

  • MS or PhD in Computer Science, Electrical Engineering, or related field. A strong background in deployment of complex deep learning architectures
  • 7+ years of relevant experience in at least a few of the following relevant areas is required in your work history: Machine learning (with focus on Deep Neural Networks), including understanding of DL fundamentals; Experience adapting and training DNNs for various tasks; Experience developing code for one or more of the DNN training frameworks (such as Caffe, TensorFlow or Torch): Numerical analysis, Performance analysis, Model compression and Optimization & Computer architecture.
  • Strong Data structures and Algorithms know-how with Excellent C/C++ programming skills.
  • Hands-on expertise with PyTorch, TensorRT, CuDNN
  • Hand-on expertise with GPU computing (CUDA, OpenCL, OpenACC) and HPC (MPI, OpenMP)
  • In-depth understanding of container technologies like Docker, Singularity, Shifter, Charliecloud.
  • Proficient in Python programming and bash scripting.
  • Excellent communication and collaboration skills.
  • Self-motivated and able to find creative practical solutions to problems."

Job Segment

High Performance Computing, GPU, Deployment, Production,

Job Location

Thane, Maharashtra

Employment Type

Regular Full Time

Expected Travel

0-15%

 

APPLY NOW

Quick Links
Get Updates
© Copyright 2025. AIRA MATRIX. All Rights Reserved.
Request A Demo
Name *
First Name
 
Last Name
Organization *
Company Name
Email *
example@example.com
Phone Please enter a valid phone number
Region *
Your location
Which of our products are you interested in
  Custom AI Development
  Digital Pathology as a Service (DPaaS)
  Image Management System (AIRADHI)
  Predictive Toxicology (PredTox)
  Prostate Diagnosis (AIRAProstate)
  Prostate Stratification (AIRAStrat)
  Prostate Prediction (AIRAPredict)
  Quality Control (AIRAQc)
  Safety Assessment (AIRATox)