Home Job Listings Categories Locations

Senior AI/ML Ops & GPU Acceleration Engineer

📍 Bangalore, India

Technology HireAlpha

Job Description

Position:

AI/ML Ops/GPU acceleration + AI inference/TensorRT/ONNX Primary Skill: AI/ML Ops/GPU acceleration; Secondary Skill: AI inference/TensorRT/ONNX;

(1 year exp is fine; or good knowledge) Role:

Technology Engineer

-5+ Years Role:

Senior Technology Engineer- 7+ Years; Location- Offshore – Bangalore BCIT

Job Description: Build and maintain containerized applications using OpenShift, OpenShift AI, Kubernetes, and Helm charts. Integrate and optimize inference engines such as Triton and vLLM for scalable model serving. Lead model deployment, monitoring, and lifecycle management in production environments. Implement monitoring and alerting solutions using Grafana and Prometheus. Collaborate on GenAI and LLM projects, including Agentic AI initiatives. Automate CI/CD pipelines and infrastructure using Jenkins, Ansible, Groovy, and Terraform. Develop automation scripts and tools in Python. Architect, deploy, and manage AI/ML solutions on AWS Cloud; experience with Bedrock and SageMaker is a plus. Build and enhance AI Platform ( both on premise and in public cloud). Make is scalable, high performance and resilient Contribute to future road map and key architecture decisions.

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.

Job Details

Posted Date: February 24, 2026
Job Type: Technology
Location: Bangalore, India
Company: HireAlpha

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.