Job Description
Hiring for AI/ML Ops/GPU acceleration + AI inference/TensorRT/ONNX
Build and maintain containerized applications using OpenShift, OpenShift AI, Kubernetes, and Helm charts.
Integrate and optimize inference engines such as Triton and vLLM for scalable model serving.
Lead model deployment, monitoring, and lifecycle management in production environments.
Implement monitoring and alerting solutions using Grafana and Prometheus.
Collaborate on GenAI and LLM projects, including Agentic AI initiatives.
Automate CI/CD pipelines and infrastructure using Jenkins, Ansible, Groovy, and Terraform.
Develop automation scripts and tools in Python.
Architect, deploy, and manage AI/ML solutions on AWS Cloud; experience with Bedrock and SageMaker is a plus.
Build and enhance AI Platform ( both on premise and in public cloud).
Make is scalable, high performance and resilient
Contribute to future road map and key architecture decisions.
Ready to Apply?
Don't miss this opportunity! Apply now and join our team.
Job Details
Posted Date:
February 26, 2026
Job Type:
Construction
Location:
India
Company:
Jobworld Management Consultancy LLC
Ready to Apply?
Don't miss this opportunity! Apply now and join our team.