Job Description

A Top Tier IT MNC is hiring for Senior Staff Engineer - Python Backend & LLM.

Experience: 7 - 11 years

Mandatory Skills : FastAPI (Expert),LLM Application Frameworks (Strong),Prompt Engineering with LLMs (Strong),Python (Strong)

Job Summary

We are looking for a highly skilled LLM Engineer & Python Backend Developer to build, scale, and maintain AI-powered applications using Large Language Models and modern Python backend systems. You will be responsible for developing intelligent agents, RAG pipelines, and production-grade APIs that power next-generation AI products.

Key Responsibilities

LLM, AI & GenAI Engineering

Design, implement, and optimize applications using LLMs (GPT-4/5, Claude, Gemini, LLaMA, Mistral, Mixtral, DeepSeek, etc.).

Develop advanced prompt engineering, system prompting, and structured output pipelines.

Build and maintain RAG (Retrieval-Augmented Generation) systems with hybrid search.

Implement multi-agent systems and autonomous workflows.

Fine-tune and adapt foundation models using LoRA, QLoRA, PEFT, and adapters.

Deploy open-source LLMs using vLLM, TGI, Ollama, LM Studio, or Triton.

Integrate multimodal models (text, image, audio, video).

Develop evaluation pipelines for hallucination detection, factuality, and bias.

Implement guardrails, moderation, and alignment techniques.

Optimize latency, cost, and throughput for LLM inference.

Backend Engineering (Python)

Build scalable backend systems using FastAPI, Django, or Flask.

Design REST, GraphQL, and streaming APIs.

Develop microservices and event-driven architectures.

Integrate relational, NoSQL, and in-memory databases.

Implement authentication, authorization, and security best practices.

Write unit, integration, and load tests.

Optimize backend performance and observability.

AI Infrastructure & LLMOps

Design model serving and inference pipelines.

Build monitoring, logging, and tracing for AI systems.

Implement CI/CD pipelines for ML/LLM workflows.

Manage feature stores, embedding pipelines, and indexing.

Handle versioning for models, prompts, and datasets.

Implement A/B testing for AI outputs.

Automate retraining and evaluation pipelines.

Data, Search & Knowledge Systems

Build document ingestion, chunking, and preprocessing pipelines.

Implement semantic, keyword, and hybrid search.

Work with vector databases (Pinecone, Weaviate, Milvus, Qdrant, FAISS, Chroma).

Develop knowledge graphs and ontology-based systems.

Build enterprise search and document intelligence systems.

Cloud, DevOps & Scalability

Deploy AI systems on AWS, GCP, or Azure.

Manage Kubernetes-based inference clusters.

Optimize GPU/TPU utilization.

Implement autoscaling and cost optimization.

Secure AI pipelines and infrastructure.

Required Qualifications

Bachelor’s/Master’s degree in CS, AI, Data Science, or equivalent experience.

Strong proficiency in Python.

Proven experience with LLM APIs and open-source models.

Hands-on experience with RAG, embeddings, and vector search.

Experience with AI frameworks (LangChain, LlamaIndex, Haystack, DSPy, AutoGen, CrewAI).

Experience building production backend systems.

Solid understanding of distributed systems and microservices.

Experience with cloud platforms and containerization

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.

Apply Now

Job Details

Posted Date: March 19, 2026

Job Type: Technology

Location: India

Company: Client of Prasha Consultancy Services Private Limited

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.

Apply Now