Job Description
Job Title: Generative AI Engineer
Location:
Gurgaon (Hybrid)
Experience:
3+ Years
Notice Period:
Immediate Joiners Preferred
Role Overview
We are looking for a highly skilled
Generative AI Engineer
to design and deploy advanced AI-driven search, recommendation, and LLM-powered systems. The ideal candidate will have strong hands-on experience with Transformer architectures, LLM fine-tuning, semantic search, and production-grade ML systems.
This role requires deep technical expertise in machine learning, natural language processing, and scalable deployment in enterprise environments.
Key Responsibilities
Search & Recommendation Development
Lead end-to-end design, development, and deployment of search, personalization, and recommendation systems.
Build scalable solutions that significantly improve user experience and business KPIs.
Optimize ranking models and dense retrieval systems.
Transformer-Based Model Implementation
Fine-tune and optimize models such as
BERT, RoBERTa, and encoder architectures
for:
Semantic Search
Relevance Ranking
Query Understanding
Embedding Generation
Implement dense vector search and retrieval systems.
Large Language Model (LLM) Innovation
Research and prototype LLM-based solutions.
Work on model selection, prompt engineering, LoRA-based fine-tuning, and quantization.
Design and implement
RAG (Retrieval-Augmented Generation)
systems using vector databases.
Build advanced retrieval pipelines with frameworks such as Hugging Face, LangChain, and LlamaIndex.
⚙ ML Productionization (MLOps)
Build, train, validate, and deploy ML models into scalable, low-latency production systems.
Ensure reliability, observability, and maintainability of ML services.
Collaborate with engineering teams to integrate models into real-world applications.
Data Strategy & Feature Engineering
Work closely with Data Engineering to define datasets and feature pipelines.
Ensure data quality, consistency, and governance across ML workflows.
Develop innovative features for ranking and recommendation models.
Evaluation & Optimization
Define and track KPIs such as NDCG, CTR, latency, perplexity, recall, and precision.
Continuously improve model accuracy, robustness, and performance.
Conduct A/B testing and experimentation.
Essential Technical Qualifications
MS/PhD in Computer Science, Data Science, Engineering, or equivalent experience.
3+ years of hands-on experience in ML/AI engineering.
Expert-level proficiency in
Python .
Strong experience with ML/DL libraries (NumPy, Pandas, Scikit-learn).
Deep experience with
PyTorch or TensorFlow .
Proven hands-on work with Transformer models (BERT, encoder-only models) for IR/NLU.
Practical experience with LLM fine-tuning and deployment.
Experience with frameworks such as
Hugging Face, LangChain, LlamaIndex .
Strong understanding of classical ML algorithms and statistical modeling.
Direct experience building search ranking systems, recommendation engines, or vector-based search.
Experience with cloud platforms ( AWS, GCP, or Azure ).
Experience with MLOps tools such as
MLFlow, Kubeflow, Docker, Kubernetes .
Preferred Qualifications
Experience with LoRA, PEFT, or model quantization.
Hands-on work with vector databases (Pinecone, Weaviate, FAISS, Milvus).
Exposure to large-scale data systems and distributed training.
Why Join?
Work on cutting-edge Generative AI systems.
High-impact role influencing search & recommendation architecture.
Collaborative and innovation-driven environment.