Job Description
Generative AI Engineer
About the Role
Experience:
5+ years
Tech Focus:
Python, RAG, LangChain/LangGraph, OpenAI/Bedrock, SQL/Spark, Postgres/MongoDB, Pandas/scikit-learn/CV2
Cloud:
AWS, GCP, Azure
We’re looking for an AI Engineer with strong Python expertise and hands-on experience delivering systems powered by Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). You’ll design and ship resilient AI services, build robust orchestration with LangChain/LangGraph, and ensure reliability at scale with strong error handling and
production failovers . The ideal candidate is a pragmatic builder who can collaborate across engineering, data, and product to ship business impact quickly and safely.
What You’ll Do
Design & Ship LLM/RAG Services:
Build Python-based services and APIs that leverage RAG pipelines (chunking, indexing, embeddings, retrieval) and LLM orchestration with
LangChain / LangGraph .
Model Selection & Routing:
Implement provider-agnostic strategies across
OpenAI ,
AWS Bedrock , and other endpoints; add smart routing, fallback, and
production failover
patterns for high availability.
Data & Feature Pipelines:
Develop ingestion, preprocessing, and transformation pipelines with
Python, SQL, Spark ; persist and retrieve from
Postgres
and
MongoDB .
Document & Image Processing:
Use
Pandas ,
scikit-learn , and
CV2
for unstructured data preparation (e.g., OCR-ready cleaning, page-level segmentation, image enhancements).
Reliability & Observability:
Implement robust
error handling , logging, metrics, tracing, and alerting; design for graceful degradation and circuit breakers.
Security & Compliance:
Enforce secrets management, RBAC, data access controls, PII handling, and auditability across services.
DevEx & Reuse:
Contribute to reusable libraries, templates, and documentation to accelerate delivery across teams.
Cross-Functional Delivery:
Partner with data scientists, platform engineers, and product to scope requirements and ship end‑to‑end solutions.
Required Qualifications
5+ years
of professional software engineering with a track record of independently designing and shipping complex systems.
Advanced
Python
with strong software engineering practices (testing, CI/CD, containerization).
Practical experience building systems with
LLMs
and
RAG
patterns (prompting, retrieval pipelines, indexing, chunking, embeddings).
Orchestration with
LangChain
and
LangGraph
for multi-step workflows.
Hands-on with
OpenAI
and
AWS Bedrock
(usage, quotas, rate limits, provider selection).
Data engineering depth with
SQL ,
Spark , and storage systems ( Postgres ,
MongoDB ).
Proficiency with
Pandas ,
scikit-learn , and
CV2
for data preparation and feature engineering on text/images.
Strong
error handling
and resiliency design; proven experience implementing
production failovers .
Cloud proficiency in
AWS ,
GCP , and/or
Azure ; experience with infrastructure-as-code and container orchestration (e.g., Docker, Kubernetes).
Clear communication and ability to collaborate with cross-functional stakeholders.
Core Technology Stack
Languages & Data:
Python, SQL, Spark; Postgres, MongoDB
Libraries:
Pandas, scikit-learn,
CV2
LLM & Orchestration:
LangChain ,
LangGraph ,
RAG
Model Providers:
OpenAI ,
AWS Bedrock
(plus optional cloud-native endpoints)
Cloud:
AWS ,
GCP ,
Azure
Reliability:
Model selection/routing,
error handling ,
production failovers , observability
If interested, please share your resume at:
manisha.2190498@infosys.com
Location - PAN India