Job Description
Generative AI Engineer
About the Role
Experience:
5+ years
Tech Focus:
Python, RAG, LangChain/LangGraph, OpenAI/Bedrock, SQL/Spark, Postgres/MongoDB, Pandas/scikit-learn/CV2
Cloud:
AWS, GCP, Azure
We’re looking for an AI Engineer with strong Python expertise and hands-on experience delivering systems powered by Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). You’ll design and ship resilient AI services, build robust orchestration with LangChain/LangGraph, and ensure reliability at scale with strong error handling and
production failovers
. The ideal candidate is a pragmatic builder who can collaborate across engineering, data, and product to ship business impact quickly and safely.
What You’ll Do
Design & Ship LLM/RAG Services:
Build Python-based services and APIs that leverage RAG pipelines (chunking, indexing, embeddings, retrieval) and LLM orchestration with
LangChain
/
LangGraph
.
Model Selection & Routing:
Implement provider-agnostic strategies across
OpenAI
,
AWS Bedrock
, and other endpoints; add smart routing, fallback, and
production failover
patterns for high availability.
Data & Feature Pipelines:
Develop ingestion, preprocessing, and transformation pipelines with
Python, SQL, Spark
; persist and retrieve from
Postgres
and
MongoDB
.
Document & Image Processing:
Use
Pandas
,
scikit-learn
, and
CV2
for unstructured data preparation (e.g., OCR-ready cleaning, page-level segmentation, image enhancements).
Reliability & Observability:
Implement robust
error handling
, logging, metrics, tracing, and alerting; design for graceful degradation and circuit breakers.
Security & Compliance:
Enforce secrets management, RBAC, data access controls, PII handling, and auditability across services.
DevEx & Reuse:
Contribute to reusable libraries, templates, and documentation to accelerate delivery across teams.
Cross-Functional Delivery:
Partner with data scientists, platform engineers, and product to scope requirements and ship end‑to‑end solutions.
Required Qualifications
5+ years
of professional software engineering with a track record of independently designing and shipping complex systems.
Advanced
Python
with strong software engineering practices (testing, CI/CD, containerization).
Practical experience building systems with
LLMs
and
RAG
patterns (prompting, retrieval pipelines, indexing, chunking, embeddings).
Orchestration with
LangChain
and
LangGraph
for multi-step workflows.
Hands-on with
OpenAI
and
AWS Bedrock
(usage, quotas, rate limits, provider selection).
Data engineering depth with
SQL
,
Spark
, and storage systems (
Postgres
,
MongoDB
).
Proficiency with
Pandas
,
scikit-learn
, and
CV2
for data preparation and feature engineering on text/images.
Strong
error handling
and resiliency design; proven experience implementing
production failovers
.
Cloud proficiency in
AWS
,
GCP
, and/or
Azure
; experience with infrastructure-as-code and container orchestration (e.g., Docker, Kubernetes).
Clear communication and ability to collaborate with cross-functional stakeholders.
Core Technology Stack
Languages & Data:
Python, SQL, Spark; Postgres, MongoDB
Libraries:
Pandas, scikit-learn,
CV2
LLM & Orchestration:
LangChain
,
LangGraph
,
RAG
Model Providers:
OpenAI
,
AWS Bedrock
(plus optional cloud-native endpoints)
Cloud:
AWS
,
GCP
,
Azure
Reliability:
Model selection/routing,
error handling
,
production failovers
, observability
If interested, please share your resume at:
manisha.2190498@infosys.com
Location - PAN India