Job Description
Data Engineer
Role Overview
We are looking for a skilled Data Engineer to support the evolution of Client’s existing Enterprise Insights Hub (EIH) 1.0 into a GenAI-powered EIH 2.0 intelligent solution. This role will play a critical part in enabling executive decision-making by transforming large-scale enterprise data into high-quality, minimal, and GenAI-ready datasets that power dynamic insight cards and actionable intelligence.
---
Key Responsibilities
· Design, build, and maintain end-to-end data pipelines to ingest, process, and transform high-volume enterprise data.
· Develop scalable data solutions on Azure using Databricks for batch and streaming workloads.
· Perform data extraction, transformation, and optimization to convert raw, high-volume datasets into curated, analytics- and GenAI-ready datasets.
· Collaborate with AI/ML and GenAI teams to ensure data is structured, governed, and optimized for downstream GenAI use cases.
· Enable the transition from static dashboards to dynamic, insight-driven executive views.
· Ensure data quality, reliability, performance, and scalability across pipelines.
· Implement best practices for data governance, security, and cost optimization.
· Work closely with product owners, business analysts, and stakeholders to understand requirements and translate them into robust data solutions.
---
Required Skills & Experience
· Strong experience as a Data Engineer in enterprise-scale environments.
· Hands-on experience with Azure Data Services and Databricks.
· Solid understanding of data pipeline creation, orchestration, and monitoring.
· Proven ability to handle high-volume, complex datasets and reduce them into meaningful, minimal datasets for analytics and AI consumption.
· Experience with data extraction, ingestion from multiple structured and semi-structured sources.
· Good understanding of end-to-end data lifecycle, from source systems to consumption layers.
· Strong SQL skills and experience with data modeling.
· Familiarity with cloud-native architectures and performance optimization.
---
Nice-to-Have
· Exposure to GenAI / AI-powered analytics or data preparation for LLM-based systems.
· Experience with Azure Data Factory, Delta Lake, or similar technologies.
· Understanding of executive reporting, KPIs, and enterprise analytics platforms.
The Company
Infogain is a digital platform engineering firm based in Silicon Valley. We engineer business outcomes for Fortune 500 companies and digital natives in the technology, healthcare, insurance, travel, telecom, and retail/CPG industries. We accelerate experience-based transformation in the delivery of digital platforms using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. Infogain is a multi-cloud expert in hyper-scale cloud providers: Microsoft Azure, Google Cloud Platform, and Amazon Web Services.
Infogain, an Apax Fund's portfolio company, has offices in California, Washington, Texas, the United Kingdom, the United Arab Emirates, and Singapore, with delivery centers in Seattle, Houston, Austin, Montevideo, Krakow, Noida, Bangalore, Pune, Gurgaon, and Mumbai.
Infogain is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, or age.