Home Job Listings Categories Locations

Senior Data Engineer

📍 Mumbai, India

Technology Celebal Technologies

Job Description

Role: Sr. Azure Databricks Data Engineer Experience: 6+ Years Location: Navi Mumbai Duration: Fulltime

Role Overview We are looking for a Data Engineer to design and build scalable, production-grade data pipelines on Azure using Databricks. The role involves working with high-volume, high-velocity enterprise data across domains such as telecom, retail, and regulatory systems (e.g., GST, eWay Bill). You will be responsible for building reliable batch and real-time pipelines, ensuring data quality, auditability, and performance at scale.

Key Responsibilities Data Engineering & Pipeline Development • Design and implement end-to-end data pipelines using Azure Databricks and PySpark • Build and maintain batch and real-time ingestion pipelines using: o Azure Data Factory (ADF) o Kafka / Azure Event Hubs • Process TB–PB scale structured and semi-structured datasets (JSON, Parquet, CSV)

Lakehouse Architecture • Implement and maintain Medallion Architecture (Bronze, Silver, Gold layers) • Develop reusable data models for analytics, reporting, and downstream consumption • Ensure data lineage, traceability, and auditability across layers

Delta Lake & Data Management • Leverage Delta Lake features: o MERGE (upserts), SCD Type 1/2 implementations o Schema enforcement and evolution o ACID-compliant pipelines • Optimize Delta tables using: o OPTIMIZE, Z-ORDER, VACUUM • Handle incremental processing and CDC pipelines

Streaming & Real-Time Processing • Build low-latency streaming pipelines using Structured Streaming • Handle: o Late-arriving data o Watermarking and windowing o Exactly-once processing semantics • Integrate streaming pipelines with downstream Delta tables and serving layers

Performance Optimization & Scalability • Optimize Spark jobs using: o Partitioning strategies o Broadcast joins o Caching and persistence o Adaptive Query Execution (AQE) • Troubleshoot performance bottlenecks such as: o Data skew o Shuffle issues o Memory constraints

Orchestration & Workflow Management • Design orchestration workflows using Azure Data Factory: o Pipeline dependencies o Scheduling and triggers o Retry and failure handling • Integrate ADF with Databricks Jobs / Workflows for end-to-end execution

Data Quality & Governance • Implement robust data quality checks, including: o Schema validation o Deduplication o Null and integrity checks o Data reconciliation across sources • Handle schema drift and evolving data contracts • Ensure compliance with regulatory and audit requirements

Domain-Specific Use Cases • Build pipelines for: o High-frequency transactional systems o Regulatory datasets (GST, eWay Bill, financial reporting) o Retail / telecom data platforms • Ensure data consistency, reconciliation, and reporting accuracy

Technical Skills Required Core • Strong SQL: o Window functions o Complex joins o Query optimization • PySpark: o Transformations and actions o Performance tuning

Azure Ecosystem • Azure Data Factory (ADF) • Azure Data Lake Storage Gen2 (ADLS) • Azure Databricks

Streaming • Kafka or Azure Event Hubs • Structured Streaming

Good to Have • CDC pipeline implementation • Databricks Autoloader • Delta optimization techniques • Multi-region / multi-source ingestion • Data quality frameworks (e.g., expectation-based validation)

Soft Skills & Expectations • Strong debugging skills in distributed data systems (Spark) • Experience handling production incidents and RCA (Root Cause Analysis) • Ability to work in high-scale, SLA-driven environments • Effective collaboration with business, analytics, and downstream consumers • Flexibility to work in WFO / hybrid setup

What Success Looks Like • Reliable, scalable pipelines handling large-scale enterprise data • Reduced pipeline failures and improved data SLAs • High-quality, trusted datasets for business-critical reporting • Efficient Spark jobs with optimized cost and performance

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.

Job Details

Posted Date: February 27, 2026
Job Type: Technology
Location: Mumbai, India
Company: Celebal Technologies

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.