Job Description
BrezQ Inc
Database Reliability Engineer (Aurora PostgreSQL | AWS | DMS | Terraform | Automation)
Company:
BrezQ Inc
Location:
Remote - India
Reporting To:
Database Reliability Engineering Manager
About BrezQ
BrezQ Inc is an IT services and cloud infrastructure solutions company delivering enterprise-grade engineering expertise across database reliability, cloud architecture, DevOps, and scalable platform engineering. We specialize in production-critical environments that demand high availability, automation-first operations, and performance optimization at scale.
Role Overview
BrezQ is seeking a hands-on
Database Reliability Engineer (DRE)
with deep experience in Aurora PostgreSQL, AWS database services, infrastructure automation, and zero-downtime migrations.
This role requires ownership of production-grade database platforms, strong PostgreSQL internals knowledge, Infrastructure-as-Code implementation, performance engineering, and reliability best practices in distributed cloud environments.
Key Responsibilities
Design, implement, and maintain highly available Aurora PostgreSQL clusters in AWS
Architect Multi-AZ and multi-region database deployments for high availability
Perform zero-downtime database upgrades (in-place and out-of-place strategies)
Lead database migrations using AWS DMS
Build and enhance Terraform modules for DMS and database infrastructure deployment
Develop automation scripts using Python (preferred), Bash, or Golang
Implement CI/CD pipelines for repeatable and automated database deployments
Optimize schema design, indexing strategies, and complex query performance
Analyze execution plans using EXPLAIN / ANALYZE and resolve performance bottlenecks
Troubleshoot production issues involving locking, deadlocks, replication lag, and connection saturation
Design backup, rollback, and disaster recovery strategies
Define monitoring baselines using SLOs, SLIs, SLAs, and RED metrics
Collaborate with SRE, DevOps, and application teams during design reviews and deployment planning
Maintain operational runbooks and reliability documentation
Required Technical Skills
Database Expertise
5–10 years of experience managing production-grade databases
Strong hands-on experience with Aurora PostgreSQL
Deep understanding of PostgreSQL internals including WAL, replication, indexing, vacuum tuning, and connection management
Strong SQL proficiency for performance tuning, query optimization, and schema design
Experience with database design, provisioning, capacity planning, scaling strategies, and incident response
On-call production support experience
AWS & Migration Experience
Aurora PostgreSQL architecture and high-availability design
AWS Database Migration Service (DMS)
Blue/Green deployments
Multi-region failover and disaster recovery architecture
Backup management and rollback planning
Automation & Infrastructure-as-Code
Python (strongly preferred) or Golang
Bash scripting
Terraform (mandatory)
CI/CD tools such as Jenkins, GitHub Actions, Argo Workflows, or similar
Experience using Git-based version control and pull request workflows
Monitoring & Observability
Experience with Prometheus, Grafana, Datadog, CloudWatch, ELK, Splunk, or equivalent tools
Monitoring and troubleshooting:
Slow queries
Query timeouts
Blocking locks and deadlocks
Connection pool saturation
Replication health
Understanding and implementation of SLO, SLI, SLA, and RED metrics
Preferred Qualifications
Experience with DynamoDB, MongoDB, Cassandra, or ElastiCache (Redis/Valkey)
Ability to evaluate relational vs NoSQL database architectures based on workload patterns, scalability, and cost considerations
Experience working in distributed global production systems
Exposure to containerized environments (Docker)
Participation in architecture design reviews and reliability process improvements
What Makes This Role Critical
This role supports mission-critical AWS database infrastructure requiring high availability, automation-driven reliability, and performance optimization at scale. The ideal candidate has hands-on experience operating production environments and can independently own database reliability engineering functions.
Essential (Core Database Engineer skills):
PostgreSQL
MongoDB
SQL
Database Design
Performance Tuning
Scaling
Backup & Restore
Monitoring
Troubleshooting
Why Join BrezQ
Exposure to enterprise cloud database ecosystems
Work on high-availability AWS production platforms
Automation-focused engineering culture
Opportunity to contribute to scalable and performance-driven infrastructure initiatives
Growing IT services organization with strong cloud engineering focus
Interested candidates are requested to share their resumes at sharan@BrezQ.com