Job Description
We're looking for an entrepreneurial, passionate, and driven
Data Engineer
to join Startup
Gala Intelligence
backed by Navneet Tech Venture situated in
Ahmedabad . As we're building our technology platform from scratch, you'll have the unique opportunity to shape our technology vision, architecture, and engineering culture right from the ground up. Youโll directly contribute to foundational development and establish best practices, while eventually building and contributing to our engineering team.
This role is ideal for someone eager to own the entire tech stack, who thrives on early-stage challenges, and loves building innovative, scalable solutions from day zero.
What Youโll Do
Web Scraping & Crawling:
Build and maintain automated scrapers to extract structured and unstructured data from websites, APIs, and public datasets.
Scalable Scraping Systems:
Develop multi-threaded, distributed crawlers capable of handling high-volume data collection without interruptions.
Data Parsing & Cleaning:
Normalize scraped data, remove noise, and ensure consistency before passing to data pipelines.
Anti-bot & Evasion Tactics:
Implement proxy rotation, captcha solving, and request throttling techniques to handle scraping restrictions.
Integration with Pipelines:
Deliver clean, structured datasets into
NoSQL stores
and
ETL pipelines
for further enrichment and graph-based storage.
Data Quality & Validation:
Ensure data accuracy, deduplicate records, and maintain a
trust scoring system
for data confidence.
Documentation & Maintenance:
Keep scrapers updated when websites change, and document scraping logic for reproducibility.
Who You Are
Technical Skills:
4+ years of experience in
web and mobile scraping , crawling, or data collection.
Strong proficiency in
Python
(libraries like BeautifulSoup, Scrapy, Selenium, Playwright, Requests).
Familiarity with
NoSQL databases
(MongoDB, DynamoDB) and data serialization formats (JSON, CSV, Parquet).
Experience in handling
large-scale scraping
with proxy management and rate-limiting.
Basic knowledge of
ETL processes
and integration with data pipelines.
Exposure to
graph databases
(Neo4j) is a plus.
Soft Skills:
Detail-oriented, ensuring accuracy and reliability of collected data.
Strong problem-solving skills, particularly in adapting scrapers to evolving web structures.
Curious mindset with a drive to discover new data sources.
Comfortable working in a fast-paced, early-stage startup environment.
Who We Are & Our Culture
Gala Intelligence , backed by
Navneet Tech Ventures , is a tech-driven startup dedicated to solving one of the most pressing business challenges - fraud detection and prevention. We're building cutting-edge, real-time products designed to empower consumers and businesses to stay ahead of fraudsters, leveraging innovative technology and deep domain expertise.
Our culture and values:
Weโre united by a single, critical mission - stopping fraud before it impacts businesses. Curiosity, innovation, and proactive action define our approach. We value transparency, collaboration, and individual ownership, creating an environment where talented people can do their best work.
Problem-Driven Innovation : We're deeply committed to solving real challenges that genuinely matter for our customers.
Rapid Action & Ownership : We encourage autonomy and accountabilityโown your projects, move quickly, and shape the future of Gala Intelligence.
Collaborative Excellence : Cross-team collaboration ensures alignment, sparks innovation, and drives us forward together.
Continuous Learning : Fraud evolves rapidly, and so do we. Continuous improvement, experimentation, and learning are core to our success.
If you're excited by the opportunity to leverage technology in the fight against fraud, and you're ready to build something impactful from day one, we want to hear from you!