Job Description

We are

CirrusLabs

. Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our customers, partners and community. Our goal is to enable employees to dream, grow and make things happen. We are committed to excellence. We are a dependable partner organization that delivers on commitments. We strive to maintain integrity with our employees and customers. Every action we take is driven by value. The core of who we are is through our well-knit teams and employees. You are the core of a values driven organization.

You have an entrepreneurial spirit. You enjoy working as a part of well-knit teams. You value the team over the individual. You welcome diversity at work and within the greater community. You aren't afraid to take risks. You appreciate a growth path with your leadership team that journeys how you can grow inside and outside of the organization. You thrive upon continuing education programs that your company sponsors to strengthen your skills and for you to become a thought leader ahead of the industry curve.

You are excited about creating change because your skills can help the greater good of every customer, industry and community. We are hiring a talented

Data Scientist & Agentic AI Developer

to join our team. If you're excited to be part of a winning team, CirrusLabs (

http://www.cirruslabs.io

) is a great place to grow your career.

Experience - 5-8 years Location - Bengaluru Work Timings - 2pm - 11pm IST

Tech Stack:

Python, LangChain, LangSmith, Phoenix, TensorFlow, PyTorch, OpenAI API, Anthropic Claude, Azure OpenAI, AWS/Azure/GCP. We value innovation, collaboration, and a commitment to ethical AI development.

Position Overview We are seeking an experienced Data Scientist and Agentic AI Developer to design, develop, and evaluate intelligent AI agents that deliver high-quality, reliable, and compliant solutions. This role requires a blend of data science expertise, AI/ML engineering capabilities, and hands-on experience building agentic systems.

Experience Required 5-6 years of professional experience in data science, machine learning, and AI development

Key Responsibilities

AI Agent Development & Deployment Design and develop agentic AI systems powered by Large Language Models (LLMs) with tool-calling capabilities Build and optimize multi-step reasoning workflows and agent orchestration frameworks Implement retrieval-augmented generation (RAG) pipelines for knowledge-intensive applications Integrate external tools, APIs, and databases into agent workflows Deploy and monitor production-grade AI agents at scale Evaluation & Quality Assurance Develop comprehensive evaluation frameworks using LLM-as-a-judge methodologies Implement automated scoring systems for output quality metrics (correctness, helpfulness, coherence, relevance) Design and execute robustness testing including adversarial attack scenarios Monitor and reduce hallucination rates and ensure factual accuracy Track performance metrics including latency, throughput, and cost-per-interaction Data Science & Analytics Analyze agent performance data to identify improvement opportunities Build custom evaluation pipelines and scoring rubrics Conduct A/B testing and statistical analysis for model optimization Create dashboards and visualization tools for stakeholder reporting Implement RAGAs (Retrieval Augmented Generation Assessment) frameworks Safety, Ethics & Compliance Ensure AI systems meet ethical standards including bias detection and fairness Implement safety guardrails to prevent harmful content generation Develop compliance monitoring systems for regulatory frameworks (EU AI Act, GDPR, HIPAA, DPDP) Document transparency and explainability measures Establish human oversight protocols

Required Skills & Qualifications

Technical Expertise Programming

: Strong proficiency in Python; experience with AI/ML frameworks (Lang Chain, Lang Smith, Phoenix, or similar) LLM Expertise

: Hands-on experience with GPT, Claude, or other frontier models; prompt engineering and fine-tuning Machine Learning

: Deep understanding of NLP, deep learning architectures, and model evaluation Tools & Platforms

: Experience with MLOps tools, vector databases, and observability platforms Data Engineering

: Proficiency in SQL, data pipelines, and ETL processes Domain Knowledge Understanding of agentic AI architectures and autonomous systems Knowledge of RAG systems and information retrieval techniques Familiarity with LLM evaluation methodologies and benchmarks Experience with conversational AI and dialogue systems Understanding of AI safety, alignment, and interpretability Evaluation & Metrics Experience designing evaluation rubrics and scoring systems Proficiency with automated evaluation frameworks (RAGAs, custom evaluators) Understanding of quality metrics: coherence, fluency, factual accuracy, hallucination detection Knowledge of performance metrics: latency optimization, token usage, throughput analysis Experience with user experience metrics (CSAT, NPS, turn count analysis) Soft Skills Strong analytical and problem-solving abilities Excellent communication skills for cross-functional collaboration Ability to balance innovation with practical constraints Detail-oriented with a focus on quality and reliability Self-driven with ability to work in fast-paced environments

Preferred Qualifications

Experience with compliance frameworks and regulatory requirements Background in conversational AI or chatbot development Knowledge of reinforcement learning from human feedback (RLHF) Experience with multi-modal AI systems

Tools & Technologies Frameworks : Lang Chain, Lang Smith, Phoenix, TensorFlow, PyTorch LLM Platforms : OpenAI API, Anthropic Claude, Azure OpenAI Databases : Vector databases (Pinecone, Weaviate, Chroma), PostgreSQL, MongoDB Monitoring : Lang Smith, Phoenix, custom observability tools Cloud : AWS/Azure/GCP experience preferred Version Control : Git, CI/CD pipelines

What You'll Deliver

Production-ready agentic AI systems with measurable quality improvements Comprehensive evaluation frameworks with automated scoring Performance dashboards and reporting systems Documentation for technical specifications and compliance standards Continuous improvement strategies based on data-driven insights

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.

Apply Now

Job Details

Posted Date: November 28, 2025

Job Type: Technology

Location: India

Company: CirrusLabs

Ready to Apply?

Don't miss this opportunity! Apply now and join our team.

Apply Now