Job Description
Company Description
ThreatXIntel is a cybersecurity startup focused on protecting businesses of all sizes from evolving digital threats. Our experienced team specializes in cloud security, web and mobile security testing, cloud assessments, and DevSecOps, offering tailored solutions to address unique client needs. We are committed to providing affordable, high-quality services, enabling businesses to safeguard their digital assets. Using a proactive approach, we continuously monitor and test digital environments to mitigate vulnerabilities before exploitation. At ThreatXIntel, our mission is to empower organizations with reliable cybersecurity solutions, fostering their growth and success in a secure digital landscape.
Role Description
We are seeking a Senior Generative AI Engineer with strong expertise in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and enterprise-grade GenAI system development.
The consultant will design, build, and deploy end-to-end Generative AI solutions across text, image, audio, and multimodal applications. This role focuses on production deployment, optimization, and scalable AI architecture design.
Key Responsibilities
Design and build end-to-end Generative AI pipelines including data collection, preprocessing, training, evaluation, and deployment
Develop applications using generative models such as GPT, LLaMA, Claude, Stable Diffusion, and similar architectures
Implement enterprise RAG pipelines using vector databases and embedding frameworks
Integrate LLMs and multimodal AI systems using LangChain, LlamaIndex, Hugging Face, and API-based frameworks
Optimize model inference performance, latency, and operational cost
Develop intelligent AI-driven applications in collaboration with product, data science, and engineering teams
Implement prompt engineering strategies and embedding pipelines
Deploy models using Docker, Kubernetes, and CI/CD pipelines
Work with cloud platforms such as Azure, AWS, or GCP
Utilize Azure AI services such as Azure AI Foundry and AI Search where applicable
Required Technical Skills
Generative AI and LLMs
Hands-on experience with GPT, LLaMA, Claude, Mistral, or similar LLMs
Practical experience implementing RAG architectures
Experience with LangChain, LlamaIndex, and Hugging Face Transformers
Strong understanding of prompt engineering and embeddings
Vector Search and Retrieval
Experience with FAISS, Pinecone, Weaviate, Milvus, or similar vector databases
Programming and ML
Strong Python programming skills
Experience with PyTorch or TensorFlow
Cloud and MLOps
Experience with Azure, AWS, or GCP
Containerization using Docker and Kubernetes
API development and CI/CD pipeline integration
Multimodal AI
Exposure to image generation models such as Stable Diffusion or DALLยทE
Experience with multimodal systems combining text, vision, and speech