Descripción del Puesto
MLOps/LLMOps Engineer - LLMs / AI / Fixed Term Contract
We’re seeking a
MLOps/LLMOps Engineer
to join our client on an initial fixed term contract working on a hybrid model (2-3 days per week onsite) in either Zaragoza or Barcelona
Please note - the initial contract will run until 30th June, 2026.
In this role, you will deploy cutting-edge ML/LLMs models to Fortune Global 500 clients, within a world-class team of Quantum experts with an extensive track record in both academia and industry. You will collaborate with the founding team in a fast-paced startup environment and will design, develop, and implement Machine Learning and Large Language Model pipelines, encompassing data acquisition, preprocessing, model training and tuning, deployment, and monitoring.
The company is backed by global investors and major EU support, and their technology is already reshaping how AI is deployed worldwide by cutting costs, slashing model sizes, and making systems faster, greener, and more accessible.
Deploy cutting-edge ML/LLMs models to Fortune Global 500 clients.
Join a world-class team of Quantum experts with an extensive track record in both academia and industry.
Design, develop, and implement Machine Learning (ML) and Large Language Model (LLM) pipelines, encompassing data acquisition, preprocessing, model training and tuning, deployment, and monitoring.
Employ automation tools such as GitOps, CI/CD pipelines, and containerization technologies (Docker, Kubernetes) to enhance ML/LLM processes throughout the Large Language Model lifecycle.
Establish and maintain comprehensive monitoring and alerting systems to track Large Language Model performance, detect data drift, and monitor key metrics, proactively addressing any issues.
Conduct truth analysis to evaluate the accuracy and effectiveness of Large Language Model outputs against known, accurate data.
Collaborate closely with Product and DevOps teams and Generative AI researchers to optimize Model performance and resource utilization.
Manage and maintain cloud infrastructure (e.g., AWS, Azure) for Large Language Model workloads, ensuring both cost-efficiency and scalability.
Communicate effectively with both technical and non-technical stakeholders, providing updates on Large Language Model performance and status.
Bachelor's or master's degree in computer science, Engineering, or a related field.
~1+ years of experience as an ML/LLM engineer in public cloud platforms.
~ Proven experience in MLOps, LLMOps, or related roles, with hands-on experience in managing machine/deep learning and large language model pipelines from development to deployment and monitoring.
~ Experience in cloud platforms (e.g., AWS, Azure) for ML workloads, MLOps, DevOps, or Data Engineering.
~ Knowledge in model parallelism in model training and serving, and data parallelism / hyperparameter tuning.
~ Proficiency in programming languages such as Python, distributed computing tools such as Ray, model parallelism frameworks such as DeepSpeed, Fully Sharded Data Parallel (FSDP), or Megatron LM.
~ Knowledge in with generative AI applications and domains, including content creation, data augmentation, and style transfer.
~ Experience with Azure Machine Learning, Azure Kubernetes Service, Azure CycleCloud, Azure Managed Lustre.
In accordance with local employment laws, applicants must have current, valid authorisation to work in Spain
at the time of application. We are unable to sponsor employment visas for this role. By applying to this role you understand that we may collect your personal data and store and process it on our systems.