Job Description
Software Engineer - Applied ML (US/CAN)
at
Cohere
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises that build AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build, each of us contributing to increasing the capabilities of our models and the value they bring to customers. We work hard, move fast, and believe a diverse range of perspectives is essential for great products.
Why this role?
We are looking for a skilled Applied AI Software Engineer to join our Applied ML team. You will support the development and deployment of custom LLMs and AI agents, working directly with customers and collaborating closely with machine learning and platform teams to deliver impactful LLM‑based enterprise solutions.
What You’ll Do
Design and build Agentic AI applications, including RAG systems, to deliver high‑business‑value customer use‑cases.
Design, build and maintain data generation and evaluation pipelines to power custom LLM model development.
Develop backend services and APIs to support ML/LLM workflows at scale.
Collaborate with ML engineers on model deployment, evaluation, and experiment tracking.
Ensure adherence to software engineering best practices: design, coding, testing, CI/CD, observability, and version control.
Deploy and monitor applications on cloud and containerized infrastructure (Docker, Kubernetes).
Join us at a pivotal moment, shape what we build, and wear multiple hats.
What You’ll Bring
Required
LLM & Agent Development: LLM evaluation fundamentals, data pipelines, RAG systems, and Agent development (MCP, LangChain/LlamaIndex).
Software Engineering: API design, scalable services, databases, testing, CI/CD, observability, and version control.
ML/LLM Tooling & Infrastructure: HuggingFace, W&B, Docker, and Kubernetes.
Programming Languages: Proficiency in Python for application development, data processing, and ML/LLM integration.
Nice‑to‑have
Vector databases (Pinecone, Weaviate, FAISS, Milvus)
Cloud platforms (AWS, GCP, Azure)
LLM architectures and fine‑tuning recipes (SFT, DPO, LoRA)
LLM runtimes & libraries (vLLM, Transformers)
MLOps and experiment tracking (MLflow or similar)
Evaluation frameworks (LM Evaluation Harness, RAGAs, TruLens)
Go or JavaScript/TypeScript for backend or agent tooling
Applied NLP (SpaCy, NLTK, HuggingFace Datasets/Tokenizers)
Additional Requirements
Strong written and verbal communication skills.
Ability and interest to travel up to 25%.
Why Join Us
Work on cutting‑edge AI and LLM technology.
Collaborate with a highly‑skilled team of ML experts and product engineers.
Opportunities for rapid professional growth.
Help shape the future of enterprise advantage globally.
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
Perks
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in‑office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top‑up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well‑being, quality time, and workspace improvement
Remote‑flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co‑working stipend
✈️ 6 weeks of vacation (30 working days!)
Seniority level
Entry level
Employment type
Full‑time
Job function
Engineering and Information Technology
Industries
Software Development
Location: Toronto, Ontario, Canada
Salary: CA$80,000.00 – CA$120,000.00
#J-18808-Ljbffr