Job Description
Staff Inference Engineer (Multimodal & LLMs)
Frontier AI Lab | $1B+ Raised
Remote | €200-300k cash + equity
We’re partnering with a frontier AI company building large-scale multimodal and language models used at global scale. The company has raised over $1bn, and this is a rare chance to work deep in the inference stack, where performance genuinely matters.
This role is for someone with proven experience at the intersection of research-grade LLMs and large-scale systems engineering.
What you’ll do
Push the limits of LLM inference quality, latency, and cost
Turn cutting-edge research ideas into production-ready systems
Own inference metrics and performance in real deployments
Write fast, elegant code close to the metal (and know why it’s fast)
Collaborate tightly with researchers and infra engineers to shape what ships next
What they’re looking for
Deep understanding of transformers and modern LLM inference
Strong instincts for distributed systems and low-precision computation
Obsession with performance: kernels, memory, matmuls, and hardware bottlenecks
Comfort working across Python and lower-level stacks (C/C++, CUDA, Triton, etc.)
Someone opinionated, pragmatic, and happy to challenge “best practice” when it slows things down
Research exposure is a plus, but shipping impact matters more
Why this role
Work on models and systems that define the state of the art
Huge scope, serious autonomy, and real technical depth
Elite peers who care about correctness and speed
Compensation and equity to match top-tier frontier labs
Ready to Apply?
Don't miss this opportunity! Apply now and join our team.
Job Details
Posted Date: February 23, 2026
Job Type: Construction
Location: Indonesia
Company: Fabrik Talent