Giotto AI Industry · Engineering

LLM Architect

CHF 130'000 – 150'000 / year

Description

Giotto.ai is a Switzerland-based AI company building sovereign intelligence systems for Switzerland and Europe. Our mission is to build sovereign AI capabilities that enable Switzerland and Europe to preserve strategic independence, cultural identity, and core values while achieving world-class performance in advanced reasoning systems.

We are looking for a senior engineer to help architect and build large-scale AI systems around LLMs, reasoning models, and distributed inference infrastructure. You will work on core AI architecture problems:

  • Scalable inference systems
  • Distributed training pipelines
  • Reasoning-oriented model architectures
  • Efficient serving and orchestration
  • Production systems for advanced AI workloads

Depending on your profile, you may operate as:

  • A highly autonomous senior individual contributor
  • A technical lead for a core AI initiative
  • An architect helping shape the long-term direction of our AI stack

We value people who combine deep technical judgment with strong execution.

Responsibilities

  • Architecting production-grade LLM systems
  • Designing scalable inference and training infrastructure
  • Optimizing performance across GPU and distributed environments
  • Building systems around reasoning and agentic workflows
  • Improving efficiency, latency, reliability, and throughput
  • Working closely with research teams to bring frontier ideas into production
  • Contributing to the long-term technical direction of sovereign AI systems in Europe

Core Technologies

  • Python
  • PyTorch
  • Hugging Face ecosystem
  • Large Language Models (LLMs)
  • vLLM
  • Distributed systems
  • Ray
  • Docker
  • Kubernetes

Qualifications

  • 5+ years building ML/NLP systems
  • Strong expertise in Python and PyTorch
  • Proven experience deploying ML systems into production
  • Deep understanding of transformer architectures and modern LLM systems
  • Experience with distributed compute environments
  • Comfortable operating in fast-moving research + production settings
  • Strong ownership mindset and ability to work from first principles

Strongly Valued Experience

  • Designing or operating large-scale LLM systems
  • Distributed inference or training at scale
  • CUDA programming or GPU optimization
  • Systems-level performance engineering
  • Experience with model serving infrastructure
  • Technical leadership on complex AI systems
  • Research or engineering work on reasoning models
  • Experience in high-performance engineering environments

We especially appreciate candidates who have built difficult systems end-to-end and can speak concretely about trade-offs, failures, scaling challenges, and engineering decisions.

Nice to Have

  • PhD in Computer Science, Mathematics, Physics, or related hard sciences
  • Experience at top-tier AI labs or large-scale technology companies
  • Background in systems optimization or infrastructure engineering
  • Competitive programming or Olympiad background (IMO, IOI, IPhO, etc.)
  • Open-source contributions in ML infrastructure or LLM tooling

Location & Work Style

We offer full-time employment in Switzerland. Hybrid setup: Remote work fully supported Team gathers one week per month in the Swiss office Exceptional candidates elsewhere in Europe may also be considered.

Why Giotto.ai

We are building frontier AI systems with a small, highly technical team focused on reasoning, efficiency, and sovereignty. This is an opportunity to work on foundational AI infrastructure and architecture problems with meaningful technical ownership from day one.