Level: Experienced

Job field: Software, Data

Employment: Full-time

Contract type: Permanent employment

Location: Vienna

Working model: Hybrid, Onsite

Job Summary

In this role you will develop advanced AI agents that assess the quality and human alignment of generative designs, while implementing and optimizing scalable evaluation systems based on inference-time techniques.

Your Role in the Team

You will engineer sophisticated AI agents that can automatically assess the quality and human alignment of our generative design models.

This high-impact role focuses on building the practical systems that make cutting-edge research effective. These systems provide a rapid feedback loop that guides the future of design generation at Canva, ultimately empowering millions of users to create.

At the moment, this role is focused on:

Agentic Evaluation Systems: Engineering autonomous AI agents that use Multimodal Large Language Models (MLLMs) to evaluate the quality, relevance, and human alignment of generated designs.
Inference-Time Alignment: Mastering techniques that improve model outputs without full retraining, using inference-based methods such as prompt engineering, in-context learning (ICL), and Retrieval-Augmented Generation (RAG).
Model Benchmarking & Analysis: Building a rigorous framework to systematically benchmark internal and external quality understanding models, delivering clear, data-driven insights on human alignment.

Primary Responsibilities:

Design, build, and optimize the infrastructure for an 'MLLM-as-a-Judge' evaluation system for scalable, automated feedback.
Implement and experiment with inference-time alignment techniques (Prompt Engineering, RAG, ICL) to directly improve model output quality.
Establish and manage a comprehensive benchmarking process to compare various foundation models on design-centric tasks.
Analyze evaluation data to identify model failure modes and provide actionable recommendations to the research team.
Collaborate with research scientists and ML engineers to integrate the agentic judge system into the model development lifecycle.
Translate the latest research in LLM evaluation and agentic AI into practical, production-ready engineering solutions.

What We Expect from You

Qualifications

You excel at creating data-driven evaluation methodologies, turning user analytics into clear, actionable insights.

You’ve successfully managed or optimized large-scale distributed model training across hundreds of GPUs.

You have a solid understanding of machine learning, have worked with PyTorch, and know how to optimize PyTorch code for speed.

Nice to Have:

Familiarity with evaluation libraries and frameworks.
Knowledge of data visualization tools to communicate findings effectively.
A background or interest in human-computer interaction, design principles, or AI ethics.

Experience

You have a strong understanding of generative AI models (e.g., Diffusion Models, GANs, Transformers) and their architectures, with practical experience that informs robust evaluation strategies.

You have disciplined coding practices and are experienced with code reviews and pull requests.

You have experience working in cloud environments, ideally AWS.

You have experience building or working with agentic AI systems or multi-agent coordination.

Benefits

Health, Fitness & Fun

Work-Life Integration

Food & Drink

More Net Pay

Job Locations

Vienna location
Ungargasse 37
1030 Vienna
Austria

About Your Employer

Canva Austria GmbH.

Vienna

Empowering the world to design with visual AI: By making complicated tech simple, Kaleido strives to enable individuals and businesses of all sizes to benefit from recent advances in visual AI. Our tools simplify and accelerate workflows, foster creativity, and enable others to create new products. We have been part of Canva since 2021. Our mission at Kaleido as part of Canva is to empower the world to design. Since Canva launched in 2013, we have grown exponentially, amassing over 100 million monthly active users across 190 countries and a team of over 3,000 people… and the best bit is that we’ve only achieved 1% of what we know we’re capable of.

Company size: 50-249 employees

Founded: 2018

Languages: English

Company type: Startup

Working model: Hybrid, Onsite

Industry: Internet, IT, Telecom

Research Engineer Evaluations
