Elva

Engineering

AIVideoAgentEngineer

₹35–60L ~AI est. Europe FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“AI Video Agent Engineer at Elva. Skills: AI Agent Architecture, Pipeline Orchestration, Multimodal Data Processing, Model Orchestration. Design AI agent architecture. Implement AI agent architecture”

What You'll Achieve.

Analyze content; Generate narrative structure; Select visual elements; Select audio elements; Produce structured video output

Industry & Context.

Engineering
Problems you'll solve

Troubleshooting

What They're Looking For.

Must Have

AI Systems & Agent Architecture experience, Workflow Orchestration experience, n8n MCP experience, Python programming skills, JavaScript programming skills, Ability to integrate tools for multimodal generation

Nice to Have

ComfyUI custom nodes experience, Lightweight APIs experience, FFmpeg experience, Video processing pipelines experience, Image processing workflows experience, Deep understanding of LangGraph, Deep understanding of LangChain, Experience designing long-term memory systems, Experience building agents with feedback loops, Experience with LangSmith, Experience with Arize Phoenix, Experience with Promptfoo

What You'll Do.

Design AI agent architecture

Implement AI agent architecture

Define system prompts

Define behavioral rules

Define structured instructions

Develop orchestration pipelines

Maintain orchestration pipelines

Design multi-agent workflows

Design tool-calling logic

Design dynamic routing

Design context passing

Design multimodal data pipelines

Handle video materials

Handle user text prompts

Handle structured metadata

Ensure data transformation

Ensure context transfer

Integrate external APIs

Integrate internal APIs

Evaluate available APIs

Tune model interactions

Optimize model interactions

Optimize pipelines for quality

Optimize pipelines for reliability

Optimize pipelines for efficiency

Design memory systems

Design contextual reasoning systems

Full Job Description

We are toogeza, a Ukrainian recruiting company that is focused on hiring talents and building teams for tech startups worldwide. People make a difference in the big game, we may help to find the right ones. Currently, we are looking for AI Video Agent Engineer for Elva. Location: Remote Job Type: Full-Time OVERVIEW: We are building an AI-driven video production system designed as an intelligent multi-agent orchestration layer capable of transforming raw ideas and references into fully structured video content. The system operates as a black-box creative engine for different types of users: - professional video creators - casual users - creators of short narrative concepts Users provide an idea, references, or media fragments, and the system automatically orchestrates multiple AI agents responsible for analysis, scripting, editing, and production of video content. We are looking for an engineer who can design and implement multi-agent pipelines, orchestrate AI tools, and build intelligent workflows that combine video analysis, storytelling logic, and automated editing. This role sits at the intersection of AI systems architecture, creative tooling, and multimodal content generation. RESPONSIBILITIES: AI Agent Architecture Design and implement the architecture of a multi-agent video editing system including agents responsible for: - video analysis - narrative generation - editing orchestration - production and output synthesis Define system prompts, behavioral rules, and structured instructions for agents interacting within the pipeline. Pipeline Orchestration (n8n) Develop and maintain complex orchestration pipelines in n8n, including: - multi-agent workflows - tool-calling logic - dynamic routing between tools and models - context passing between agents Pipelines must be capable of selecting the most appropriate models, tools, and strategies depending on the task. Multimodal Data Processing Design robust pipelines for handling: - video materials - image assets - use

Free ATS check

Applying for this AI Video Agent Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Elva?

Real rants from real employees. Read before you apply.

Read Company Rants →