NextGen Federal Systems

technology and professional services

EdgeAI/ModelOptimizationEngineer

Aberdeen, Maryland, United States FULL TIME Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Edge AI/Model Optimization Engineer at NextGen Federal Systems. Skills: Edge AI, Model Optimization, LLMs, GPU Computing. Evaluate candidate LLMs, embedding models, AI inference solutions. Tune and optimize AI model runtime configurations”

What You'll Achieve.

ensure AI-enabled capabilities remain performant, reliable, and mission-effective; balancing mission effectiveness, latency, throughput, resource utilization, reliability, and operational sustainability; validate that agent behavior, workflow reliability, and operational outcomes remain acceptable

Industry & Context.

technology and professional services

Problems you'll solve

analytical, troubleshooting, and performance optimization skills.

Eligibility Requirements

Active Security Clearance is required

What They're Looking For.

Must Have

Bachelor’s degree in Computer Science, Electrical Engineering, Computer Engineering, Data Science, Artificial Intelligence, or related technical discipline., 5+ years of experience supporting AI/ML deployment, model optimization, edge computing, GPU acceleration, or AI inference operations., Experience deploying and optimizing LLMs, embedding models, or AI inference pipelines within resource-constrained or edge-compute environments., Experience with GPU-enabled systems and inference optimization technologies such as CUDA, TensorRT, ONNX Runtime, vLLM, Ollama, or equivalent platforms., Experience tuning AI runtime configurations including quantization, batching, caching, and memory optimization techniques., Experience benchmarking AI models and operational workflows against hardware performance constraints., Experience with Linux-based systems, containerized deployments, and orchestration technologies such as Docker and Kubernetes., Familiarity with Python and AI/ML deployment frameworks commonly used for edge inference and operational AI systems., analytical, troubleshooting, and performance optimization skills., Ability to communicate technical findings and operational tradeoffs effectively to technical and non-technical stakeholders., Active Security Clearance is required

Nice to Have

Experience supporting tactical, airborne, or mission-command edge computing environments., Familiarity with X9 Spider Mission Computer architectures or similar embedded GPU-enabled mission systems., Experience supporting AI-enabled workflows within NGC2, AIDP, EMSCO, Lattice, or related operational ecosystems., Experience with model quantization techniques such as INT8, FP16, GGUF, GPTQ, AWQ, or similar optimization approaches., Familiarity with disconnected, degraded, intermittent, and low-bandwidth (DDIL) operational environments., Experience with hardware evaluation and performance trade studies for operational edge compute systems.

What You'll Do.

Evaluate candidate LLMs

AI inference solutions

Tune and optimize AI model runtime configurations

Collaborate with customer stakeholders

Benchmark agentic AI workflows

Recommend model-selection

configuration tradeoffs

Build and maintain repeatable performance testing frameworks

sustain local model-serving components

Collaborate with agent engineers

sustainment activities

Train customer technical personnel

Maintain technical documentation

Support DevSecOps and CI/CD activities

How You'll Work.

Team & Collaboration

close collaboration with AI engineers, systems integrators, mission stakeholders, and operational users; Collaborate with customer stakeholders; Collaborate with agent engineers, AI developers, and integration teams

Communication Scope

Ability to communicate technical findings and operational tradeoffs effectively to technical and non-technical stakeholders.

Full Job Description

## Description NextGen is seeking a highly motivated and technically skilled Edge AI/Model Optimization Engineer to support the deployment, optimization, and sustainment of AI and agentic AI capabilities within edge and tactical computing environments. This role focuses on evaluating, tuning, benchmarking, and operationalizing Large Language Models (LLMs), embedding models, and AI inference services for constrained hardware platforms, including the X9 Spider Mission Computer architecture and other edge compute systems supporting operational missions using ReadiChat. ReadiChat is a mission-focused, agentic AI platform designed to help organizations build, deploy, govern, and scale specialized AI agents for operational workflows. It combines AI agents, workflow orchestration, grounded knowledge, testing frameworks, and enterprise controls into a single collaborative workspace. The ideal candidate will possess expertise in AI model optimization, GPU-enabled edge computing, runtime performance tuning, and operational AI deployment. This role requires close collaboration with AI engineers, systems integrators, mission stakeholders, and operational users to ensure AI-enabled capabilities remain performant, reliable, and mission-effective within disconnected, degraded, intermittent, and low-bandwidth environments. ## Responsibilities Evaluate candidate Large Language Models (LLMs), embedding models, and AI inference solutions for quality, latency, memory utilization, reliability, and operational performance on embedded GPU-enabled edge compute platforms, including the X9 Spider Mission Computer architecture. Tune and optimize AI model runtime configurations for edge deployment, including quantization strategies, batching configurations, context window sizing, cache behavior, inference scheduling, and GPU memory utilization specific to operational edge hardware environments. Collaborate with customer stakeholders to assess mission requirements and evaluate alternative edge

Free ATS check

Applying for this Edge AI/Model Optimization Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 47 detected · ranked by frequency

AI model optimization ×4

GPU-enabled edge computing ×4

runtime performance tuning ×4

operational AI deployment ×4

LLMs ×3

quantization strategies ×3

batching configurations ×3

context window sizing ×3

cache behavior ×3

inference scheduling ×3

GPU memory utilization ×3

performance and stress-testing ×3

model compression ×3

DevSecOps ×3

CI/CD ×3

Edge AI ×2

Model Optimization ×2

GPU Computing ×2

X9 Spider Mission Computer ×2

ReadiChat ×2

CUDA ×2

TensorRT ×2

ONNX Runtime ×2

vLLM ×2

Ollama ×2

Docker ×2

Kubernetes ×2

agentic AI

Large Language Models

embedding models

AI inference services

Role Details

Experience 5–10 yrs

Level Senior

Work Mode Hybrid

Type FULL TIME

Education Bachelor’s degree in Computer Science, Electrical Engineerin

AI-Extracted Insights

Domain Areas

edge-and-tactical-computing-environmentsconstrained-hardware-platformsoperational-missionsmission-focusedagentic-ai-platformoperational-workflowsenterprise-controlstactical

How to Apply on Lever

Lever uses a streamlined one-page form — apply in under 5 minutes.
LinkedIn import works well; review parsed data before submitting.
The cover letter field is optional but visible to reviewers — use it to differentiate.
Referral codes from employees can significantly boost visibility of your application.

ANONYMOUS · UNFILTERED

What do employees actually say about NextGen Federal Systems?

Real rants from real employees. Read before you apply.

Read Company Rants →