The Role Gap
Sr.MLEngineer–ML&AppliedAI
Neural analysis suggests this role is
optimal for Senior candidates.
“Sr. ML Engineer – ML & Applied AI at The Role Gap. Skills: production ML systems, Python, ML frameworks (scikit-learn, XGBoost, PyTorch, TensorFlow), scalable ML pipelines and services, model serving frameworks, containerization (Docker), Kubernetes, cloud platforms (GCP, AWS, Azure), ML lifecycle management, CI/CD pipelines for ML, SQL, distributed data processing (Spark, PySpark). Architect and build scalable, production-grade ML systems from experimentation to deployment and lifecycle managem”
What You'll Achieve.
power data-driven decision making across the enterprise; end-to-end ML system ownership; enable reliable, high-performance model inference in both batch and real-time environments
Industry & Context.
Excellent problem-solving skills
What They're Looking For.
Must Have
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, 10+ years of experience in machine learning, software engineering, or related roles, significant experience in production ML systems, programming expertise in Python, solid software engineering fundamentals (data structures, system design, APIs), Extensive experience with ML frameworks such as scikit-learn, XGBoost, PyTorch, or TensorFlow, Proven experience designing and deploying scalable ML pipelines and services in production, Hands-on experience with model serving frameworks and API development (e. g. , FastAPI, Flask), experience with containerization (Docker), orchestration platforms such as Kubernetes, Experience working with cloud platforms (GCP, AWS, or Azure), building cloud-native ML solutions, Deep understanding of ML lifecycle management, including training, evaluation, deployment, monitoring, and retraining, Experience implementing CI/CD pipelines for ML workflows, managing version control systems (Git), experience with SQL, distributed data processing frameworks (e. g. , Spark, PySpark), Excellent problem-solving skills, ability to design scalable, maintainable systems
Nice to Have
modern applications involving large language models (LLMs), agent-based AI systems, vector databases, retrieval-augmented generation (RAG), agentic workflows
What You'll Do.
Architect and build scalable
production-grade ML systems from experimentation to deployment and lifecycle management
Design and implement end-to-end ML pipelines
including data ingestion
Develop and maintain high-performance model serving systems using APIs (e. g.
FastAPI) for real-time and batch inference
Lead the design and implementation of feature stores and reusable feature pipelines across teams
Build and optimize distributed data processing workflows using Spark
Implement and enforce MLOps best practices
including CI/CD pipelines
and experiment tracking
Design and manage model monitoring and observability frameworks to track performance
Drive strategies for model retraining
and continuous improvement
Contribute to the adoption of modern AI capabilities
retrieval-augmented generation (RAG)
and agentic workflows
Ensure high standards of code quality
How You'll Work.
Team & Collaboration
Collaborate closely with data engineers, platform teams, and product stakeholders to integrate ML solutions into production systems
Full Job Description
## About the Role Gap Inc. is seeking a Senior Machine Learning Engineer with 10+ years of experience to design, build, and scale production-grade machine learning and AI systems that power data-driven decision making across the enterprise. This role is focused on end-to-end ML system ownership, including data pipelines, feature engineering, model training, deployment, monitoring, and continuous optimization. You will lead the development of scalable ML platforms, drive best practices in MLOps, and enable reliable, high-performance model inference in both batch and real-time environments. The ideal candidate combines strong software engineering expertise with deep ML knowledge and has experience building robust, scalable ML systems in production, including modern applications involving large language models (LLMs) and agent-based AI systems. ## What You'll Do * Architect and build **scalable, production-grade ML systems** from experimentation to deployment and lifecycle management * Design and implement **end-to-end ML pipelines** , including data ingestion, feature engineering, training, validation, and inference * Develop and maintain **high-performance model serving systems** using APIs (e.g., FastAPI) for real-time and batch inference * Lead the design and implementation of **feature stores** and reusable feature pipelines across teams * Build and optimize **distributed data processing workflows** using Spark, Databricks, or similar platforms * Implement and enforce **MLOps best practices** , including CI/CD pipelines, automated retraining, model versioning, and experiment tracking * Design and manage **model monitoring and observability frameworks** to track performance, drift, latency, and system health * Drive strategies for **model retraining, drift detection, and continuous improvement** * Collaborate closely with data engineers, platform teams, and product stakeholders to integrate ML solutions into production systems * Contribute to the adoption of **mode
Applying for this Sr. ML Engineer – ML & Applied AI role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about The Role Gap?
Real rants from real employees. Read before you apply.