Pathway
Technology
MachineLearningDevOps-CloudandComputeCluster-R&DSupport
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Machine Learning DevOps - Cloud and Compute Cluster - R&D Support at Pathway. Skills: Machine Learning DevOps, Cloud management, Compute cluster management, Infrastructure scaling. Optimize infrastructure for ML training. Optimize infrastructure for ML inference”
Industry & Context.
Troubleshooting
What They're Looking For.
Must Have
BSc in Computer Science or Information Technology, Very good familiarity with Linux, Very good familiarity with shell scripts, Very good familiarity with cluster configuration scripts, Proficiency in workload management, Proficiency in containerization, Proficiency in orchestration, Solid grasp of CI/CD tools, Solid grasp of CI/CD workflows, Cloud infrastructure knowledge, Experience with monitoring/logging tools, Experience with infrastructure as code, Experience with ML pipeline orchestration tools, Programming skills in Python, Experience with cluster administration, Experience with systems administration, Experience with networks administration
Nice to Have
Ambitious efforts in the past, Accepted contribution to Linux kernel, Won an important bug bounty, Supported academic grid/cluster computing team, Won a sports championship
What You'll Do.
Optimize infrastructure for ML training
Optimize infrastructure for ML inference
Automate ML/LLM pipelines
Maintain ML/LLM pipelines
Manage model versioning
Manage reproducibility
Work with terabyte-large datasets
Implement ML-centric CI/CD practices
Monitor model performance
Monitor data drift in production
How You'll Work.
Team & Collaboration
Machine learning engineers; Software engineers; Platform teams
Full Job Description
### About Pathway [Pathway](https://pathway.com/) is shaking the foundations of artificial intelligence by introducing the world’s first post-transformer model that adapts and thinks just like humans. Pathway’s breakthrough architecture (BDH) outperforms Transformer and provides the enterprise with full visibility into how the model works. Combining the foundational model with the fastest data processing engine on the market, Pathway enables enterprises to move beyond incremental optimization and toward truly contextualized, experience-driven intelligence. The company is trusted by organizations such as NATO, La Poste, and Formula 1 racing teams. Pathway is led by co-founder & CEO Zuzanna Stamirowska, a complexity scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski who was the first person to apply Attention to speech and worked with Nobel laureate Goeff Hinton at Google Brain, as well as CSO Adrian Kosowski, a leading computer scientist and quantum physicist who obtained his PhD at the age of 20. The company is backed by leading investors and advisors, including TQ Ventures and Lukasz Kaiser, co-author of the Transformer (“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models. Pathway is headquartered in Palo Alto, California. ### The opportunity We are currently searching for a Machine Learning DevOps with experience in cloud and compute cluster management, scaling infrastructures, and Linux administration. Our development, ML training, and production environment is in the cloud, **using several major cloud providers**. We need support in **managing and automating the processes** , and **scaling** the infrastructure to growing team and production needs. ### You Will * Optimize infrastructure for ML training and inference (e.g., GPUs, distributed compute). * Automate and maintain ML/LLM pipelines (data ingestion, training, validation, deployment). * Manage model versioning, reproducibility, and traceability. * Work
Applying for this Machine Learning DevOps - Cloud and Compute Cluster - R&D Support role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Pathway?
Real rants from real employees. Read before you apply.