New York Independent System Operator

Manager, Cloud Platform Engineering

$118–204k · Rensselaer, New York, United States · Full Time · Remote Friendly

The Brief

“Manager, Cloud Platform Engineering at New York Independent System Operator. Skills: Cloud Platform Engineering, DevOps, Site Reliability Engineering, Kubernetes. Lead the team that builds and operates the cloud platform.”

What You'll Achieve.

Deliver roadmap commitments; Improve platform reliability; Improve platform adoption; Improve platform delivery; Reduce operational toil; Improve mean time to detection; Improve mean time to recovery

Industry & Context.

Problems you'll solve

Troubleshooting; Root cause analysis

Eligibility Requirements

24x7 accountability, On-call participation, Work onsite several days

What They're Looking For.

Must Have

5+ years cloud platform engineering, 2+ years management experience

Experience with EKS/container platform, AWS infrastructure-as-code, CI/CD tooling, Terraform, CloudFormation, CDK, Helm

Experience with incident management, change management, problem management

Experience with platform budgets, cloud consumption planning, vendor agreements, cloud service agreements

Experience with platform roadmap, platform performance metrics, platform reliability metrics, platform adoption metrics, platform delivery metrics, platform cost metrics

Experience with AI-assisted engineering

Experience with security controls, operational controls, supply-chain security, container image hardening, vulnerability scanning, dependency management

Experience with monitoring, logging, tracing, alerting, dashboard use

Experience with cloud cost optimization, resource efficiency, consumption management, cost transparency

Experience with production support, on-call participation, incident triage, service restoration, platform operations

Experience with NYISO incident management, NYISO change management, NYISO problem management

Experience with lifecycle management, platform-related tickets, platform-related changes, platform-related issues, platform service improvements

Experience with platform strategy, platform architecture, platform-as-a-product, internal developer platform, service offerings, paved-road delivery paths, service-level objectives, cloud migration goals

Experience with scalable architecture, resilient architecture, secure architecture, performant architecture, container architecture, reusable standards, secure-by-default standards, policy-as-code

Experience with roadmap governance, capability prioritization, delivery progress tracking

Experience with status communication, risk communication, dependency communication, decision communication, IT leadership communication, stakeholder communication

Experience with team development, cloud talent, platform talent, DevOps talent, reliability engineering talent, performance management, security-conscious environment, operationally disciplined environment

Experience with delivery execution, platform initiatives, cloud migrations, modernization efforts, optimization projects, estimates, execution plans, dependencies, delivery commitments

Experience with troubleshooting complex issues, Kubernetes environment, delivery tooling, related infrastructure services, Infrastructure as Code, platform automation, cost optimization

Experience with EKS/container platform reliability, EKS/container platform security, EKS/container platform observability, EKS/container platform supportability

Experience with application team needs, vulnerability management, software supply-chain security

Nice to Have

Kubernetes experience, AWS certification, Cloud security certification, DevOps certification, Site Reliability Engineering certification

What You'll Do.

Lead team building cloud platform

Operate cloud platform

Improve cloud platform

Treat cloud platform as service

Define platform consumers

Define operational expectations

Define service-level objectives

Define forward-looking roadmap

Provide self-service paths

Own Site Reliability Engineering practice

Own EKS/container platform

Own shared delivery tooling

Own security controls

Own compliance controls

Own operational controls

Provide production ownership

Participate in on-call

Perform incident triage

Lead service restoration

Align platform operations with NYISO's incident, change, and problem management processes

Manage lifecycle of tickets

Manage lifecycle of changes

Manage lifecycle of operational issues

Manage lifecycle of service improvements

Develop platform budgets

Manage platform budgets

Forecast capital budgets

Forecast operating budgets

Plan cloud consumption

Manage vendor expenses

Partner on vendor agreements

Partner on cloud service agreements

Oversee vendor performance

Execute platform roadmap

Provide roadmap estimates

Manage roadmap execution plans

Coordinate cross-team dependencies

Deliver roadmap commitments

Maintain roadmap artifacts

Provide progress updates

Report platform performance metrics

Report platform reliability metrics

Report platform adoption metrics

Report platform delivery metrics

Report platform cost metrics

Understand business requirements

Drive cloud best practices

Drive platform-engineering best practices

Support audit obligations

Support regulatory obligations

Advance operating-model shift

Guide cloud architecture

Guide container architecture

Establish reusable standards

Establish policy-as-code

Prioritize platform capabilities

Track roadmap delivery progress

Communicate status to leadership

Communicate risks to leadership

Communicate dependencies to leadership

Communicate decisions to leadership

Attract platform talent

Develop platform talent

Mentor platform talent

Retain platform talent

Attract DevOps talent

Develop DevOps talent

Attract reliability engineering talent

Develop reliability engineering talent

Mentor reliability engineering talent

Retain reliability engineering talent

Provide regular team feedback

Manage team performance

Foster collaborative environment

Foster innovative environment

Foster security-conscious environment

Foster operationally disciplined environment

Foster results-oriented environment

Manage execution of platform initiatives

Manage execution of cloud migrations

Manage execution of modernization efforts

Manage execution of optimization projects

Oversee troubleshooting of complex issues

Champion Infrastructure as Code

Champion platform automation

Drive cloud cost optimization

Drive resource efficiency

Drive consumption management

Drive cost transparency

Own platform production support

Ensure platform operations align with NYISO's incident, change, and problem management processes

Own lifecycle management

Reduce operational toil

Improve mean time to detection

Improve mean time to recovery

Ensure EKS/container platform is reliable

Ensure EKS/container platform is secure

Ensure EKS/container platform is observable

Ensure EKS/container platform is supportable

Drive container image hardening

Drive vulnerability scanning

Drive dependency management

Drive software-supply-chain security

How You'll Work.

Team & Collaboration

Application teams; IT leadership; Procurement; Stakeholders; Cross-functional teams; Business requirements; Audit; Regulatory obligations

Communication Scope

Presentations; Reporting; Updates

Process & Methodology

Roadmap planning, Execution plans, Dependency management, Prioritization

Full Job Description

The New York Independent System Operator (NYISO) manages the efficient flow of electricity on more than 11,000 circuit-miles of high-voltage transmission lines, dispatching power from hundreds of generating units across the state. The New York Independent System Operator (NYISO) applies cutting-edge technology to operating a reliable electricity system, managing competitive markets for wholesale electricity, and planning for the Empire State's energy future.

The NYISO’s Information Technology department invites applications for a full-time Manager, Cloud Platform Engineering.

The Manager, Cloud Platform Engineering leads the team that builds, operates, and continuously improves NYISO’s shared cloud platform, including the Kubernetes/EKS foundation, AWS infrastructure-as-code, delivery tooling, reliability practices, and platform services that application teams use to deploy and operate software. The Manager is responsible for treating the internal cloud platform as a service, with defined consumers, operational expectations, service-level objectives, and a forward-looking roadmap. This includes providing application teams with secure, reliable, cost-effective, and self-service paths to build, deploy, and operate applications on NYISO’s cloud platform.

The role owns the Site Reliability Engineering practice, the EKS/container platform, shared delivery tooling, and the security, compliance, and operational controls embedded across the platform and delivery lifecycle. This includes production ownership for the cloud platform with 24x7 accountability, on-call participation, incident triage, and leadership of service restoration activities. The Manager ensures platform operations align with NYISO’s incident, change, and problem management processes, with accountability for the lifecycle management of platform-related tickets, changes, operational issues, and service improvements.

The Manager develops and manages platform budgets, including multi-year capital and operatin

How to Apply on Greenhouse

  • Create a Greenhouse profile before applying — it saves time across multiple applications.
  • Upload your resume as a PDF; the parser handles it better than Word.
  • Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
  • Enable email notifications to track application status in real time.
