New York Independent System Operator
Manager,CloudPlatformEngineering
Neural analysis suggests this role is
optimal for Manager candidates.
“Manager, Cloud Platform Engineering at New York Independent System Operator. Skills: Cloud Platform Engineering, DevOps, Site Reliability Engineering, Kubernetes. Lead team building cloud platform. Operate cloud platform”
What You'll Achieve.
Deliver roadmap commitments; Improve platform reliability; Improve platform adoption; Improve platform delivery; Reduce operational toil; Improve mean time to detection; Improve mean time to recovery
Industry & Context.
Troubleshooting; Root cause analysis
24x7 accountability, On-call participation, Work onsite several days
What They're Looking For.
Must Have
5+ years cloud platform engineering, 2+ years management experience, Experience with EKS/container platform, Experience with AWS infrastructure-as-code, Experience with CI/CD tooling, Experience with Terraform, Experience with CloudFormation, Experience with CDK, Experience with Helm, Experience with incident management, Experience with change management, Experience with problem management, Experience with platform budgets, Experience with cloud consumption planning, Experience with vendor agreements, Experience with cloud service agreements, Experience with platform roadmap, Experience with platform performance metrics, Experience with platform reliability metrics, Experience with platform adoption metrics, Experience with platform delivery metrics, Experience with platform cost metrics, Experience with AI-assisted engineering, Experience with security controls, Experience with operational controls, Experience with supply-chain security, Experience with container image hardening, Experience with vulnerability scanning, Experience with dependency management, Experience with monitoring, Experience with logging, Experience with tracing, Experience with alerting, Experience with dashboard use, Experience with cloud cost optimization, Experience with resource efficiency, Experience with consumption management, Experience with cost transparency, Experience with production support, Experience with on-call participation, Experience with incident triage, Experience with service restoration, Experience with platform operations, Experience with NYISO incident management, Experience with NYISO change management, Experience with NYISO problem management, Experience with lifecycle management, Experience with platform-related tickets, Experience with platform-related changes, Experience with platform-related issues, Experience with platform service improvements, Experience with platform strategy, Experience with platform architecture, Experience with platform roadmap, Experience with platform-as-a-product, Experience with internal developer platform, Experience with service offerings, Experience with paved-road delivery paths, Experience with service-level objectives, Experience with cloud migration goals, Experience with scalable architecture, Experience with resilient architecture, Experience with secure architecture, Experience with performant architecture, Experience with container architecture, Experience with reusable standards, Experience with secure-by-default standards, Experience with policy-as-code, Experience with roadmap governance, Experience with capability prioritization, Experience with delivery progress tracking, Experience with status communication, Experience with risk communication, Experience with dependency communication, Experience with decision communication, Experience with IT leadership communication, Experience with stakeholder communication, Experience with team development, Experience with cloud talent, Experience with platform talent, Experience with DevOps talent, Experience with reliability engineering talent, Experience with performance management, Experience with security-conscious environment, Experience with operationally disciplined environment, Experience with delivery execution, Experience with platform initiatives, Experience with cloud migrations, Experience with modernization efforts, Experience with optimization projects, Experience with estimates, Experience with execution plans, Experience with dependencies, Experience with delivery commitments, Experience with troubleshooting complex issues, Experience with Kubernetes environment, Experience with delivery tooling, Experience with related infrastructure services, Experience with Infrastructure as Code, Experience with platform automation, Experience with cost optimization, Experience with EKS/container platform reliability, Experience with EKS/container platform security, Experience with EKS/container platform observability, Experience with EKS/container platform supportability, Experience with application team needs, Experience with vulnerability management, Experience with dependency management, Experience with software supply-chain security
Nice to Have
Kubernetes experience a plus, AWS certification a plus, Cloud security certification a plus, DevOps certification a plus, Site Reliability Engineering certification a plus
What You'll Do.
Lead team building cloud platform
Operate cloud platform
Improve cloud platform
Treat cloud platform as service
Define platform consumers
Define operational expectations
Define service-level objectives
Define forward-looking roadmap
Provide self-service paths
Own Site Reliability Engineering practice
Own EKS/container platform
Own shared delivery tooling
Own security controls
Own compliance controls
Own operational controls
Provide production ownership
Participate in on-call
Perform incident triage
Lead service restoration
Align platform operations with processes
Manage lifecycle of tickets
Manage lifecycle of changes
Manage lifecycle of operational issues
Manage lifecycle of service improvements
Develop platform budgets
Manage platform budgets
Forecast capital budgets
Forecast operating budgets
Plan cloud consumption
Manage vendor expenses
Partner on vendor agreements
Partner on cloud service agreements
Oversee vendor performance
Execute platform roadmap
Provide roadmap estimates
Manage roadmap execution plans
Coordinate cross-team dependencies
Deliver roadmap commitments
Maintain roadmap artifacts
Provide progress updates
Report platform performance metrics
Report platform reliability metrics
Report platform adoption metrics
Report platform delivery metrics
Report platform cost metrics
Understand business requirements
Drive cloud best practices
Drive platform-engineering best practices
Support audit obligations
Support regulatory obligations
Advance operating-model shift
Guide cloud architecture
Guide container architecture
Establish reusable standards
Establish policy-as-code
Prioritize platform capabilities
Track roadmap delivery progress
Communicate status to leadership
Communicate risks to leadership
Communicate dependencies to leadership
Communicate decisions to leadership
Attract platform talent
Develop platform talent
Mentor platform talent
Retain platform talent
Attract DevOps talent
Develop DevOps talent
Attract reliability engineering talent
Develop reliability engineering talent
Mentor reliability engineering talent
Retain reliability engineering talent
Provide regular team feedback
Manage team performance
Foster collaborative environment
Foster innovative environment
Foster security-conscious environment
Foster operationally disciplined environment
Foster results-oriented environment
Manage platform initiatives execution
Manage cloud migrations execution
Manage modernization efforts execution
Manage optimization projects execution
Oversee troubleshooting complex issues
Champion Infrastructure as Code
Champion platform automation
Drive cloud cost optimization
Drive resource efficiency
Drive consumption management
Drive cost transparency
Own platform production support
Ensure platform operations align
Own lifecycle management
Reduce operational toil
Improve mean time to detection
Improve mean time to recovery
Ensure EKS/container platform is reliable
Ensure EKS/container platform is secure
Ensure EKS/container platform is observable
Ensure EKS/container platform is supportable
Drive container image hardening
Drive vulnerability scanning
Drive dependency management
Drive software-supply-chain security
How You'll Work.
Team & Collaboration
Application teams; IT leadership; Procurement; Stakeholders; Cross-functional teams; Business requirements; Audit; Regulatory obligations
Communication Scope
Presentations; Reporting; Updates
Process & Methodology
Roadmap planning, Execution plans, Dependency management, Prioritization
Full Job Description
The New York Independent System Operator (NYISO) manages the efficient flow of electricity on more than 11,000 circuit-miles of high-voltage transmission lines, dispatching power from hundreds of generating units across the state. The New York Independent System Operator (NYISO) applies cutting-edge technology to operating a reliable electricity system, managing competitive markets for wholesale electricity, and planning for the Empire State's energy future. The NYISO’s Information Technology department invites applications for a full-time Manager, Cloud Platform Engineering. The Manager, Cloud Platform Engineering leads the team that builds, operates, and continuously improves NYISO’s shared cloud platform, including the Kubernetes/EKS foundation, AWS infrastructure-as-code, delivery tooling, reliability practices, and platform services that application teams use to deploy and operate software. The Manager is responsible for treating the internal cloud platform as a service, with defined consumers, operational expectations, service-level objectives, and a forward-looking roadmap. This includes providing application teams with secure, reliable, cost-effective, and self-service paths to build, deploy, and operate applications on NYISO’s cloud platform. The role owns the Site Reliability Engineering practice, the EKS/container platform, shared delivery tooling, and the security, compliance, and operational controls embedded across the platform and delivery lifecycle. This includes production ownership for the cloud platform with 24x7 accountability, on-call participation, incident triage, and leadership of service restoration activities. The Manager ensures platform operations align with NYISO’s incident, change, and problem management processes, with accountability for the lifecycle management of platform-related tickets, changes, operational issues, and service improvements. The Manager develops and manages platform budgets, including multi-year capital and operatin
Applying for this Manager, Cloud Platform Engineering role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about New York Independent System Operator?
Real rants from real employees. Read before you apply.