Southwest Airlines
Technology
SoftwareEngineer-CloudOps
Neural analysis suggests this role is
optimal for Mid candidates.
“Software Engineer - Cloud Ops at Southwest Airlines. Skills: Cloud Operations, AWS, AI Platform Operations, Reliability Engineering, Automation, Scripting. Ensure AI and agentic workloads run reliably in real production environments. Serve as an L1/L2 first responder for AWS-based AI services”
What You'll Achieve.
Ensure AI and agentic workloads run reliably in real production environments; Reduce repeat incidents
Industry & Context.
Identify and resolve issues that arise during the design, testing and maintenance processes using problem-solving skills; Analyze and manage large, complex, and vague Business or technical problems, articulating the problem or root cause, and translating the analysis into viable solution recommendations
Support on-call rotations, Ability to work extended hours as needed
What They're Looking For.
Must Have
Intermediate-level experience, fully functioning broad knowledge in software engineering, 2-5 years of relevant work-related experience, 3+ years in production support / cloud ops on AWS—incidents, on-call rotation, 3+ years in Python or Java for ops scripting and L2 fixes, 2+ years in Cloud-native AWS—Lambda, ECS, S3, DynamoDB, IAM, 2+ years in Observability tools—reading logs, metrics, traces (Datadog/CloudWatch), 2+ years in Incident response, ticket triage, and escalation handling, Intermediate knowledge of software development methodologies, practices, concepts, and technologies, Ability to demonstrate consistent knowledge application, skills of software development, and testing to solve a range of problems, Intermediate knowledge of at least one required programming language, Ability to partner, communicate, and negotiate with various Technology or partner Teams, Ability to analyze and manage large, complex, and vague Business or technical problems, articulating the problem or root cause, and translating the analysis into viable solution recommendations, Ability to take on multiple assignments, whether administrative or project related, while maintaining a successful level of completion in all responsible work, Ability to mentor others to do the same, Ability to develop, present and effectively communicate ideas and strategies to a variety of audiences, Must meet confidentiality expectations as to confidential, proprietary and sensitive Company information
Nice to Have
Exposure to AI/agentic frameworks or LLM APIs, Familiarity with on-call and ticket lifecycle (ServiceNow/Jira), Familiarity with Linux internals and networking, Docker fundamentals, Basic scripting for ops automation (Pythonash)
What You'll Do.
Ensure AI and agentic workloads run reliably in real production environments
Serve as an L1/L2 first responder for AWS-based AI services
and implement improvements
Apply automation and scripting to streamline operations
Support on-call rotations
Reduce repeat incidents
Apply knowledge and skills of software development and testing effectively to solve a range of problems
Document each aspect of a system or application
Determine and assess the needs of the user and then create software to meet the requirements
Identify and resolve issues that arise during the design
testing and maintenance processes
Prepare and install solutions by determining and designing system specifications
How You'll Work.
Team & Collaboration
Work alongside other engineers on the team to elevate technology and consistently apply best practices; Collaborate closely with customers and cross-functional departments to communicate project statuses and proposals; Partner, communicate, and negotiate with various Technology or partner Teams
Communication Scope
Communicate project statuses and proposals; Develop, present and effectively communicate ideas and strategies to a variety of audiences
Full Job Description
**Department:** Technology **Our Company Promise** We are committed to provide our Employees a stable work environment with equal opportunity for learning and personal growth. Creativity and innovation are encouraged for improving the effectiveness of Southwest Airlines. Above all, Employees will be provided the same concern, respect, and caring attitude within the organization that they are expected to share externally with every Southwest Customer. **Job Description:** As a Software Engineer Global supporting the AI Platform Operations Pod with a Cloud Operations focus, you’ll help ensure Southwest’s AI and agentic workloads run reliably in real production environments. This role emphasizes hands‑on cloud operations and reliability—serving as an L1/L2 first responder for AWS‑based AI services while partnering with Sr Engineers to troubleshoot issues, restore service, and implement improvements. You’ll work directly with AWS platforms such as Lambda, EKS, CloudWatch, and IAM, applying automation and scripting to streamline operations, support on‑call rotations, and reduce repeat incidents. Through day‑to‑day exposure to production systems, you’ll strengthen your cloud and operational skills while gaining practical experience supporting AI technologies, including LLMs, vector databases, and agentic frameworks. This role offers a strong foundation for engineers who want to grow into cloud and AI platform operations while contributing to systems that must perform at scale, every day. Responsibilities * Apply knowledge and skills of software development and testing effectively to solve a range of problems * Work alongside other engineers on the team to elevate technology and consistently apply best practices * Collaborate closely with customers and cross-functional departments to communicate project statuses and proposals * Document each aspect of a system or application as a reference for future upgrades and maintenance * Determine and assess the needs of the user and
Applying for this Software Engineer - Cloud Ops role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about Southwest Airlines?
Real rants from real employees. Read before you apply.