Lead Site/System Reliability Engineer
Location
Glendale, CA | United States
Job description
Job Summary:
“We Power the Magic!” That’s our motto at Disney Parks, Experiences and Products Technology & Digital. Our team creates world-class immersive digital experiences for the Company’s premier vacation brands including Disney’s Parks & Resorts worldwide, Disney Cruise Line, Aulani, A Disney Resort & Spa, and Disney Vacation Club.
We are responsible for the end-to-end digital and physical Guest experience for all technology & digital-led initiatives across the Attractions & Entertainment, Food & Beverage, Resorts & Transportation and Merchandise lines of business as well as other initiatives including MyDisneyExperience and Hey, Disney!
As a Lead System Engineer, you will work in an extremely collaborative and high energy environment. In this role, you will: work directly with cloud architecture, actively engage with development/QA teams, product and project managers, and product/business teams. You will lead project/planning efforts, architectural review, or design, make recommendations for technical decisions, and attend meetings w/ various teams. You will provision, tune, and automate cloud-based services and applications. You will provide systems administration to make our applications highly available, application maintenance and support.
What You'll Do:
- Responsible for creating breakdown of project tasks to meet project objectives
- Responsible for creating work breakdown of tasks and managing own Jira Epics and Stories
- Communicating project status and potential blockers to leadership each week
- Assisting with capacity management of our services to meet demands during launches
- Accountable for/teaching other engineers how to create breakdowns of tasks
- Accountable for/teaching other engineers how to complete tasks on time
- Responsible to help unblock self and escalate to the right leader if necessary
- Partner with AID team to keep architecture diagrams updated after changes
Required Qualifications & Skills :
- Minimum of 7 years of related work
- Excellent communication and relationship skills, and be able to articulate advanced technical topics to both technical and non-technical staff
- Communicate clearly in high pressure situations to all levels of stakeholders
- Represent our organization in incidents that involve other business units
- Experience troubleshooting during on-call situations: demonstrable analytical and problem-solving skills
- Experience in technical process, incident response, and change management ( ITIL experience )
- Experience with compliance and vulnerability management
- Be able to collect and articulate forensic details to facilitate root cause analysis
- Be a self-starter, lead project/planning efforts, architectural design, lead meetings as needed to reach objectives, attend team and product meeting as needed
- In addition, you can guide other Systems Engineers to:
- Prioritize workload
- Discuss technical challenges and collaborate to identify effective solutions
- Manage multiple tasks from start to finish
Basic Technical Qualifications:
- Technical experience in consumer and employee-facing enterprise systems
- Ability to deep-dive/troubleshooting applications and systems to restore service as quickly as possible
- Expertise in maintaining web, caching, and queuing technologies in large high traffic environments
- Expertise in architecting highly scalable and highly available systems
- Expertise in a public cloud (AWS or Google’s GCP)
- Expertise in configuration management
- Proficiency with containerization
- Proficiency in a programming language
- Proficiency with distributed version control systems (for example GIT) with Continuous Integration/Deployment techniques
- Proficiency in supporting SQL and NoSQL technologies
Preferred Technical Qualifications:
- AWS Cloud (Fargate, ECS, Lambdas, ApiGateways, EC2, S3, ALB/ELB, Elasticache, VPC, IAM, EKS, KMS-Secret Manager)
- Google Cloud Platform (App Engine, Kubernetes ( Helm/Tiller ), Cloud Functions, Firebase, IAM
- Vault
- Logging/Monitoring/Alerting (Cloudwatch/Splunk/AppDynamics/Elasticache/Grafana)
- MessageQueueing: RabbitMQ, PubSub
- Source Code Management Tools: Github, GitLab, Jenkins/CICD/Gitlab
- Terraform/Atlantis
- F5 LTM
- Languages: Go/Python/Node.js (Angular.js framework)/Java (Spring MVC)
- Rundeck/Chef/Ansible
Education:
- Bachelor’s degree in Computer Science, Information Systems, Software, Electrical or Electronics Engineering, or comparable field of study, and/or equivalent work experience
The hiring range for this position in California is $124,000.00-$166,200.00 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
Job tags
Salary