Sr. Infrastructure Reliability Engineer, Infrastructure Reliability & Quality
Descrizione dell'offerta
Sr. Infrastructure Reliability Engineer, Infrastructure Reliability & Quality
Job ID: | Amazon Data Services, Inc.
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Responsibilities
As a Senior Infrastructure Reliability Engineer you will:
- Proactively drive reliability risk identification, assessment, and mitigation for datacenter infrastructure equipment (e.g., LV Generator, MV Transformers, LV SWGR, Breakers, UPS, HV Transformers, In‑rack Power shelf, etc.).
- Conduct root cause analysis of critical equipment failures and drive continuous improvements to enhance datacenter availability for AWS customers.
- Collaborate with internal and external partners, including suppliers, to drive product specifications, risk identification plans, and execution.
- Apply physics‑of‑failure based approaches to develop and implement analytical and empirical methods for product quality and reliability risk identification during design, manufacture, and deployment stages.
- Perform lifecycle environmental and operational stress analysis (thermal, electrical, chemical, and mechanical) to identify overstress and fatigue‑related product weaknesses.
- Assess electronics manufacturing process quality/reliability issues and evaluate product design risks.
- Use statistical techniques and models to analyze test and field data.
- Lead critical component identification and vendor selection/qualification requirements at component level.
- Develop datacenter system‑level reliability models, perform reliability quantification, and conduct risk analysis for configuration optimization.
- Monitor product performance in the field, conduct root cause analysis of critical failures, and implement corrective and preventive actions.
- Lead vendor auditing and quarterly review processes to drive continuous improvement of datacenter availability.
Qualifications
- Bachelor’s degree in Electrical or Mechanical Engineering, Engineering Technology, or Reliability Engineering.
- 10+ years of Reliability Engineering work experience in a high‑reliability industry.
- 3+ years of experience with failure analysis activities and root cause analysis.
- 3+ years of experience with accelerated life testing, stress analysis, and finite element analysis.
- Knowledge of reliability engineering tools such as reliability block diagrams, statistical modeling, and data analytics.
- Strong problem analysis, communication, and vendor management skills.
- Ability to travel within the US and internationally.
Preferred Qualifications
- Master’s or Ph.D. in Reliability Engineering, Physics, Electrical, Mechanical, or Materials Engineering, or a related field.
- 10+ years of experience in reliability risk identification and assessment from component to system level using analytical, experimental, and statistical approaches.
- Proven experience applying proactive, cost‑effective reliability approaches throughout product design, manufacture, and deployment stages.
- Experience working with external design and manufacturing supply chain partners.
- Familiarity with major data center infrastructure equipment reliability performance.
- Ability to manage multiple qualification activities and development schedules.
Benefits and Compensation
The base salary range for this position is 136,600.00 – 184,800.00 USD annually. Your Amazon package will include sign‑on payments and restricted stock units (RSUs). Final compensation will be determined based on experience, qualifications, and location. Amazon also offers comprehensive benefits, including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance, and optional supplemental life plans), Employee Assistance Program, mental health support, medical advice line, flexible spending accounts, adoption and surrogacy reimbursement coverage, 401(k) matching, paid time off, and parental leave. Learn more about our benefits at
Amazon is an equal‑opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, please visit for more information.
#J-18808-Ljbffr