Infrastructure Quality Engineer, Infrastructure Reliability & Quality (IRQ)
Other Engineering, Quality Assurance
Description
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Our AWS Infrastructure Reliability & Quality (IRQ) engineering team provides engineering support for our data center infrastructure equipment (Air Handling Unit, Switchgear, Breaker, Panel Board, UPS, Transformer, Generator, ATS etc.). As a member of this team you will be proactively driving quality and reliability risk identification, assessment and mitigation for data center equipment. You will also be responsible for root cause analysis of critical equipment failures, supplier process breakdown and drive continuous improvements to improve datacenter availability for AWS customers. You will work closely with both internal and external partners including suppliers to define product specifications, risk identification plans and mitigations. Internally you will collaborate with AWS Engineering, Procurement, Construction, Commissioning, Operations and Field Engineering teams. Externally you will manage supplier qualification, quality and reliability monitoring, supplier issue resolution and supplier development and continuous improvement initiatives that span the product lifecycle. You must have can-do attitude, be ownership minded, independent, action- and results-oriented to succeed in our open collaborative environment.
Key job responsibilities
- Develop, implement and maintain equipment quality and reliability roadmaps by collaborating with engineering, operations, and procurement teams.
- Define, monitor and achieve the correct quality/reliability performance targets for each equipment.
- Verify AWS quality standards are met at suppliers through in-person and remote audits.
- Establish and monitor end-of-line and incoming inspection/first article inspection plans.
- Support supplier and equipment qualification and assessment processes in support of procurement teams including issue resolution.
- Collaborate globally with suppliers to resolve field issues through Root Cause Analysis and corrective actions. Escalate complex failure investigations to AWS Senior/Principal Engineers.
- Develop and support suppliers with product improvement initiatives and Key Performance Indicators (KPI). Provide a feedback mechanism from suppliers to internal teams to resolve joint quality issues.
- Support internal AWS teams in New Product Development (NPD) initiatives including Failure Mode and Effect Analysis (FMEA) of design and manufacturing processes.
- Ensure AWS products meet or exceed industry standards for initial quality and long-term reliability performance.
- Analyze product design assumptions and AWS operational requirements to identify and mitigate equipment performance risks.
- Drive Continuous Process Improvement strategy through identification of new qualification criteria, test requirements, preventative maintenance checkpoints or specification to improve overall equipment resilience
- Successfully handle concurrent projects, sometimes in multiple geographical regions.
- Travel required, both international and domestic, approximately 30-50%
About the team
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.