Description
Are you passionate about working on cutting-edge technology in cloud infrastructure platform management with a team that embodies the growth mindset? Are you hoping to join an organization built on the mission “to empower every person and every organization on the planet to achieve more”? Then this is the role for you. The Azure Cloud Hardware Infrastructure division (SCHIE) is responsible for the design and development of server and rack infrastructure firmware for Microsoft Online Services.
Responsibilities
Engage in front-line debugging of bug checks, memory errors, CPU errors, boot failures, and security incidents across multiple platform architectures
Perform root cause analysis (RCA) of critical issues and define and develop fixes
Collaborate with multiple teams, such as firmware, operations, and security, to address key customer infrastructure outages
Define and architect automated debug workflows for bug checks, machine checks, and CPU errors using silicon vendor features and cloud orchestration tools
Maintain site reliability and availability SLAs for cloud customers
Define, develop, and implement long-term strategies to reduce infrastructure outages
Qualifications
As a Cloud Firmware Debug Engineer, you will be the first line of defense in analyzing and debugging complex hardware and firmware reliability issues across multiple hardware architectures. You will perform platform and component debug, identify root causes, and develop or propose fixes for firmware and software across the global Azure infrastructure. You will collaborate with engineers across multiple geographies and functional teams (OS, Hardware Infrastructure, Firmware Engineering, RAS, Cloud Operations, Customers, Debug) and help prevent and resolve cloud capacity issues across diverse customers.
Expertise in CPU architectures (2-socket, 4-socket, 8-socket, 16-socket) across Intel, AMD, and ARM is a must
Expertise in one or more areas of platform server architecture (CPU, memory (DDR4/DDR5), PCIe, NVMe, SSD/SAS, Secure Boot, UEFI, BMC, GPUs, InfiniBand, hardware interfaces such as MUX/I2C/SPI, schematics, TPM, converged network adapters/SmartNICs, IPMI, ARM Linux, Windows kernel)
Hands-on expertise in one or more tools such as ITP, ARIUM, WinDbg, Immunity, and other firmware/software/OS debug tools
Ability to trace issues back to patterns and trigger events using debug logs and cloud telemetry data
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.