Hewlett Packard Enterprise (HPE) advances the way people live and work. We bring together curious minds to create breakthrough technology solutions, helping our customers make their mark on the world. One of our core principles is belief in the power of people. Recognizing that our people are HPE’s biggest competitive advantage allows us to keep the employee at the center of everything we do, and we trust our Human Resources (HR) division to look after our biggest asset. This includes everything from recruiting, administration, compensation, performance management, and employee development to the allocation of our industry-leading work-life balance and training programs.
We are looking for an experienced HPC DevOps / SRE (Site Reliability
Engineer) to join our dynamic and highly professional GreenLake
Management Services (GMS) team in Sofia.
The role is part of a dedicated team of Automation & HPC Expertise
consultants who simplify common HPC tasks for a high-priority
(F500-listed) GMS customer.
This is a hybrid, shift-based role with approximately 30% on-site
engagement.
How you will make your mark:
Maintain and improve the existing HPC system operations
Use IaC and CI/CD principles to bring the automation and reliability
of a complex high-performance computing environment, based on the
latest and most advanced technologies, to a new level
Operate and administer the stack, covering general system updates,
system performance, outage management, and change and capacity
management
Provide leadership in managing and resolving technical incidents and
problems, working closely with end customers and HPE remote and field
support staff
Develop action plans to investigate and resolve complex issues and
problems, and communicate them to engineers and the customer
Lead delivery efforts for a specific customer
Participate in and drive ITIL-based practices.