Hewlett Packard Enterprise
High Performance Computing DevOps\Site Reliability Engineer

High Performance Computing DevOps\Site Reliability Engineer

Sofia full-time

About us

Hewlett Packard Enterprise (HPE) advances the way people live and work. We bring together curious minds to create breakthrough technology solutions, helping our customers make their mark on the world. One of our core principles is belief in the power of people. Recognizing that our people are HPE’s biggest competitive advantage allows us to focus on ensuring that we keep the employee at the center of everything we do - and we trust our Human Resources (HR) division to look after our biggest asset. This includes everything from recruiting activities, administration, compensation, performance management, employee development, as well as the allocation of our industry leading work-life-balance and training programs.


We are looking for experienced HPC DevOps / SRE (Site Reliability Engineer) to join our dynamic and highly professional GreenLake Management Services (GMS) team in Sofia.

The role would be part of a dedicated team of Automation & HPC Expertise consultants available to simplify common HPC tasks for a high priority (F500 listed) GMS customer.

This is a hybrid, shift-based role with approximately 30% on-site engagement.

How you will make your mark:

Maintaining and improving the existing HPC system operations
Using principles of IaC and CI/CD to bring up systems on a new level
for automation and reliability of a complex High-performance computing environment, based on latest and most advanced technologies
Operate and administer the stack covering: General system updates; System performance; Outage management; Change and capacity management
Provide leadership in technical incident and problem management and in their resolution, working closely with end customers and HPE remote and field support staff
Develop action plans to investigate and resolve complex issues/problems and communicate to engineers and customer
Lead delivery efforts for specific customer
Participate/Drive ITIL based practices.

Requirements and necessary skills

University degree in Computer Science or relevant experience
Strong verbal and written communication in English
Strong analytical and problem-solving skills
Comfortable and effective in combining technical expertise with customer service
Willing to work “On call” on a team rotating basis
Ability to gather data, perform analysis of customer reported issues, produce and keep up to date relevant technical documentation
Desire to learn and grow by utilizing and learning new technologies
Desire to challenge how things are done if there is a better way
Proactive service orientation.
Key technical qualifications:

DevOps/SRE experience and knowledge of automation and IaC toolchain (Ansible, Python, PowerShell, etc.)
Experience with Bitbucket, Jenkins and Foreman
Experience with Bright Cluster Manager, HPC Compute, GPFS, Mellanox or equivalent
Good understanding of Linux/Unix Server technologies (RedHat, CentOS)
Experience with operating large and complex environments (Major incident management, Coordination and rollout of maintenances), based on ITIL based practices
Familiarity with JIRA, Grafana and ServiceNow hands-on experience
Familiarity working with VMs (VMware) and Containers (Docker) considered a plus.

We offer

Attractive compensation package
Career and Development - worldwide career opportunities, access to a high-tech Engineering Lab
Work That Fits Your Life ­- 24 days annual paid leave, have a free afternoon once a month, 6 months paid parental leave with 100% of your salary, possibility to work from home, transition support through life events
Wellness and Health Programs
Socially Engaged Community - 60 hours/year additional time off for volunteering, plastic free office, participation in socially responsible causes via partnership with 50+ non-government organizations.
Exciting Workplace Experience.
Join us and make your mark!

Want to know more about it?

Then let’s stay connected!

HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT and Affirmative Action employer. We are committed to diversity and building a team that represents a variety of backgrounds, perspectives, and skills. We do not discriminate and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global diverse team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together.
JobTiger Banner
JobTiger Banner