Datacenter Operations Manager

Oracle · Turin, Piedmont, Italy


Job offer description

Key details:

Location: mostly on-site in Turin (at most 30-40 minutes' travel from Turin)

Key skills: experience managing a data center (DC), including cooling systems, power, and space capacity. An engineering background is highly welcome. Project management experience is also desirable: because of expansion, we have multiple ongoing projects that require supervision.

Working pattern: normal business hours 9 AM to 5 PM

Travelling: possible occasional travel to our other DCs in Milan, IT, or Madrid, ES

Job description:

As Operations Site Manager, you will be the technical liaison between the technology teams and the Data Center environment you run, and you will be key to maintaining all operational aspects of these sites. You will support our growth path and be recognized as a ‘technical expert’ with a focus on core Data Center infrastructure. You will troubleshoot and solve all but the most complex infrastructure issues. As a pragmatic problem solver across a wide range of Data Center environments and systems, you understand which issues to escalate to the appropriate resolver groups. You will proactively monitor the environments by checking monitoring systems and ticket queues and by observing the sites with your own eyes and ears, taking corrective action as needed. You will work effectively with other groups involved in maintaining the environment, including the systems housed within it. You will make recommendations for improvement. You will contribute to, create and maintain documentation. You will understand all aspects and quirks of the sites you support, from one-line diagrams through to cooling infrastructure. You know how to innovate and make decisions on your own, but also how to take direction when it is given, paying attention to all the details involved.

You can execute small projects on your own and work with others in planning and executing larger projects. This would suit an individual who is able to add significant value based on existing experience whilst evolving their career upstream into Cloud Services.

This is an individual contributor role, with matrix management of several teams.

Responsibilities

  • Conducting audits for power and mechanical capacity and overseeing changes.
  • Collaborating with internal teams to troubleshoot and perform Root Cause Analysis (RCA) and Corrective Action (CA) for issues.
  • Liaising with local colocation partners to fully understand site topology and articulate issues as needed.
  • Ensure rack deliveries and installations are planned and managed effectively.
  • Developing and sustaining 5S standards on each site you are responsible for.
  • Working with the build team to roll out expansions.
  • Manage and prioritise tasks via internal tools, e.g. JIRA and Confluence.
  • Uphold meticulous documentation and adherence to SOPs.
  • Collaborate with project teams and colocation partners to validate the functionality of electrical and mechanical systems.
  • Extend operational support encompassing failure mode analysis, root cause identification, maintenance assistance, best practices, procedural reviews, and more.
  • Participate in operational reviews, gathering and analyzing technical data to pinpoint and rectify existing reliability and availability concerns.
  • Collaborate effectively with engineering and design teams for issues and improvements.
  • Offer expertise to address and mitigate global resiliency, reliability, and availability risks.
  • Collaborate with internal teams to set and maintain standards, ensuring consistency and reliability in service delivery.
  • Emerge as the go-to technical authority within the team and across other departments for your assigned area.
  • Approach challenges with a positive attitude, offering innovative and creative solutions.
  • Be intimately familiar with all aspects of LV cabling (copper, fibre) and related components, and cable management strategies.
  • Coaching and mentoring other individual contributors across the organization.
  • Have a DevOps mindset; the ability to code is desirable (scripting, SQL, low-code APEX environments).
  • Working globally to share ideas, best practices, lessons learnt and initiatives.
  • Be able to travel, often at short notice, to multiple campuses in Italy.
  • Awareness of and adherence to all applicable Health & Safety rules.

Additionally, you should be able to dive deep into any part of the stack, value simplicity, work comfortably in a collaborative, agile environment, and be excited to learn. You should be fully versed in Data Centre build, live operations management and change control, and be familiar with 5S.
