Engineering Manager, Site Reliability Engineering (Traffic) at Wikimedia Foundation


The Wikimedia Foundation is the nonprofit organization that operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge freely. We host Wikipedia and the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive.

Title: Engineering Manager, Site Reliability Engineering (Traffic)

Location: Lagos

Summary

  • The Wikimedia Foundation is looking for an Engineering Manager to join our SRE team, reporting to the Director of Site Reliability Engineering. As Engineering Manager, you will be responsible for supporting the engineers developing our infrastructure and supporting the services that depend on it, used by hundreds of millions of people around the world.
  • In this role, you will be managing the Traffic team, a remote-first, globally distributed team responsible for designing and operating a private and privacy-protecting global CDN for Wikipedia and related sister projects. Our Traffic stack handles public DNS, global request routing, TLS termination, HTTP content caching, and internal service routing for the public services of the Foundation, and is a critical layer in bringing our free content to our global audiences.

You are responsible for:

  • Managing one to two globally distributed teams within Wikimedia’s Site Reliability Engineering organization
  • Recruiting, hiring, and helping onboard new team members
  • Working with team members to set individual performance goals, and supporting them in meeting and evolving their goals and career path
  • Triaging incoming workload, maintaining focus on priorities, and setting realistic expectations for both peers and team members
  • Coordinating and communicating with other members of the Wikimedia engineering teams on relevant projects, and contributing to the organizational strategy
  • Continuously developing the roadmap of the team in alignment with other SRE and Technology teams, and helping to draft and execute the team’s annual and quarterly plans
  • Project managing new and existing initiatives
  • Leading the definition, refinement, and execution of the processes through which the team manages and performs work
  • Leading incident response, diagnosis, and follow-up on system alerts and outages across Wikimedia’s production infrastructure
  • Facilitating the definition and establishment of Service Level Objectives, and tracking Error Budgets with service owners and stakeholders

Skills and Experience:

  • Prior experience managing teams
  • Prior hands-on experience with software or reliability engineering (within the last 3 years preferred)
  • Aptitude for automation and streamlining of tasks
  • Ability to communicate effectively in both spoken and written English
  • Ability to work both independently and as an effective part of a globally distributed team
  • Willing and able to travel several times a year for occasional in-person meetings
  • B.S. or M.S. in Computer Science or the equivalent in related work experience

Qualities that are important to us:

  • Commitment to the mission of the organization and our values
  • Commitment to our guiding principles
  • Ability to disagree respectfully while still working towards a solution
  • Good at asynchronous communication
  • Solutions-focused. The Wikimedia ecosystem is complex, resources are limited, and our guiding principles are ambitious. We want you to work to find solutions embracing these factors.
  • Self-motivated, with an ability to navigate ambiguity and bring a project to completion with limited direction
  • Curiosity and commitment to learn

Additionally, we’d love it if you have:

  • Experience working in a distributed, largely remote environment
  • Experience contributing to open source projects

Deadline: Not Stated

How to Apply: Candidates should Click Here to apply.
