Intermediate Site Reliability Engineer, Gitaly:Cluster

  • GitLab
  • Remote
  • 01 Oct, 2024

Job Description

GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab.

An overview of this role

The GitLab DevSecOps platform empowers 100,000+ organizations to deliver software faster and more efficiently. We are one of the world’s largest all-remote companies with 2,000+ team members and values that foster a culture where people embrace the belief that everyone can contribute. Learn more about Life at GitLab.

SREs with Gitaly work alongside Backend Engineers with a focus primarily on improving the availability and the reliability of the Gitaly fleet on GitLab.com. While the backend engineers approach their responsibilities from a software developer point of view, the SREs approach the same problems from the operational perspective and collaborate closely on finding an optimal solution, in addition to ensuring that new Gitaly features can run at scale and deployed to production safely.

Gitaly is the Git data storage tier of GitLab, providing a reliable, secure and fast distributed Git data store over gRPC. For more information about Gitaly, see the team’s Direction page. 

Gitaly’s high-availability storage requires developers who understand distributed storage systems, their management, observability and availability. Cluster team contributes features, fixes bugs and improves performance of this software stack.

Currently, we're building a new distributed cluster solution and improvements to our Disaster Recovery readiness.

What you’ll do  

  • Work with peer SREs to maintain Gitaly’s environments within GitLab’s SaaS offerings, including cost and performance optimization, capacity planning, migrations and debugging production issues.
  • Participate in architectural discussions and decisions surrounding Gitaly, within the greater GitLab ecosystem.
  • Design RPC interfaces for the Gitaly service.
  • Scope, estimate and describe tasks to reach the team’s goals.
  • Develop production automation and tooling for Gitaly, for use both in SaaS and self-managed installations.
  • Help ensure that Gitaly development tooling, releases and other processes serve the team and the product’s goals.
  • Develop Gitaly in accordance with the product’s goals and a focus on reliability and maintainability.
  • Instrument, monitor and profile Gitaly in the production environment.
  • Build dashboards and alerts to monitor the health of your services.
  • Conduct acceptance testing of the features you’ve built.
  • Educate all team members on best practices relating to high availability.
  • Write performant, maintainable, and elegant code and peer review others’ code.
  • Be positive and solution-oriented.
  • Constantly improve the quality & security of the product.
  • Take initiative in improving the software in small or large ways to address pain points in your own experience as a developer.
  • Qualify developers for hiring.
  • Respond to user emergencies, platform alerts and support requests, including regular on-call duties.

What you’ll bring 

  • Mandatory: experience running highly-available systems in production environments at scale.
  • Mandatory: hands-on experience with Cloud technologies including Kubernetes.
  • Mandatory: proven professional experience building, debugging, optimizing software in large-scale, high-volume environments.
  • Mandatory: proven professional experience writing and testing high-quality code.
  • Mandatory: a good understanding of building instrumented, observable software systems.
  • Highly desirable: Experience with Terraform infrastructure as code.
  • Highly desirable: proven professional experience writing and testing quality code in Go.
  • Highly desirable: a good understanding of git’s internal data structures or experience running git servers.
  • Highly desirable: experience with gRPC.
  • Highly desirable: willingness to learn Ruby.

About the team

The Gitaly team owns and runs services that handle all Git operations on GitLab.com, one of the largest open source SaaS sites on the Internet. This means we are constantly faced with solving unique performance, scalability, and cost challenges that impact our users every day. Our future is about shipping improvements that can scale both GitLab.com from an infrastructure perspective, as well as deploying new features that will scale with the growing size of repositories across the industry.

The base salary range for this role’s listed level is currently for residents of listed locations only. Grade level and salary ranges are determined through interviews and a review of education, experience, knowledge, skills, abilities of the applicant, equity with other team members, and alignment with market data. See more information on our benefits and equity. Sales roles are also eligible for incentive pay targeted at up to 100% of the offered base salary.

California/Colorado/Hawaii/New Jersey/New York/Washington/DC pay range
$103,600$103,600 USD

Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.  

Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.