Senior Software Engineer, Site Reliability Engineering
Company: Google
Location: Sunnyvale
Posted on: April 1, 2026
|
|
|
Job Description:
info_outline X Applicants in San Francisco: Qualified
applications with arrest or conviction records will be considered
for employment in accordance with the San Francisco Fair Chance
Ordinance for Employers and the California Fair Chance Act.Note: By
applying to this position you will have an opportunity to share
your preferred working location from the following: Sunnyvale, CA,
USA; San Francisco, CA, USA; Mountain View, CA, USA; Pittsburgh,
PA, USA; Durham, NC, USA; Raleigh, N.C., USA . Minimum
qualifications: Bachelor’s degree in Computer Science, a related
field, or equivalent practical experience. 5 years of experience
with software development in one or more programming languages. 3
years of experience in designing, analyzing, and troubleshooting
large-scale distributed systems. 2 years of experience leading
projects and providing technical leadership. Preferred
qualifications: Master's degree in Computer Science or Engineering.
About the job Hope is not a strategy. Engineering solutions to
design, build, and maintain efficient large-scale systems is a true
strategy, and a good one. Site Reliability Engineering (SRE) is an
engineering discipline that combines software and systems
engineering to build and run large-scale, massively distributed,
fault-tolerant systems. SRE ensures that Google's services—both our
internally critical and our externally-visible systems—have
reliability and uptime appropriate to users' needs and a fast rate
of improvement while keeping an ever-watchful eye on capacity and
performance. SRE is also a mindset and a set of engineering
approaches to running better production systems—we build our own
creative engineering solutions to operations problems. Much of our
software development focuses on optimizing existing systems,
building infrastructure and eliminating work through automation. As
SREs are responsible for the big picture of how our systems relate
to each other, we use a breadth of tools and approaches to solve a
broad spectrum of problems. Practices such as limiting time spent
on operational work, blameless postmortems and proactive
identification of potential outages factor into iterative
improvement that is key to both product quality and interesting and
dynamic day-to-day work. SRE's culture of diversity, intellectual
curiosity, problem solving and openness is key to its success. Our
organization brings together people with a wide variety of
backgrounds, experiences and perspectives. We encourage them to
collaborate, think big and take risks in a blame-free environment.
We promote self-direction to work on meaningful projects, while we
also strive to create an environment that provides the support and
mentorship needed to learn and grow. To learn more: Check out Site
Reliability Engineering , written by Google SREs. Watch a recorded
Hangout on Air to meet some of our SREs. Read a career profile
about why a software engineer chose to join SRE. Behind everything
our users see online is the architecture built by the Technical
Infrastructure team to keep it running. From developing and
maintaining our data centers to building the next generation of
Google platforms, we make Google's product portfolio possible.
We're proud to be our engineers' engineers and love voiding
warranties by taking things apart so we can rebuild them. We keep
our networks up and running, ensuring our users have the best and
fastest experience possible. The US base salary range for this
full-time position is $174,000-$252,000 bonus equity benefits. Our
salary ranges are determined by role, level, and location. Within
the range, individual pay is determined by work location and
additional factors, including job-related skills, experience, and
relevant education or training. Your recruiter can share more about
the specific salary range for your preferred location during the
hiring process. Please note that the compensation details listed in
US role postings reflect the base salary only, and do not include
bonus, equity, or benefits. Learn more about benefits at Google .
Responsibilities Engage in and improve the whole lifecycle of
services—from inception and design, through to deployment,
operation and refinement. Support services before they go live
through activities such as system design consulting, developing
software platforms and frameworks, capacity planning and launch
reviews. Maintain services once they are live by measuring and
monitoring availability, latency and overall system health. Scale
systems sustainably through mechanisms like automation, and evolve
systems by pushing for changes that improve reliability and
velocity. Practice sustainable incident response and blameless
postmortems.
Keywords: Google, Castro Valley , Senior Software Engineer, Site Reliability Engineering, IT / Software / Systems , Sunnyvale, California