Granicus
Site Reliability Engineer
About the job:
Full-time
The Company
Serving the People Who Serve the People
Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and its constituents together. We are on a mission to support our customers with meeting the needs of their communities and implementing our technology in ways that are equitable and inclusive. Granicus has consistently appeared on the GovTech 100 list over the past 5 years and has been recognized as the best companies to work on BuiltIn.
Over the last 25 years, we have served 5,500 federal, state, and local government agencies and more than 300 million citizen subscribers power an unmatched Subscriber Network that use our digital solutions to make the world a better place. With comprehensive cloud-based solutions for communications, government website design, meeting and agenda management software, records management, and digital services, Granicus empowers stronger relationships between government and residents across the U.S., U.K., Australia, New Zealand, and Canada. By simplifying interactions with residents, while disseminating critical information, Granicus brings governments closer to the people they serve—driving meaningful change for communities around the globe.
Want to know more? See more of what we do here.
As a Site/Systems Reliability Engineering manager working on critical services, your mission will be to ensure our services are fast, highly available, scalable, and able to withstand unprecedented increases in load. The Systems Reliability Engineer will be at the heart of solving production problems. Your scope is from the kernel to the application. The position requires the flexibility to take a holistic approach to troubleshoot and the ability to delve deeply into technical details. The Systems Reliability Engineer will build automation tools for system health and production acceptance tests to validate production changes. The Systems Reliability Engineer will ensure the system is well-instrumented and highly fault tolerant.
What your impact will look like:
Engage, influence, and evangelise SRE practices with development, operational and product groups to align technology service/solution delivery.
Drive quality accountability within the organisation with well-defined processes, metrics, and goals for process quality. This includes leading effective post-mortems and ensuring actions are followed up.
Manage availability, latency, scalability, and efficiency of Granicus applications development by instilling engineering reliability into our development life cycle with a focus on fault-tolerant approaches.
Drive capacity planning, performance analysis, instrumentation, and other non-functional systems requirements.
Must be able to define and report “progress” on strategic initiates and project-level tasks to all stakeholders, including senior executives and clients and use effective communication approaches with each constituency.
Implement metrics-driven processes to ensure service quality targets are met.
You will love this job if you have:
Knowledge of defining and monitoring system quality measures, including SLO and SLA
Expert knowledge in designing, developing, and managing large real-time systems
Project and process management
Prior successful experience as a systems performance or site/systems reliability engineer
Mastery of Docker, Containers & Kubernetes
Mastery of Python (desirable) Programming
Mastery of fault-tolerant approaches in a large-scale distributed environment and high-performance systems
Demonstrated experience working in large, complex systems environments
Deep understanding of internet and networking protocols
A passion for performance excellence, robustness, and an engineering mindset
A degree in Computer Science or a related technical field
Hands-on cloud experience (AWS/Azure or both)
At least 5 years of experience as a people manager conducting regular 1:1 and performance/skills assessment
Comfortable working with global teams in different time zones
Collaborate with other engineering teams to improve a process or a strategic solution to a repetitive issue
Resource scheduling for on-call support
+ bonus and benefits
Benefits: At Granicus, we offer a competitive benefits package that allows employees to tailor benefits to their needs. Benefits listed below are for employees based in the U.S.
– Flexible Time Off
– Medical (includes an option that is paid 100% by Granicus!), Dental & Vision Insurance
– 401(k) plan with matching contribution
– Paid Parental Leave
– Employer-paid Short and Long Term Disability Insurance, Group Term Life Insurance and AD&D Insurance
– Group legal coverage
– And more!
Granicus is committed to providing equal employment opportunities. All qualified applicants and employees will be considered for employment and advancement.
RegionUSA Only
Compensation$115K – $125K USD/Year
CategoryDevOps and Sysadmin
Applicants12
Company Benefits
🌎 Distributed Team
🖥 Home Office Budget
📚 Learning Budget
👀 No Monitoring System
🚫 No Politics At Work
⬜️ No Whiteboard Interview
🏖 Paid Time Off
👴🏻 We Hire Old (and Young)
To apply for this job please visit jobs.lever.co.