Float
Software Engineer
Overview
Application
Description
Who We Are
Float is the world’s leading software for teams to plan their time. Launched in 2012, we’ve grown every year since, and remain proudly independent, self-funded and profitable. As a certified B Corporation, we’re committed to making a positive contribution to our team, customers, the environment, and the remote community. We’re a team of 50 working 100% remotely who believe in living our Best Work Life. You’ll. partner with team members globally, including Australia, Mexico, Italy, Nigeria, Canada, and the USA. Hear what our team has to say by browsing , or reading our . Check out what our customers think of Float .
We’re on a scale up journey, and we’re seeking people who thrive in this stage, given the autonomy, and the opportunity, to do the best work of their career.
Why We’re Hiring For This Role
The role of Site Reliability Engineers at Float is to increase the autonomy of the product and engineering teams by growing their capabilities to focus on solving problems. SRE makes sure our engineers get scalable infrastructure to build software on top of, making sure pipelines from idea to customer run smoothly and are easily built upon, and we also deal with broad areas of security around our network and defining internal security policy and practices.
Our goals for the Engineering team are to increase the pace with which they deliver improvements for our customers, provide an increasingly sophisticated and reliable service from our teams, and mitigate external threats as we grow.
You will help us tackle those problems by increasing reliability of our services to support larger clients joining Float, and increasing the robust security systems we’ve implemented to continue protecting our growing customer base.
Chris Nash, our Team Lead (SRE & QA), explains the important role you will play within our SRE team. .
You’ll be working asynchronously with a bright, dedicated team from across the globe, with a strong focus on taking complex problems and creating solutions that feel simple and intuitive for our customers.
What You’ll Be Responsible For
Early on, you’ll jump right into:
Continuing to support the regular maintenance of all the engineering systems supporting Float’s customers
Identifying areas requiring support to scale
Identifying areas for improving service resilience, ultimately delivering the ability to be resilient within the product and engineering teams themselves
Optimizing our monitoring and observability stack, building on the knowledge to create a standard set of tools and configurations for the product and engineering teams
Understanding Float’s SLOs in context, and building out SLO patterns and procedures for product and engineering teams
Once you are settled, we expect that you will jump into the following projects:
Building a repeatable and trustworthy disaster recovery program using chaos engineering techniques
Migrating all of our deployment configurations to a global single source of truth
Expanding Float’s infrastructure across multiple regions to create a global network
What You’ll Need To Be Successful
We want you to love your work and believe that these skills will allow you to succeed in the role.
Applying these skills requires:
An senior-level understanding of how SRE operates as an enabling team
A very good understanding of Service Level Objectives
Extensive knowledge of Kafka administration
Working experience with Terraform, Bash, and a go-to language which ideally would be one of PHP, NodeJS, Python
Experience with Kubernetes and GCP would be highly valued
To apply for this job please visit apply.workable.com.