Senior Site Reliability Engineer (Platform)

Permanent
Australia - Sydney

Build software and systems to monitor, scale, and deploy a distributed cloud service supporting millions of devices globally.

  • Scale and automate cloud services serving more than 4 billion HTTP requests/day
  • Autonomous environment with high ownership and impact to customers & product
  • Shape the future of container and serverless technologies

Senior Site Reliability Engineer (Platform)

  • Scale and automate cloud services serving more than 4 billion HTTP requests per day
  • Autonomous environment with high ownership and impact to customers & product
  • Shape the future of container and serverless technologies toward a microservice-like architecture

We're looking for a Senior Engineer to help build software and systems to monitor, scale, and deploy a distributed cloud service supporting millions of devices globally. Autonomous environment and supported professional development.

The company & team

Our client is a cloud-managed IT company headquartered in San Francisco and expanding rapidly in Sydney. They provide a full-suite of cloud-controlled products powering critical infrastructure of network switches, security appliances, wireless APs and security cameras. Backed by the resources and branding of a stable industry giant, they operate as an autonomous unit with great engineering culture.

The Cloud Platform Team is responsible for building and scaling the platform on which applications and services are deployed. The team is excited about moving towards a more microservice-like architecture, leveraging existing on-prem infrastructure along with AWS and GCP.

You will be responsible for shaping the future of container and serverless technologies, on a global network that is supported by eight data-centers on five continents.

Exciting challenges you'll be tackling:

  • Work with stakeholders across the organization to design and implement CI tooling to support building container images for various projects in our mono-repo.
  • Build integrations with our deployment infrastructure to bring rolling upgrades and blue/green deployments to our existing tooling.
  • Partner with our Infrastructure team to develop seamless load balancing of services across different clouds (from our own address space).
  • Research solutions that would allow us to leverage public cloud for more of our workloads while still meeting various regulatory requirements for data locality around the world.

Technical requirements

  • Have 5+ years experience across a mix of software development and systems administration roles.
  • Script or code with 1-2 languages like Ruby, Scala, Python or Bash. You are comfortable digging into other people's source code in search of the root cause of a problem and you automate all the things.
  • Care about the customer experience. You have experience supporting an externally-facing production environment.
  • Believe in the Unix way. You build large systems out of small components that each do one job and do it well. We run Debian.
  • Have experience on a pager rotation where you responded to escalations quickly to minimize customer downtime. This role requires being part of an on-call rotation.
  • Are familiar with tools such as Kubernetes, Docker, Ansible, Vault, ElasticSearch, Postgres.

Apply for this job.

Attach… Change
Attach… Change
Loading-grey
Submitting your application