Site Reliability Engineer - Kalix (Australia/New Zealand/Japan)

Remote
Full Time
Engineering
Mid Level

Lightbend operates Kalix, a cloud platform that makes distributed systems and design patterns consumable as a service. Our mission is to take care of the complexities of running distributed systems, allowing developers to focus on their business logic while delivering resilient and scalable systems. We are taking the traditional stateless FaaS model, and turning it on its head, pushing into the uncharted territory of managing stateful application code, built on the solid foundations of tried and tested distributed computing principles that we have successfully delivered over more than a decade.

We are looking for experienced Site Reliability Engineers in Australia/New Zealand/Japan time zones to join our Cloud Services team who are excited to leverage leading SRE practices to operate highly resilient and scalable systems. 

Responsibilities:

  • Develop and extend software to monitor and improve end-to-end platform performance, identify runtime deficiencies, find potential failures, and fix production issues in a fully managed multi-cloud environment.
  • Participate in on-call rotation and incident-resolution.
  • Build deep, full-stack knowledge of our platforms and applications. 
  • Work to simplify and automate deployment processes, run-time operations, and provide non-disruptive releases.
  • Help create and maintain an environment that provides security and privacy for our customers' data.
  • Maintain application reliability and uptime SLAs throughout the application lifecycle using programmatic self-healing and software automation.
  • Travel occasionally to meet with the rest of Lightbend’s technical team.

Candidates can be based in Australia, New Zealand or other countries in these time zones, as this is a fully remote position. This is not a full-time firefighting role requiring super heroes. Site reliability is the entire team’s responsibility. We are looking for an operations expert to be a part of building and running our new offerings as we expand our platform.

Qualifications:

You

  • Are an SRE who understands how to operate modern distributed data systems on Kubernetes to be extremely reliable with predictable performance.
  • Have experience with (multiple) cloud service offerings, specifically from an operational perspective (we operate on Google Cloud and AWS today).
  • Have a passion for automating the complexities of orchestrating and running multi-tenant cloud application services.
  • Are accustomed to collaborating with business owners and understanding diverse business requirements.
  • Have two or more years of experience in distributed systems architecture and runtime requirements.
  • Are a voracious learner, ready to take on new technologies and techniques quickly and constantly.
  • Have excellent written and verbal communication skills in at least English.
  • Are skillful at interacting and working with people; working with a self-organized lean and agile team to mitigate project risks, manage effort and ensure quality.
  • Are dedicated to best practices such as infrastructure as code, automated testing, code reviews, CI/CD, GitOps, and testing.
  • Are biased towards action on tough problems and issues, and focused on your customer’s success.
  • Are an agent of change, constantly learning and seeking better outcomes.
  • Are familiar with many of the supporting technologies we use, including Terraform, Crossplane, FluxCD GitOps, Prometheus, Grafana, Actors, Service Mesh frameworks, etc.
  • Are experienced with complex and secure networking environments, including Encryption Keys, and TLS.

Ideally, you also...

  • Have knowledge of the Lightbend technologies and distributed systems, including Akka clustering.
  • Have supported SaaS/PaaS systems.
  • Have an awareness of Serverless/Functions-as-a-service Platforms.

What we offer:

Lightbend is a welcoming, transparent, and highly distributed company dedicated to creating high-performance systems that bring success to all who use them.  With a strong focus on work-life balance, our company offers a fast-paced, collaborative environment mixed with challenging and engaging work. This combination has attracted and retained some of the brightest minds in our technology communities.

Lightbend is an Equal Opportunity Employer.

Share

Apply for this position

Required*
Apply with Indeed
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Human Check*