ArangoDB is the leading native multi-model NoSQL database, with more than 10 million downloads. It combines the power of graphs, with JSON documents and a key-value store. ArangoDB makes all of our clients data models accessible with a single declarative query language. Developers can build high-performance applications on distributed clusters. Oh, and did we mention it is open source?

We are looking for an experienced Site Reliability Engineer to advance the infrastructure and operations part of our managed cloud service, ArangoDB Oasis. Oasis provides fully hosted, managed and monitored cluster deployments of the ArangoDB database on all major cloud providers for our clients. You will be able to contribute significantly to this state-of-the-art service and take a leading role in shaping the strategy of hosting ArangoDB on multiple cloud providers.

Our headquarter is in San Francisco (US), our development hub is in Cologne (Germany) and our diverse team includes workmates at remote locations worldwide. Team Oasis is a remote team working from Central Europe, so remote candidates in this location & time zone are preferred.

About the Role

  • You will own the infrastructure and platform side of the Arango Oasis project
  • Designing and developing the cloud infrastructure on AWS, Google Cloud and Azure for reliability, efficiency, and scalability
  • You work closely with customers to solve their platform & infrastructure related issues
  • Working closely with the engineering side to automate a scalable deployment with Kubernetes
  • Contributing to the monitoring strategy and alerting processes
  • Developing custom monitoring tools and metrics dashboards, both for internal use and the client side
  • Testing the platform for resilience, running disaster recovery tests and implementing disaster response plans

Your Skills

  • You communicate pro-active and friendly with team members as well as customers
  • You have experience in building and maintaining large-scale distributed cloud infrastructures (AWS, Google Cloud, Azure)
  • Expertise in monitoring and analyzing distributed cloud systems, plus troubleshooting in production
  • Experience in deployment container tools for container Kubernetes, docker
  • You bring a love for automation to the table
  • Programming experience with Go, Rust, Java, Python, JavaScript is a plus, read-only programming experience with above languages is a must
  • Last but not least you have experience with databases, ideally including ArangoDB

The over 50 minds of ArangoDB come from 4 different continents and over a dozen countries. Diverse backgrounds enable us to see new solutions. We love this diversity and encourage everyone who is curious and visionary to join the multi-model movement.