Observability Engineer - Dev Ops - Staked

Kraken Digital Asset Exchange

Posted 2 months ago

About Kraken

As one of the largest and most trusted digital asset platforms globally, we are empowering people to experience the life-changing potential of crypto. Trusted by over 8 million consumer and pro traders, institutions, and authorities worldwide - our unique combination of products, services, and global expertise is helping tip the scales towards mass crypto adoption. But we’re only just getting started. We want to be pioneers in crypto and add value to the everyday lives of billions. Now is not the time to sit on the sidelines. Join us to bring crypto to the world.

To ensure Kraken is the right fit for you, please ensure you read Kraken Culture Explained to find out more about us!

Kraken is looking for an experienced observability engineer to build and maintain the Staking monitoring infrastructure. This is an opportunity tomonitor, observe, and operate very diverse application set in crypto. You will have exposure to a wide range of Blockchains. We’re a diverse group of engineers dedicated to making cryptocurrency available and accessible to the world and enabling people from all walks of life to invest in their independence. The Kraken experience needs to be ambitious, simple, and user-centered; come and help us make that happen!

About the Role

To be successful in this role, you will need to be responsible for maintaining and improving the Staking infrastructure's observability. The job requires executing work, documenting work, and influencing others across the team on best practices.

What you will do:

  • Contribute to the implementation of the refactor/evolution of observability data acquisition
  • Implement “next generation” observability/incident management User Interface in yet to be decided platform and implement alerting/annunciation rules.
  • Contribute to the integration of a new observability tools/platforms from the broader Kraken observability apparatus.
  • Establish technically detailed triage/escalation/remediation procedures and tooling to automate it.
  • Enable a high level of visibility into the state of services and infrastructure
  • Be familiar with risks introduced to the organization by third parties and processes to mitigate these;
  • Take a risk-based approach to all facets of information security;
  • Have a "finger on the pulse" of current challenges and different methods to monitor nodes and applications' health

Qualifications:

  • Optional: relevant and well-regarded certifications in cloud computing such as CKA (Certified Kubernetes Administrator), AWS Professional or Specialty levels, Google Professional level;
  • Scripting Experience: Python or Go (Preferred)
  • Experience with monitoring and alerting systems
  • Experience with Sumo Logic, Splunk, PagerDuty, or Datadog is a plus
  • Optional: Experience with HashiCorp product lines, Jenkins, Helmfile

Location Tagging: #Canada #USA #Li-Remote

We’re powered by people from around the world with their own unique and diverse experiences. We value all Krakenites and their talents, contributions, and perspectives, regardless of their background. We encourage you to apply for roles where you don't fully meet the listed requirements, especially if you're passionate or knowledgable about crypto!

As an equal opportunity employer we don’t tolerate discrimination or harassment of any kind. Whether that’s based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws.

Stay in the know

Kraken Culture Explained

Follow us on Twitter

Catch up on our blog

Follow us on LinkedIn

Our Working Week

We work 5 days per week but are also hiring for part time roles.

Our Remote Working Policy

We work fully remotely

Expect to be working with

Report incorrect data

Let us know if the job has expired