Posted 1 week ago
Only considering candidates eligible to work in Toronto, Canada ⚠️
At Sensei Labs, we envision a future where people genuinely enjoy their work, freed from the limitations of outdated tools, processes, and office constraints.
Our team utilizes modern, award-winning digital tools to orchestrate work and foster collaboration. We thoughtfully select our work orchestration stack to empower all team members to work in the most productive and efficient way possible.
Four things every Sensei LOVES:
Four Day Week (4DW). Every Sensei enjoys a highly productive four-day work week with Fridays off—without the expectation of working ten-hour days. We focus on improving productivity so we can achieve our work in four days instead of five. We believe this fosters a culture where work-life balance and mental wellness are priorities. Our team loves it, and we hope you will too!
Work from Anywhere (WFA) policy. Our Senseis can work from anywhere in the world as long as they have a laptop and a reliable internet connection. We're proud to break the stereotype that work only gets done when people show up at an office.
Living our SENSEI values. Everything we do is guided by our SENSEI values: Selfless, Empathetic, Nimble, Skilled, Entrepreneurial, and Integrity. These values inform every aspect of our work—from this job description to hiring and promotions.
"No Email" policy. At Sensei Labs, we rely on our proprietary Conductor platform and Microsoft Teams for communication. We’ve eliminated unnecessary internal emails.
Sensei Labs is seeking a Senior Engineer experienced in multi-cloud architecture and development. Our custom SaaS platform, Conductor, helps teams at fast-growing companies evolve and execute faster. You will play a key role in bridging software development, IT operations, and artificial intelligence. You'll design, build, and maintain the infrastructure and tools for the efficient deployment of AI-infused applications, supporting continuous integration and continuous delivery (CI/CD) into production. Join us to create more efficient, happier teams! Your focus will be on DevOps, performance, and high-availability solutions.
What you'll do
- Architect and create functionality to support the platform, such as automated video transcoding or improvements to our deployment process.
- Develop and maintain CI/CD pipelines for AI and machine learning model deployments.
- Research and implement new tools and frameworks to improve the speed, quality, and processes of AI model development and deployment.
- Ensure security best practices in the deployment and management of AI models and data.
- Create and build on our services layer.
- Advocate for new or improved technologies to make the system more efficient.
- Improve tooling to measure, optimize, and scale our cloud infrastructure.
- Perform standard administration tasks, including patching, monitoring, and implementing network security best practices.
- Address scaling issues by diagnosing them, then designing and implementing solutions.
- Collaborate with peers to write, review, or provide feedback on technical design proposals.
- Contribute to internal tools that help us improve our development process, manage our users, and improve the scalability of our systems.
- Improve processes to help the team work more efficiently.
- Ensure that the Sensei OS systems are secured following established best practices.
- Regularly plan and exercise service failure and disaster recovery scenarios.
- Mentor other members of the team as they grow.
- Participate in rotation-based on-call support.
What we're looking for
- You have 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer.
- You have extensive experience with Azure. AWS experience is also an asset.
- You're experienced with scrum sprints, rapid iteration, and continuous delivery.
- You're motivated, self-directed, a good communicator, and have strong telecommuting skills (many of our team members work remotely at least some of the time).
- You have 5+ years of experience with Kubernetes, Helm, CI/CD, GitHub Actions, monitoring tools such as Prometheus/Grafana, and APM tools like Datadog.
- You can work autonomously to make the right decisions for the business.
- You are a strong communicator. Explaining complex technical concepts to designers, support, and other engineers in plain language is no problem for you.
- You have a good knowledge of performance, scalability, availability, and security standards for the web.
- Cloud Administration experience including:
  - Cloud Infrastructure Setup and Maintenance
  - Security Management
  - Performance Monitoring
  - Cost Management and Optimization
  - Backup and Disaster Recovery
  - Compliance and Audits
- Code Quality Tools.
- Microservices Scalability experience including:
  - Best Practices and Standards
  - Automation
  - Monitoring and Performance Tuning
  - Capacity Planning
  - System Reliability and Scaling
- Kubernetes Administration experience including:
  - Cluster Management
  - Upgrades and Patching
  - Resource Allocation and Optimization
- Site Reliability experience measuring and monitoring availability, latency, and overall service health, and driving incident management and post-mortem analysis.
What will set you apart
- You have a history of open source contributions and helping the broader software engineering community (through GitHub, Stack Overflow, a blog, or the like).
- You are excited about automation and about implementing high-performance code that can handle large loads.
- Academic background in computer science (BSc or MSc).
- Advanced understanding of SQL.
- Experience with Redis (or other NoSQL databases).
- Experience in GitOps/MLOps/ArgoCD.
- Prior experience with or knowledge of large-scale, high-volume systems.
- Azure certifications: "MCSA: Cloud Platform" or "MCSE: Cloud Platform and Infrastructure".
About us
At Sensei Labs, we continue to build an amazing, diverse team and an inclusive culture. Our competitive advantage is rooted in our team members’ unique perspectives and experiences. We encourage you to apply even if you don't have all the qualifications listed but want to bring new ideas and perspectives to augment our team.
We’re very proud of our team’s achievements and the culture we’ve built together. If you’d like to hear more from our team, you can check out our Careers Page and our reviews on Glassdoor (4.9 ⭐ across 50+ reviews!). You’ll see quotes such as “Fantastic team, great leadership”, “Great culture, very diverse, excellent work-life balance”, and “Amazing team!”
We're committed to ensuring equal access to employment opportunities for all qualified candidates, including candidates of color, women, 2SLGBTQI+ candidates, candidates with family caregiving responsibilities, Indigenous candidates, immigrant candidates, and differently abled candidates. If you require accommodation during the application or interview process, please let us know and we’ll work with you to ensure you have a positive experience.