HaiLa is building ultra-low power radio communications solutions that will eliminate the need for batteries in wireless communication devices by offering a product that is hyper power efficient that can run indefinitely from harvested energy. Our missionis to enable Ambient Power IoT with sensing everywhere on existing protocols such as Wi-Fi.
This is an exciting opportunity for a Senior Platform / DevOps Engineer to join our dynamic and diverse team! You will own HaiLa’s hybrid cloud and on-prem infrastructure end-to-end - spanning EDA tooling, Kubernetes, networking, security, and observability - and work closely with the VP of Engineering to keep our platform reliable, secure, and always evolving.
Based in Montreal, HaiLa is supported by leading sustainability-focused venture investors as well as Stanford University.
Description
HaiLa’s Engineers develop the next-generation wireless technologies dedicated to unlocking the vast potential of the Internet of Things. HaiLa’s chipsets focus on ultra-low power Wi-Fi backscatter connectivity.
As a Senior Platform / DevOps Engineer, you will own the full stack of infrastructure that HaiLa’s hardware and software teams depend on every day - from bare-metal RHEL/SLES servers and Slurm compute clusters to Kubernetes deployments in AWS, FortiGate-secured networking, and a comprehensive observability platform. You will drive IaC-first practices across all layers, ensure security posture compliance, and keep EDA tooling running smoothly for the chip design team.
We need engineers who are ownership-minded, automation-first, and comfortable being the institutional memory of a fast-moving startup. The ideal candidate writes runbooks the rest of the team can act on, treats config as code, and proactively closes gaps before they become incidents.
Responsibilities:
On-prem server & EDA infrastructure
- Lead IaC-first development across all infrastructure layers - network, server hosts, IDP, and cloud (GCP and AWS) - using Terraform, Ansible, and related tooling.
- Own the on-prem server infrastructure and VDI environment (RHEL and SLES hosts, virtualization, NFS); research and implement solutions to meet EDA team demand.
- Operate and optimize the Slurm cluster: monitoring, compute node configuration, and capacity expansion.
- Migrate the Kubernetes cluster and CI setup from GCP to AWS; deploy and maintain applications using virtual machines, Kubernetes, and GitOps.
- Maintain and integrate on-prem and cloud network connectivity (VLAN segmentation, FortiGate firewalls, Site-to-Site VPN, DNS, AWS SES) with a high security posture.
- Manage the remote endpoint fleet; administer Microsoft Intune and patch management; integrate security software to build a DLP solution; maintain security standards across all managed endpoints.
- Own JIRA IT project hygiene: sprint conventions, ticket standards, and workflow configuration.
- Coordinate with the engineering team on IT security policies and incident response; maintain observability across the platform using Grafana, VictoriaMetrics, VictoriaLogs, FlexLM/license exporters, and alert rules.
- Support and maintain TeamCity server and agent deployments; manage the GitHub organization and GitHub Actions runner deployments.
Minimum qualifications:
- 4+ years in infrastructure, SRE, or DevOps roles.
- Strong Linux sysadmin skills on enterprise distros (SLES and RHEL/Rocky), including Lmod, NFS at scale, kernel/driver work, and GPU passthrough.
- Hands-on AWS or GCP cloud experience; Ansible (roles, group_vars) and Terraform (multi-stack, remote state).
- Kubernetes + GitOps (ArgoCD or Flux), Helm chart authoring, and Docker.
- Networking fundamentals: routing, VLANs, VPN/IPsec, 802.1X/RADIUS, PKI; hands-on with an enterprise firewall (FortiGate preferred).
- Observability stack experience: Prometheus-compatible TSDB (VictoriaMetrics a strong plus), Grafana, and log aggregation.
- Scripting in Python and Bash; ability to read Salt and Jinja templates; secrets discipline (1Password / Vault) with a firm no-shortcuts-on-safety-checks policy.
Preferred qualifications:
- Prior exposure to EDA tooling (Cadence, Synopsys, Mentor) and FlexLM / RLM license management.
- Proxmox or other on-prem virtualization; Slurm or other HPC workload manager.
- Full Fortinet stack experience (FortiGate, FortiAnalyzer, FortiClient EMS, FortiAuthenticator) and CrowdStrike Falcon administration.
- TeamCity or comparable CI (Jenkins, GitLab CI, Buildkite); identity providers (Authentik / Keycloak, SAML/OIDC).
- Experience operating a colo or hybrid footprint (ISP evaluation, BGP, cross-connects).
Working style:
- Comfortable owning an on-call rotation spanning cloud, on-prem, and physical/network layers.
- Strong bias toward automation and IaC over click-ops; treats config as code with proper reviews.
- Writes runbooks and tickets the rest of the team can act on independently - you’re the institutional memory of the platform.
Why work for HaiLa
- Play a key role in bringing the breakthrough power efficient RF technology to market
- Be part of a solution that aims to remove 100’s of millions of batteries from landfills.
- Work with a lean and agile team of the best hardware and software engineers in the industry who are eager to share their expertise.
- Gain work experience with innovative high-tech start-up with a future-proof vision.
HaiLa is an equal opportunity employer. We work hard to provide an inclusive work place where everyone feels valued, safe, respected and empowered to grow. If this job description sounds like (or close to) you, we encourage you to apply today!
Job Types: Full-time, Permanent
Pay: $57,161.16-$149,895.17 per year
Benefits:
- Dental care
- Extended health care
- Life insurance
- On-site parking
- Vision care
Work Location: Hybrid remote in Montréal, QC H3B 1A7