Site Reliability Engineer
7 days ago
At Halter, we're building more than software - we're transforming the way the world farms. Our smart collars let farmers shift, monitor, and care for their cattle via deep integrations & insights. Behind it all is the Network Team, powering one of New Zealand's largest private IoT networks with 400,000+ connected devices and counting.
We're looking for a Site Reliability Engineer (SRE) to help scale our systems to a million animals and beyond. You'll apply cloud-scale NRE practices to a wildly distributed, rural IoT network across multiple countries.
Our vision is to become the OS for farming globally. This isn't your average backend gig - this one moos . You're not just writing code — you're ensuring availability for hundreds of thousands of animals and farmers who rely on Halter every single day. What you'll do
- Build & run observability for gateways, towers, and backend/edge services (metrics, logs, tracing, alerts; strong signal / low noise).
- Automate ops: golden configs, zero-touch provisioning, safe canaries/rollbacks, scheduled maintenance, and self-healing where sensible.
- Lead incidents end-to-end (runbooks, comms, mitigation, post-mortems) and drive fixes into code, configs, and process.
- Harden deploys: progressive rollouts for firmware/agent/service changes across thousands of devices and multi-region backends.
- Performance tuning: reduce command/telemetry latency, smooth OTA pipelines, and de-risk noisy/unreliable links with back-pressure & retries.
- Capacity & readiness: plan headroom for spikes and growth; chaos engineering for failover paths (cellular satellite, region failover).
- Own runbooks & SOPs that enable field teams and on-call to respond quickly and consistently.
- Partner with Network/RF engineers on coverage/capacity changes, interference hunts, and carrier/satellite escalations.
- Mentor teammates on SRE mindset, tools, and operational excellence.
- SRE/large-scale ops experience (cloud + distributed systems).
- Strong automation & scripting (Python/Go/etc.) and IaC (Terraform/Ansible/etc.).
- Solid networking fundamentals (TCP/IP, routing, VPNs, firewalls) + RF awareness (LoRa/LTE/sat a plus).
- Hands-on with observability stacks (Prometheus, Grafana, ELK, OpenTelemetry).
- Proven incident management for high-availability systems.
- Performance tuning for latency-sensitive, unreliable-link environments.
- Comfortable in Linux across cloud and edge devices.
- Data-driven: able to turn noisy telemetry into decisions (SQL or Jupyter a plus).
- Pragmatic problem-solver who balances reliability, speed, and cost.
- Bonus: IoT/off-grid/field deployments experience.
- Network awareness (baseline, not deep-dive) You don't need to be a routing/RF guru — we have those. You should be comfortable with:
- Basic L3 troubleshooting: ping/traceroute, IP/subnetting, DNS/DHCP/NAT basics, reading simple routes.
- Reading link health: interpreting RSSI/SNR (LoRa) or RSRP/SINR (LTE) at a high level; spotting "link looks bad vs service is bad."
- Backhaul pragmatics: understanding failover states (cellular satellite), cost/perf trade-offs, and safe config rollout patterns.
- Topology literacy: knowing what a gateway/tower/backhaul path looks like and where to put probes and alerts.
At Halter, we're committed to creating an environment where people thrive. We offer unlimited paid annual leave, as well as additional wellness days. Each year, every team member receives a $1,000 self-development budget to invest in whatever fuels their personal growth.
We offer six months of fully paid leave for primary caregivers and four weeks of fully paid leave for secondary caregivers, along with a range of additional family-friendly benefits. To support your wellbeing, we offer subsidised health insurance through Southern Cross.
And finally, everyone at Halter is an owner. Every employee is part of our stock ownership plan; when we succeed, you share in that success.
Our office-first approach
Being office-first is a core pillar of our culture. We believe in-person connections are key to driving your own growth, learning, impact, and building genuine long-lasting relationships.
We're office first, not office only. This means that working from the office every day is our default setting, but we flex when needed. Your growth, learning, and impact are truly unlimited here, and a big part of that comes from being together, solving problems, innovating, building context, and constantly learning from each other.
We have a state-of-the-art, dog-friendly office in the heart of Auckland City and a test farm in Morrinsville. Delicious snacks and drinks are readily available.
About Halter
At Halter, we're on a mission to enable farmers and graziers to run the most productive and sustainable operations. Our customers are using Halter to break free from the time-intensive constraints of conventional practices and revolutionizing grazing with Halter. People join Halter to do meaningful work. Our team out-think, out-work and out-care. We're committed to delivering real change in the world. We're backed to deliver on a mission that matters by Tier 1 investors including Bessemer Venture Partners, DCVC, Blackbird, Promus Ventures, Rocket Lab's Peter Beck and Icehouse Ventures.
Join our team
If this opportunity sounds like you, please apply below by sending your cover letter explaining why you're excited about this role and working at Halter, along with your CV.
If you think you have what it takes but don't necessarily meet every requirement on this job description, please still get in touch. We'd love to chat to see if you'll be an epic fit
Feel free to check out the careers page for more information on working at Halter and don't forget to follow us on LinkedIn & Instagram. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
-
Site Reliability Engineer
7 days ago
Auckland, Auckland, New Zealand Randstad New Zealand Full time US$1,000,000 - US$1,500,000 per yearSite Reliability Engineer - Gaming Industry - Infrastructure Lead - AWS Focus - ContractSite Reliability Engineering, Automation & Cloud Scaling - Competitive Rates, High-Impact Infrastructure ProjectAbout The RoleJoin a leading studio in the high-growth gaming industry, focused on building, operating, and scaling the reliable, high-performance systems that...
-
Site Reliability Engineer
3 days ago
Auckland, Auckland, New Zealand SG Consulting Limited Full timeRole OverviewThe Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of critical IT systems and applications. This role blends software engineering principles with operational excellence to build resilient systems, automate processes, and proactively manage incidents. The SRE will work closely with...
-
Site Reliability Engineer
1 week ago
Auckland, Auckland, New Zealand TechnologyOne Full time NZ$120,000 - NZ$180,000 per yearAt TechnologyOne, our mission is to create customer-centric, innovative ERP software that revolutionises businesses and simplifies everyday operations.As a Site Reliability Engineer, your focus will be on ensuring our SaaS platform is highly available, reliable, and performs at its best to deliver an outstanding customer experience. You'll work on building...
-
Site Reliability Engineer
1 week ago
Auckland, Auckland, New Zealand Workday Full time NZ$80,000 - NZ$120,000 per yearYour work days are brighter here.We're obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing people, money, and agents, we're shaping the future of work so teams can reach their potential and focus on what matters most. The minute you join, you'll feel...
-
Senior Site Reliability Engineer
7 days ago
Auckland, Auckland, New Zealand NZ Transport Agency Waka Kotahi Full time NZ$131,659 - NZ$146,288 per yearDo you have a passion for automation?Work in agile teams using the latest technologiesGitHub and GitHub ActionsIntegration, automation and Azure experience preferredFull time permanent position - Auckland or Wellington or Palmerston North locationTe Whiwhinga mahi | The opportunity The purpose of this role is to work alongside others to define the...
-
Principal Site Reliability Engineer
3 days ago
Auckland, Auckland, New Zealand ClearPoint Full timeA bit about us For over 16 years, ClearPoint has helped organisations succeed in a continually changing digital landscape. We are a trusted technology partner combining digital design, software engineering, data and insights, cloud and platforms, and consulting services to help organisations adapt to change. We pride ourselves on forming transformational...
-
Senior Site Reliability Engineer
1 day ago
Auckland, Auckland, New Zealand ASB Bank Full timeSenior SRE Engineer – DevTools SquadMō Mātou | About us:At ASB, we're more than a bank — we're a purpose-led organisation helping people, communities, and our environment thrive together. With a proud history and a clear vision to support New Zealanders in getting one step ahead, our work is guided by the values of Courage, Care, and Curiosity. We're...
-
Civil Reliability Engineer
1 week ago
Auckland, Auckland, New Zealand Auckland Airport Full time NZ$80,000 - NZ$120,000 per yearDescriptionAuckland Airport stands proudly as the gateway to Aotearoa, welcoming travellers beginning their journeys, farewelling Kiwis to new destinations, connecting businesses and workers to new opportunities, and celebrating partners and investors who back us along the way. We have ambitions to be a global hub and a comprehensive development programme...
-
Site Engineer
7 days ago
Auckland, Auckland, New Zealand Fulton Hogan Full timeJob DescriptionThe Fulton Hogan LifeLife at Fulton Hogan is about making the most of the opportunities, taking responsibility, having a crack, being accountable and making it happen. We live by our REAL values – Respect, Energy & Effort, Attitude, Leadership – and we demonstrate these through the good work we do, every day, as one team.Nau mai haere mai...
-
Reliability Technician
1 week ago
Auckland, Auckland, New Zealand Auckland One Rail Full time NZ$80,000 - NZ$120,000 per yearAs a Reliability Technician, you'll play a key role in identifying and resolving technical issues, improving maintenance strategies, and supporting long-term reliability initiatives. This role combines technical expertise with analytical thinking to drive continuous improvement across the fleet.Your responsibilities include:Investigate failures and conduct...