
Senior Site Reliability Engineer
Loft Orbital Solutions
Senior Site Reliability Engineer
Posted 8 days ago
Employment Type
Full Time
Location
Abu Dhabi
Requirements
GCP/cloud, Kubernetes, CI/CD, DevOps, Networking, Observability, Python/Go/Java, SDN
Job Description
Responsibilities
- Collaborate with developers, test engineers and satellite operators to foster a strong SatDevOps culture.
- Design and roll-out cloud solutions for testing and operations infrastructure; balance trade-offs between existing and additional cloud resources to scale.
- Design, implement, and maintain scalable, reliable, and secure infrastructure in a hybrid cloud environment.
- Enhance developer and test engineer experience by building better tools, streamlined workflows, and improved environments.
- Lead automation and optimization initiatives, including CI/CD pipelines, infrastructure provisioning (IaC), and deployment workflows for test and space operations.
- Own and evolve the observability stack (metrics, tracing, logs), ideally within a Grafana-centric ecosystem.
- Implement and advocate best practices in software reliability, fault tolerance, and performance tuning.
- Identify, investigate, and resolve system reliability issues, perform root cause analysis, and implement long-term fixes.
- Partner with other teams to design and operate Software Defined Network (SDN) solutions.
- Foster a collaborative and inclusive team culture of respectful debate and continuous learning.
- Handle and manage the link between cloud and network/software/hardware infrastructure; assume Information & Technology (I&T) responsibilities as necessary at the start.
Requirements
- Strong experience with public cloud infrastructure (ideally GCP)
- Deep expertise in Kubernetes (architecture, deployment, ops, resource optimization)
- Proven ability to design and build scalable, highly available systems
- Familiarity with Software Defined Networking (SDN) concepts and tools
- Experience implementing and maintaining observability stacks (Grafana, Prometheus, Loki, etc.)
- Proficient in at least one backend language: Go, Python, Rust, C/C++, or Java
- Deep hands-on DevOps experience: CI/CD, infrastructure as code, automation
- Track record in fast-paced, high-growth technical environments
- Strong networking knowledge (TCP/IP, DNS, routing, switching, firewalls, VPNs, security)
- Deep systems administration experience
- Excellent problem-solving skills, operates independently, proactive/results-driven
- Strong communication; thrives on multicultural, cross-functional teams
Preferred Qualifications
- Hands-on experience with GitOps frameworks (ArgoCD, FluxCD)
- Interest or experience in FinOps and cost-optimized architectures
- Understanding of orchestration in resource-constrained/space systems
- Knowledge of Terraform, Ansible, or similar IaC frameworks
- Experience with systems engineering toolsets and SDLC governance
- Cybersecurity awareness
- Familiarity with security practices (vulnerability scanning, threat detection, risk mitigation)
Benefits
About the Company
Orbitworks—a joint venture between Marlan Space (UAE) and Loft Orbital—is revolutionizing space access with reliable, shareable satellites that reduce the time and complexity required to get to orbit. Orbitworks operates satellites, flies customer payloads, and manages missions end-to-end. The company, based in Abu Dhabi, will be the UAE’s first commercial firm mass-manufacturing satellites, operating from a 50,000 sq. ft. facility, aiming to produce dozens of satellites per year.