A
Posted 3 days ago
Sr. Observability Platform Engineer
American Express Global Business Travel
Requirements
8+ years platform engineering or DevOps experience, 6+ years expertise with enterprise observability platforms, Deep understanding of monitoring architectures and logging strategies, Experience with APM, RUM, and distributed tracing, Proficiency in Python, Go, or Bash, Experience with cloud platforms (AWS, Azure, or GCP), Expertise in Docker and Kubernetes, Experience with Infrastructure as Code (Terraform, Ansible)
Skills
ElasticsearchDatadogNew RelicKubernetesTerraformPythonAWS
About the role
Responsibilities
- Architect and lead the design of comprehensive observability platforms using ELK Stack, New Relic, Datadog, and other emerging technologies
- Establish observability standards, best practices, and governance frameworks across the global organization
- Mentor and guide junior engineers and platform teams in observability implementation and optimization
- Develop advanced monitoring strategies, custom dashboards, and intelligent alerting frameworks
- Lead cross-functional initiatives to instrument applications and infrastructure for end-to-end visibility
- Optimize observability infrastructure for scalability, cost-efficiency, and performance at enterprise scale
- Design and implement automated remediation workflows and self-healing capabilities
- Lead incident response efforts and drive post-incident analysis and continuous improvement
Requirements
- 8+ years of experience in platform engineering, DevOps, or systems engineering
- 6+ years of hands-on expertise with enterprise observability platforms (ELK Stack, New Relic, Datadog, etc.)
- Deep understanding of monitoring architectures, logging strategies, metrics collection, and distributed tracing
- Extensive experience with Application Performance Monitoring (APM) and Real User Monitoring (RUM)
- Advanced proficiency in scripting and programming languages such as Python, Go, or Bash
- Extensive experience with cloud platforms (AWS, Azure, or GCP) and multi-cloud environments
- Strong expertise in containerization and orchestration using Docker and Kubernetes
- Proven experience designing and implementing Infrastructure as Code (Terraform, CloudFormation, or Ansible)
- Demonstrated leadership and mentoring capabilities
Preferred Qualifications
- Experience architecting observability solutions for large-scale, distributed systems
- Knowledge of OpenTelemetry or other advanced instrumentation technologies
- Experience with machine learning-based anomaly detection and intelligent alerting
- Familiarity with chaos engineering and resilience testing
- Relevant cloud or vendor-specific observability certifications
Benefits
- Flexible benefits including health and welfare insurance, retirement programs, and parental leave
- Travel perks including weekly deals on flights, hotels, cruises, and car rentals
- Access to over 20,000 courses on a dedicated learning platform
- Inclusive culture with global INclusion Groups
About the Company
American Express Global Business Travel (Amex GBT) is a leading global business travel management company. We provide travel solutions to companies, organizations, and government agencies, helping our colleagues and clients achieve success through an inclusive and collaborative culture.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSr. Observability Platform Engineer
American Express Global Business Travel · Mexico City
