The candidate will Deliver API - Platform Engineering by leading solution initiatives. Recommend and re-engineer solutions to address complex incidents/issues. Advanced troubleshooting, root-cause analysis, and recommending solutions/fixing bugs. Identify toil and recommend/develop automation solutions. Understand business requirements and provide solutions through the self-service platform. Participate in piloting new products.
Qualifications:
Strong experience with Kafka, including experience in building and operating solutions for high-scale distributed systems.
7+ years of technical leadership helping engineering and operations teams thrive.
Prior experience with enabling “Observability” using tools for Distributed tracing, Event logging, APM Synthetic monitoring.
Understanding of SRE Practices
Experience in Automation.
Experience in building self-service platforms
Prior experience with web services and messaging protocols.
Prior experience with Infrastructure as Code with Ansible and Terraform, OpenShift/Kubernetes.
Prior experience with public cloud providers (Azure and AWS).
Ability to collaborate with teams and impact decisions at the interpersonal level.
Nice to have:
Experience in RabbitMQ and Tibco Messaging tools.
Experience in coding using Python/Angular/Java.
Understanding of SDLC, SAFe terminologies, Agile development methodology.
Experience in CI/CD tools such as Jenkins, Git.
Information
Locations Santa Ana, CAPosition Open to Anywhere in the US, but will work on-siteIndustry Information TechnologyStatus OpenJob Age 3 Day'sCreated Date 05/02/2025No.of Positions 1Duration 24Zip Code