SRE Engineer – Junior/Senior (Ascend Money)

About the role

We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to join our growing team. As a Senior SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our production systems. You will leverage your deep understanding of infrastructure, automation, and observability to champion operational excellence and build a resilient platform.

Description

Platform Operations: Manage and operate our Kubernetes platform, ensuring high availability, performance, and security.
Automation & Tooling: Design, develop, and implement automation solutions for operational tasks, infrastructure provisioning, and application deployment.
Observability: Build and maintain a comprehensive observability stack (monitoring, logging,tracing) to proactively identify and resolve issues.
Platform Stability & Performance: Implement and maintain proactive measures to ensure platform stability, performance optimization, and capacity planning.
Middleware Expertise: Provide support and expertise for critical middleware tools such as RabbitMQ, Redis, and Kafka, ensuring their optimal performance and reliability.
Incident Response: Participate in our on-call rotation, troubleshoot and resolve production incidents efficiently, and implement preventative measures.
Collaboration: Collaborate effectively with development and other engineering teams.

Qualifications

Positive attitude and empathy for others.
Passion for developing and maintaining reliable, scalable infrastructure.
A minimum of 3 years working experience in relevant areas.
Experience in managing and operating Kubernetes in a production environment.
Experienced with cloud platforms like AWS or GCP.
Experienced with high availability, high-scale, and performance systems.
Understanding of cloud-native architectures.
Experienced with DevSecOps practices.
Strong scripting and automation skills using languages like Python, Bash, or Go.
Proven experience in building and maintaining CI/CD pipelines (e.g., Jenkins, GitLab CI).
Deep understanding of monitoring, logging, and tracing tools and techniques.
Experience with infrastructure-as-code tools (e.g., Terraform, Ansible).
Strong understanding of Linux systems administration and networking concepts.
Experience working with middleware technologies like RabbitMQ, Redis, and Kafka.
Excellent problem-solving and troubleshooting skills.
Excellent communication and collaboration skills.
Strong interest and ability to learn any new technical topic.

Preferred Skills:

Experience with container security best practices.
Experience with chaos engineering principles and practices.

Bonus Points and Opportunity:

Experience in the Financial Services industry.
Opportunity to tackle challenging projects in a dynamic environment.

BACK

JOB OPENINGS

INTERNSHIP

SRE Engineer – Junior/Senior (Ascend Money)

Home

About

Corporate

OUR BUSINESS

CAREER

AROUND ASCEND

PRIVACY POLICY

COOKIE POLICY

WHISTLEBLOWING