We are seeking an experienced, self-driven Senior Site Reliability Engineer to join our
global engineering team, based in the Malaysia/Singapore region. This position is critical
to our "follow the sun" support model, providing dedicated SRE coverage during Asian
business hours.
As a Senior SRE, you will be responsible for maintaining and improving our
infrastructure, ensuring system reliability, and improving automation to enhance
operational efficiency. This role requires a high degree of autonomy and problem-solving abilities, as you will often be working when the UK team is offline.
Pagrindinės pareigos
- Maintain robust, scalable infrastructure on Google Cloud Platform
- Manage and optimize Kubernetes clusters for performance and reliability
- Maintain secure TCP level networking, including VPN setups
- Develop and enhance monitoring, alerting, and observability solutions
- Automate manual processes to improve efficiency and reduce human error
- Perform capacity planning and optimization of cloud resources
- Collaborate with the UK team to improve deployment processes and application
reliability
- Document systems, processes, and incident responses
- Contribute to architectural discussions and technical decision-making
Required skills
- 5+ years of experience in SRE, DevOps, or similar roles
- Strong experience with Google Cloud Platform services and infrastructure
valdymas
- Extensive knowledge of Kubernetes, including deployment, scaling, and
troubleshooting
- Proficiency in Linux/Unix systems administration
- Experience with infrastructure as code tools (e.g., Terraform, Pulumi)
- Solid understanding of networking concepts and practical implementation of VPNs
- Experience with monitoring and observability tools
- Strong programming knowledge (bonus if Golang)
- Excellent problem-solving abilities and troubleshooting skills
- Ability to work independently with minimal supervision
- Strong written and verbal communication skills in English
- Experience with CI/CD pipelines and tools
Preferred skills
- Experience with Golang programming
- Knowledge of ISO8583 messaging standard
- Deep understanding of TCP networking concepts
- Prior experience in financial services, particularly in acquiring/payment processing
- Knowledge of security best practices for cloud environments
Work environment
- Full-time position based in Malaysia/Singapore region
- Primary coverage of APAC business hours, with some flexibility required
- Fully remote role
- Occasional overlap with UK team hours for knowledge sharing and collaboration
- Autonomous work environment with opportunities to drive improvements