Site Reliability Engineer (Mandarin Speaking)

Full Time, onsite
Lintasarta
Area DKI Jakarta, Indonesia

Salary undisclosed

Checking job availability...

Original

Simplified

Lintasarta is a leading Indonesian telecommunications and IT solutions provider, specializing in business-to-business (B2B) services. Founded in 1988, the company offers a wide range of services, including data communication, networking, cloud computing, and managed services. Lintasarta primarily focuses on providing end-to-end solutions for enterprises, government agencies, and other organizations, helping them with their digital transformation needs.

Job Summary:

We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) who is fluent in Mandarin to join our dynamic team. As an SRE, you will be responsible for maintaining the reliability, scalability, and performance of our critical infrastructure and applications. You will collaborate with software engineers and operations teams to design, build, and operate systems that are robust, automated, and highly available. This role requires strong problem-solving skills, expertise in cloud computing, and fluency in Mandarin to facilitate communication with global teams.

Key Responsibilities:

Ensure the reliability, availability, and performance of production systems.
Design and implement scalable and automated solutions for system monitoring, alerting, and incident management.
Collaborate with development teams to improve application performance and stability.
Conduct root cause analysis and post-mortems for incidents, ensuring continuous improvement.
Manage cloud infrastructure (AWS, GCP, or Azure) and optimize resources for cost-effectiveness.
Develop and maintain CI/CD pipelines for seamless deployment.
Implement security best practices and compliance measures across infrastructure.
Provide on-call support and participate in a rotation to address critical system issues.
Communicate effectively in Mandarin and English with internal and external stakeholders.

Requirements:

Bachelor's degree in Computer Science, Engineering, or a related field.
Fluency in Mandarin and English (both written and verbal) is required.
3+ years of experience in Site Reliability Engineering, DevOps, or System Administration.
Proficiency in cloud platforms such as AWS, Google Cloud, or Azure.
Strong scripting and automation skills (Python, Bash, Terraform, Ansible, etc.).
Experience with containerization technologies (Docker, Kubernetes).
Hands-on experience with monitoring tools (Prometheus, Grafana, ELK stack, etc.).
Knowledge of networking concepts, security best practices, and Linux system administration.
Familiarity with distributed systems, microservices, and database administration.
Ability to troubleshoot and resolve complex system issues efficiently.
Strong communication and collaboration skills to work across global teams.

Preferred Qualifications:

Experience in working with international teams, particularly in China.
Certifications in cloud technologies (AWS Certified DevOps Engineer, Google SRE, etc.).
Experience with chaos engineering and disaster recovery strategies.
Understanding of Agile and DevOps methodologies.

Job Summary:

Key Responsibilities:

Ensure the reliability, availability, and performance of production systems.
Design and implement scalable and automated solutions for system monitoring, alerting, and incident management.
Collaborate with development teams to improve application performance and stability.
Conduct root cause analysis and post-mortems for incidents, ensuring continuous improvement.
Manage cloud infrastructure (AWS, GCP, or Azure) and optimize resources for cost-effectiveness.
Develop and maintain CI/CD pipelines for seamless deployment.
Implement security best practices and compliance measures across infrastructure.
Provide on-call support and participate in a rotation to address critical system issues.
Communicate effectively in Mandarin and English with internal and external stakeholders.

Requirements:

Bachelor's degree in Computer Science, Engineering, or a related field.
Fluency in Mandarin and English (both written and verbal) is required.
3+ years of experience in Site Reliability Engineering, DevOps, or System Administration.
Proficiency in cloud platforms such as AWS, Google Cloud, or Azure.
Strong scripting and automation skills (Python, Bash, Terraform, Ansible, etc.).
Experience with containerization technologies (Docker, Kubernetes).
Hands-on experience with monitoring tools (Prometheus, Grafana, ELK stack, etc.).
Knowledge of networking concepts, security best practices, and Linux system administration.
Familiarity with distributed systems, microservices, and database administration.
Ability to troubleshoot and resolve complex system issues efficiently.
Strong communication and collaboration skills to work across global teams.

Preferred Qualifications:

Experience in working with international teams, particularly in China.
Certifications in cloud technologies (AWS Certified DevOps Engineer, Google SRE, etc.).
Experience with chaos engineering and disaster recovery strategies.
Understanding of Agile and DevOps methodologies.