SRE Specialist – Monitoring & Log Management (#65)
Tokyo
Full time Permanent
Insurance
Job description
We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) with expertise in Dynatrace and Splunk monitoring
The ideal candidate will have experience in implementing and managing monitoring and log management solutions in AWS cloud environments.
Roles & Responsibilities:
- Install, configure, and maintain monitoring solutions for applications and infrastructure
- Implement and manage Dynatrace active gates/OneAgent
- Install and configure monitoring solutions for containers and pods (e.g., AKS, OpenShift, AWS)
- Provide integration solutions for Dynatrace with third-party tools
- Utilize technology to diagnose issues and ensure overall platform health and functionality
- Troubleshoot complex issues related to monitoring and log management
- Develop and implement scripting and automation for monitoring and log management tasks
- Collaborate with vendors to address issues requiring vendor engagement
- Administer Dynatrace and Splunk tools, including dashboards, alerts, and integrations
- Experience working with both Windows and Unix operating systems
- Proficiency in Bash scripting and PowerShell scripting
Qualifications:
- Bachelor’s degree in Computer Science, Information Technology, or related field
- Minimum of 3 years of experience in IT operations, monitoring, and log analysis
- Deep knowledge of Dynatrace and Splunk services and architecture
- Experience with AWS cloud environment
- Strong understanding of containerization technologies (e.g., Docker, Kubernetes)
- Excellent troubleshooting and problem-solving skills
- Ability to work independently and as part of a team
- Strong communication skills in both Japanese and English
Note: Applicant must be located in Japan and bilingual (Japanese and English)
Language requirement
Japanese (Business),
English (Business)
Working hours
9:00-18:00
Back to jobs