Site Reliability Engineer, AVP

Charles River is an established yet growing global fintech company. This is a new opportunity to join our Melbourne office as a Site Reliability Engineer and be part of a global team. As an experienced DevOps Engineer, you will be responsible for designing, implementing and supporting SRE practices and technologies that aid the scalability, performance and availability of Charles River's global Software as a Service (SaaS) platform. You will drive the transformation of monitoring tools, mentor others and share knowledge. You will collaborate with Tech Services, Application Management, Product and Engineering to enable a rapid feedback loop supporting quality and service delivery.

The successful candidate will:
  • Monitor and report on the availability and performance of our SaaS platform
  • Drive capacity planning through statistical analysis of log and monitoring data
  • Design and deploy full stack application monitoring and log analysis platforms
  • Support rapid and accurate root cause analysis through log and monitoring data analysis
  • Automate provisioning and management of monitoring platforms, collectors, servers, databases and configuration data
  • Assist in developing and tuning VMware design
  • Enhance incident management practices
  • Ensure interoperability with our configuration management architecture
  • Utilize version control and test automation technologies to ensure reliability and availability of provisioning automation

You will have the following qualifications and experience:
  • Bachelor's degree in Computer Science or IT
  • 5+ years in SaaS SRE, support, system administration, infrastructure management and automation
  • Understanding of OSI and TCP/IP stacks
  • Experience with full stack and cloud monitoring solutions such as Dynatrace, AppDynamics, SolarWinds
  • Experience with log analysis platforms such as Splunk, Elasticsearch, Logstash, Kibana, Sumo Logic
  • Experience with the Microsoft stack including Windows Server, SQL Server, PowerShell, and Active Directory
  • Experience tuning and supporting production Java (JRE) server applications
  • Excellent "soft skills" including interpersonal, verbal and written communication, customer service, teamwork, multitasking, organization, attention to detail, self-motivation, eagerness to learn and calmness under pressure
  • Experience with VMware
  • Familiarity with the public cloud platforms AWS and Azure

Ideal competencies and experience:
  • Familiarity with Ansible, Rundeck, and Git
  • Performance tuning of Java server applications running on VMware
  • Experience building and running a NOC and SOC
  • Experience with any major workload automation platform
  • Financial services/trading system experience is a plus