Job Description
Job Description
Job Description
***Google Cloud Platform and Splunk Observability Cloud required***
Work Location Options: Onsite from day 1
- Phoenix, AZ - onsite
- Charlotte, NC - onsite
Responsibilities:
- Design modify develop write and implement software solutions for our Continuous Integration and Continuous Deployment processes
- Create and maintain automation scripts developed in Python PowerShell Shell C or other scripting languages Manage cloudbased Infrastructure using Infrastructure as Code toolstechnologies such as Terraform Azure ARM templates and GCP deployment templates
- Create and maintain Continuous Integration Automation with platforms such as Azure DevOps Jenkins and Gitlab CI Work closely with IT team members
- stakeholders and agile teams Assist in defining quality guidelines and standards for DevOps practices Work closely with Architects to ensure all stakeholders are aligned with the organization
- DevOps strategy Perform ongoing routine DevOps maintenance tasks Collaborate with Security teams and implement DevSecOps practices Manage and support Azure and GCP work loads Driven approach to continually improving service levels Build and manage systems infrastructure and applications through automation
- Deploy support and monitor new and existing services platforms and application stacks Engage in improving the whole lifecycle of services from inception through deployment operations and refinement Provide handson technical expertise during service impacting events
- Collaborate with other engineers on code reviews internal infrastructure improvements and process enhancements Use scalability testing to measure tune and optimize system performance
- Introduce enterprise capabilities tools and innovation improving availability in a multicloud ecosystem by evolving observability monitoring logging CICD integration continuous testing performance smoke regression functional chaos
- introduce continuous improvement standardizationautomation capabilities to conduct destructive and resiliency testing
- Automate key SRE metrics and IT Service Operations processes including customer impact availability of critical business flows, SLOSLI adherence, error budget, automate incident process for IT Service Operations through data integrating with unified communications ingnotification systems
- Participate in periodic 24x7 oncall duties
- Share support responsibilities for critical applications and customer journeys onboarded to SRE including remediation of issues through Agile conduct blameless postmortems root cause analysis and introduce continuous improvement solving problems once and for all with the goal of no repeats
Required:
- Google Cloud Platform
- Splunk Obervability Cloud
- Experience with one or more Cloud Platforms Azure GCP or AWS
- Experience with Container technologies: Kubernetes, Docker, PKS
- Experience in managing cloud or hybrid infrastructure
- Experience setting up monitoring in applications and database
- 5 years of systems support analysis experience demonstrated through work or military experience
- 5 years of experience with one or more Agile tools used for tracking user stories or backlogs such as JIRA
- Excellent verbal written and interpersonal communication skills
Desired:
- Experience with Observability Monitoring technologies like Splunk, Dynatrace, DataDog, Elastic Stack,ELK, Grafana, Prometheus, CloudWatch, AZURE Monitoring
- Ability to interact with all levels of an organization including management
- Strong team or technical leadership experience
- Strong verbal written and interpersonal communication skills
- 7 years of experience with Cloud technologies
- Incident Management System experience
- Configuration Management Tools experience
- Experience with Agile Scrum Daily Standup, Sprint Planning, and Sprint Retrospective meetings and Kanban
Skills
Required Skills : Google Cloud Platform (GCP),Cloud,Splunk
Additional Skills : Cloud Engineer,Analyst,Network EngineerThis is a high PRIORITY requisition. This is a PROACTIVE requisition
Job Tags