Senior Site Relibility Engineer
Company
Guidewire
Location
Remote
Type
Full Time
Job Description
Want more jobs like this?
Get jobs that are Remote delivered to your inbox every week.
ESSENTIAL DUTIES AND RESPONSIBILITIES
- Collaborate with development and other SRE teams to enhance the reliability and efficiency of microservices applications.
- Engage with product development (PD) teams by participating in design reviews and production readiness checks.
- Collaborate with engineering teams, providing product feedback and where necessary contribute code to the product
- Work closely with cross-functional teams to ensure seamless integration of new features and services. https://aws.amazon.com/blogs/apn/the-6-pillars-of-the-aws-well-architected-framework/Â
- Analyze data from observability and monitoring tools to improve operational metrics of microservices as well as the entire platform.
- Leverage end-to-end technical expertise gained by engagement with multiple PD teams and analyzing observability data to propose improvements in code and design to improve SLO and prevent incidents.
- Create system documentation and training materials to empower and educate our fellow team members
- Take a purist SRE approach to shared multi-tenant infrastructure for a resilient SaaS microservice-based containerized systems in addition to customer-centric application environments
- Oversee and automate the team’s growing presence in AWS
- Creatively build and develop tooling to aid in driving 24x7x365 follow-the-sun operations of critical production systemsBuild and maintain observability tooling, metrics, and dashboarding for a global platform product infrastructureImprove our incident management lifecycle to identify, mitigate, and learn from reliability risks and issues
- Collaborate with engineering teams, providing product feedback and where necessary contribute code to the product
Education and Work Experience
- Bachelor’s Degree in Computer Science or related field
- Software engineering and task automation skills with Bash, Python, and/or Go are a mustExperience supporting web applications running on Java / Apache / Tomcat in a live production environmentFamiliarity with the Agile software development lifecycle
- Deep background with Linux systems and engineeringHighly experienced with engineering and automating on Amazon Web Services (AWS)
- Prior experience with IaC tools like Terraform/Terragrunt/TerraspacePrior experience with devops/gitops tools (Git, Bitbucket, Flux CD, Teamcity) for gate promotions
- Production-At-Scale support background in a heavily microservice-based worldHands-on engineering and ops expertise in containerization (Docker, Helm, Kubernetes/EKS, CNI and Ingress networking)
- Strong understanding of Single-Sign On, SAML, OAuth (Bonus if hands-on experience with Okta)Seasoned expertise around x.509 certificate technology and basic concepts of encryption
- Experience working with Relational Databases such as Aurora Postgres and/or Oracle RDSAdvanced exposure to application development, web UI (design and development), JSON, application architecture
- Experience strongly utilizing observability tools (logging/APM) like Datadog, CloudWatch, and PagerDuty.
- amiliarity with event store/stream-processing technologies like Kafka or AWS SQSUnderstanding of Open Application Model systems such as KubeVela or Crossplane
Personal Qualities and Soft Skills
- You greatly prefer writing code than clicking a GUI.
- You enjoy teaching, being a mentor to others, and working across boundariesOutstanding troubleshooting skills; ability to think critically and display an aptitude for problem solving
- Strong analytical mind with a penchant for process development and enhancement
- A highly positive can-do attitude with desire for being a team player
- Great communication skills and ability to explain complex technical concepts to a varied audience
- Demonstrate strong follow-through, a strong work ethic and consistently keep and meet commitments
Other Requirements
- Ability to read, write, and speak English
- Ability to speak in public settings, interface with customers, partners and vendors confidently
- Travel – Up to 25% of the job will require travel, approximately a week a month
Date Posted
11/08/2024
Views
0
Similar Jobs
Linux Support Engineer - Voltage Park
Views in the last 30 days - 0
Voltage Park is seeking a Linux Support Engineer for a fulltime remote position The ideal candidate will have command line level Linux sys administrat...
View DetailsTechnical Architect - CDW
Views in the last 30 days - 0
CDW offers a rewarding career opportunity for a Technical Architect with expertise in ServiceNow The role involves delighting customers by collaborati...
View DetailsSenior React.js & Python Developer - Lemon.io
Views in the last 30 days - 0
Lemonio is a marketplace that connects Senior Developers with handpicked startups in the US and Europe They offer projects based on the developers exp...
View DetailsFederal Security Solutions Engineer - Rapid7
Views in the last 30 days - 0
Rapid7 is seeking a Federal Solutions Engineer with 5 years of experience in cybersecurity solutions engineering or technical sales focusing on federa...
View DetailsSales Engineer - Dandy
Views in the last 30 days - 0
Dandy a venturebacked company is revolutionizing the 200B dental industry with advanced technology They are looking for a Sales Engineer with 5 years ...
View DetailsEngineering Manager (Group Practice Tooling & Provider CX) - Headway
Views in the last 30 days - 0
Headway is a mental healthcare company founded in 2019 aiming to build a new mental health care system accessible to everyone They have a national net...
View Details