Site Reliability Engineer - Azure

Intapp United States

Company

Intapp

Location

United States

Type

Full Time

Job Description

The Intapp Cloud Platform is a rapidly growing collection of cloud services. As part of a global team, the ideal candidate will be able to quickly move between architecture, design, and daily operations with an emphasis on scalability and automation.

You will dive deep into operational issues from the software, systems, automation, and process perspectives. You will understand the challenges of integrating disparate infrastructures into a new facility, processes, and procedures, and you will actively contribute to services that shrink and expand based on demand, self-heal, and roll out automatically.

What you will do:

  • Own end-to-end availability (SLO/SLA), reliability, and performance of the Intapp Cloud Platform by developing processes, metrics, and engineering projects that ensure maximum reliability and uptime for our customers.
  • Develop and maintain dashboards, alerts, and operational procedures to increase availability of the Intapp Cloud Platform.
  • Perform deep dives into both systemic and latent reliability issues; partner with software engineers across the organization to produce and roll out fixes.
  • Identify and drive opportunities to improve automation for the cloud; scope and create automation for deployment, management, and visibility of our services.
  • Participate in a 24x7 on-call rotation.

Who you are:

  • You have hands-on experience building and operating fault-tolerant and scalable systems.
  • You have hands-on experience with Incident and RCA processes.
  • You have hands-on experience with cloud environments such as AWS, GCP, and Azure, as well as container technologies and orchestrators such as Kubernetes, Docker, or OpenShift.
  • You have strong scripting abilities in Python, Go, or JVM-based languages.
  • You have a solid understanding of continuous integration, deployment and operations concepts.
  • You have a passion for resolving reliability issues and identifying strategies to mitigate them going forward.
  • Automation mindset - if you can automate it, do it.


Date Posted

10/03/2024
