Lead Site Reliability Engineer

EPAM Systems Rockaway, NJ

Company

EPAM Systems

Location

Rockaway, NJ

Type

Full Time

Job Description

We are seeking a highly skilled Lead Site Reliability Engineer to join our team.
The ideal candidate will have a strong background in software engineering and systems engineering, with a focus on reliability and scalability in cloud environments, specifically Azure.
Experience the freedom of remote work from anywhere in Kyrgyzstan, whether it's the comfort of your home or our modern office in Bishkek.

#LI-DNI

Responsibilities

  • Design, implement, and maintain highly available and scalable systems across multi-region Azure cloud architectures
  • Ensure disaster recovery plans are in place and tested regularly
  • Configure and enhance monitoring and alerting processes using Prometheus, Grafana, Alertmanager, and OpsGenie
  • Develop dashboards to visualize system performance and reliability metrics
  • Utilize Terraform for infrastructure provisioning and management
  • Implement best practices for continuous deployment and infrastructure changes
  • Work closely with the development team to support ongoing development efforts
  • Communicate with the customer's DevOps team to elaborate on requirements and collaborate on implementations
  • Enhance release management and CI/CD processes using Jenkins
  • Improve system security based on recommendations from the security team
  • Write and test runbooks to streamline operational tasks and incident response
  • Manage and optimize services running on Kubernetes, Docker/Linux environments
  • Handle data persistence using Cosmos DB (Mongo API & SQL API) and MS SQL Server
  • Work with messaging systems like RabbitMQ, Kafka, and EventHub
  • Utilize Azure Networking for secure and efficient communication
Requirements

Want more jobs like this?

Get jobs in Rockaway, NJ delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.
  • 5+ years experience as a DevOps or SRE engineer
  • Proven experience with multi-region Azure cloud architectures
  • Proficiency in Kubernetes and containerization technologies
  • Strong knowledge of Cosmos DB (both Mongo API & SQL API) and MS SQL Server
  • Familiarity with monitoring tools like Prometheus, Grafana, Alertmanager, OpsGenie
  • Experience with .NET Core and ASP.NET Core applications
  • Competency in Docker and Linux environments
  • Expertise in Terraform for infrastructure as code
  • Experience with CI/CD tools
  • Solid understanding of Azure Networking concepts
  • Excellent communication skills, both verbal and written
  • Strong self-motivation and ability to self-manage tasks and projects
Nice to have
  • Experience with Azure IoT Hub and EventHub
We offer
  • We connect like-minded people::
    • Delivering innovative solutions to industry leaders, making a global impact
    • Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
    • Opportunity to work abroad for up to two months per year
    • Relocation opportunities within our offices in 55+ countries
    • Corporate and social events
  • We invest in your growth::
    • Leadership development, career advising, soft skills and well-being programs
    • Certifications, including GCP, Azure and AWS
    • Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly
    • Free English classes with certified teachers
  • We cover it all::
    • Monetary bonuses for engaging in the referral program
    • Medical & family care package
    • Six trust days per year (sick leave without a medical certificate)
    • Coverage of psychology sessions of your choice
    • Discounts for fitness clubs and sports programs
    • Benefits package (sports activities, a variety of stores and services)
EPAM Kyrgyzstan is a team of technologists and innovators united by a passion for technology. In 2022, we opened our first office in Bishkek that works with the world's leading companies across many different industries. EPAM builds a continuously learning organization and helps its employees reach their full potential and achieve their professional goals through learning. Our agile methodologies, client collaboration frameworks, engineering excellence programs, and hybrid teams offer many career paths and development opportunities.

Apply Now

Date Posted

01/24/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Neutral
Subjectivity Score: 0

Similar Jobs

Sous Chef - Riverton Country Club

Views in the last 30 days - 0

Customer Focused Understanding of how the quality of food impacts the guest experience and the ability to work with the team to meet and exceed guest

View Details

Connected Insights Data Analyst - Subaru of America

Views in the last 30 days - 0

Conducts additional analysis as needed to answer specific business questions Occasionally travels to relevant conferences both Subaru and industry Sub...

View Details

HR Data Analyst - American Water

Views in the last 30 days - 0

The analyst is responsible for gathering requirements designing functional solutions and overseeing the development testing and deployment of reportin...

View Details

Executive Chef/Kitchen Manager - Trattoria Palermo

Views in the last 30 days - 0

Collaborate with frontofhouse management to coordinate banquet events and special functions Develop and design innovative menus that reflect seasonal

View Details

Hand Medicine Orthopedic Surgeon - RWJBarnabas Health

Views in the last 30 days - 0

View Details

Customer Service Representative - GlacierPoint Enterprises

Views in the last 30 days - 0

You will handle execute all incoming and outgoing calls on a daily basis Currently looking for an entry level Customer Service Representative to join...

View Details