Solutions Architect - Cloud Infrastructure

NVIDIA Santa Clara, CA / Remote

Company

NVIDIA

Location

Santa Clara, CA / Remote

Type

Full Time

Job Description

We are excited to announce an opening for a Cloud Solution Architect at NVIDIA and are seeking a passionate individual with a strong interest in cloud infrastructure engineering! If you are enthusiastic about contributing to projects that push the boundaries of cloud-based AI and resilience in large-scale environments, we invite you to read on. NVIDIA is renowned as one of the most sought-after employers in the technology world, offering highly competitive benefits. We are home to some of the most innovative and forward-thinking individuals globally. If you are creative, autonomous, and eager to apply your skills and knowledge in a dynamic environment, we want to hear from you!

What you'll be doing:

  • Working as a key member of our cloud solutions team, you will be the go-to technical expert on NVIDIA's GPU-accelerated cloud offerings, helping clients build resilient and telemetry-driven cloud infrastructures.
  • Collaborating directly with engineering teams to secure design wins, address challenges, and deploy solutions into production, with a focus on developing robust tooling for observability and failure recovery.
  • Acting as a trusted advisor to our clients, understanding their cloud environment, translating requirements into technical solutions, and providing guidance on optimizing NVIDIA DGX Cloud for scalable, reliable, and high-performance workloads.

Want more jobs like this?

Get jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.

What we need to see:

  • 2+ years of experience in cloud infrastructure engineering, AI/ML systems, or large-scale distributed systems.
  • A BS in Computer Science, Electrical Engineering, Mathematics, or Physics, or equivalent experience.
  • A proven understanding of cloud computing and large-scale computing systems.
  • Proficiency in Linux, Windows Subsystem for Linux, and Windows.
  • A passion for machine learning and AI, and the drive to continually learn and apply new technologies.
  • Excellent interpersonal skills, including the ability to explain complex technical topics to non-experts.

Ways to stand out from the crowd:

  • Expertise with orchestration tools like Slurm and Kubernetes.
  • Familiarity with NVIDIA's DGX Cloud, Base Command Platform, and its ecosystem.
  • Hands-on experience designing telemetry systems and failure recovery mechanisms for large-scale cloud infrastructures including observability tools such as Grafana, Prometheus, and OpenTelemetry.
  • Proficiency in deploying and managing cloud-native solutions using platforms such as AWS, Azure, or Google Cloud, with a focus on GPU-accelerated workloads.
  • Contributions to open-source projects showcasing expertise in cloud-AI/infrastructure engineering.

The base salary range is 120,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Apply Now

Date Posted

01/13/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Positive
Subjectivity Score: 0.9

Similar Jobs

Software Engineer, Data Platform (Lead) - Benchling

Views in the last 30 days - 0

Benchling a leading biotechnology company is seeking a Senior Software Engineer to design and implement scalable multitenant services and APIs The rol...

View Details

Senior Product Manager, Enterprise - Atlassian

Views in the last 30 days - 0

Loom a video communication platform for asynchronous work is seeking a Senior Product Manager for its Enterprise team The role involves defining strat...

View Details

Senior Product Manager, Dev Solutions - Atlassian

Views in the last 30 days - 0

Atlassian offers a remote position for a Product Manager in the Dev Solutions team The role involves collaborating with crossfunctional teams to lead ...

View Details

Treasury Management Officer - Technology and Disruptive Commerce - JPMorganChase

Views in the last 30 days - 0

The job posting is for a Treasury Management Officer in Commercial Banking The role involves generating new treasury management business maintaining c...

View Details

Relationship Executive, Middle Market Banking - Executive Director - JPMorganChase

Views in the last 30 days - 0

The job description is for a Relationship Executive role in the Middle Market Banking team The role involves building and retaining profitable relatio...

View Details

Linux Support Engineer - Voltage Park

Views in the last 30 days - 0

Voltage Park is seeking a Linux Support Engineer for a fulltime remote position The ideal candidate will have command line level Linux sys administrat...

View Details