NVIDIA Logo

NVIDIA

Senior Software QA Test Development Engineer

Job Posted 21 Days Ago Posted 21 Days Ago
Be an Early Applicant
Santa Clara, CA
Senior level
Santa Clara, CA
Senior level
The candidate will develop and execute test plans for NVIDIA's platforms, manage reliability testing, and automate server and OS level processes.
The summary above was generated by AI

NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you. NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise server integration, strong OS experience, reliability testing with various telemetries, scale out cluster, test plan development, CI/CD and DevOps experience to join our platform SWQA team.

What you’ll be doing:

  • Responsible for the development and execution of NVIDIA HGX/DGX/MGX platform test plan on servers, OS, FW and CUDA SW stack from design doc.

  • Installing and testing various systems OS, server firmware and SW stack.

  • Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.

  • Build, develop/debug server and OS level automation front-end and back-end framework and tests

  • Review partner and supplier test results and prescribe additional reliability testing on components, servers, and packaging as needed.

  • Work in an agile software development team with very high production quality standards.

  • Manage bug lifecycle and collaborate with inter-groups to drive for solutions.

What we need to see:

  • Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field

  • 5+ years proven experience; or Master’s Degree.

  • Proven years of OS and server level automation experience using Python, SHELL, Ansible, Jenkins, C/C++, Java, JavaScript

  • Strong server and OS(Ubuntu, RedHat, CentOS, SuSE, Fedora, Windows and etc…) trouble-shooting and debugging experience in a bare-metal and KVM/VMWare/Hyper-V environment.

  • Good knowledge and hands-on experience in model testing, AI tools/frameworks (TensorFlow, Pytorch, Cursor and etc…), NLP  and LLM benchmarking

  • Experience in developing CI/CD automation processes and DevOps contribution with a real passion for automation.

  • Strong experience in FW, BMC/OpenBMC, Network protocol, internal/external enterprise storage devices, PCIe buses and devices, IO sub-devices, CPU and memory, ACPI, UEFI spec, Redfish - huge plus

  • Proven years of experience in GitHub/Gitlab/Gerrit, PXE, SLURM, Stack/Kubernetes/Docker) – huge plus

Ways to stand out from the crowd:

  • Experience working with NVIDIA GPU hardware is a strong plus.

  • Good to have solid understanding of virtualization in Linux (KVM, Docker orchestrated with Kubernetes)

  • Background in deep learning frameworks is a plus

  • Background in parallel programming ideally CUDA/OpenCL is a plus

    The base salary range is 136,000 USD - 212,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

    You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    Top Skills

    Ansible
    C/C++
    Centos
    Cursor
    Docker
    Fedora
    Gerrit
    Git
    Gitlab
    Java
    JavaScript
    Jenkins
    Kubernetes
    Python
    PyTorch
    Redhat
    Shell
    Suse
    TensorFlow
    Ubuntu
    Windows
    HQ

    NVIDIA Santa Clara, California, USA Office

    2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

    Similar Jobs

    Yesterday
    Santa Clara, CA, USA
    Senior level
    Senior level
    Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
    Responsible for developing and executing test plans for NVIDIA platforms, troubleshooting, automation framework, and collaborating on solutions for test failures.
    Top Skills: AnsibleC/C++DockerJavaJavaScriptJenkinsKubernetesPythonPyTorchShellTensorFlow
    An Hour Ago
    Easy Apply
    Hybrid
    3 Locations
    Easy Apply
    Expert/Leader
    Expert/Leader
    Cloud • Software
    As a Principal Software Engineer, you will lead architectural and design efforts for various projects, focusing on AI/ML workloads and cloud-based solutions. Your role will involve collaborating across teams to develop innovative solutions, ensuring optimal architectural strategies, and mentoring peers in a fast-paced environment.
    An Hour Ago
    Easy Apply
    Hybrid
    2 Locations
    Easy Apply
    Senior level
    Senior level
    Cloud • Software
    As an Engineering Manager, you'll lead a team to develop features, enhance architecture, and ensure performance while coordinating cross-team initiatives.
    Top Skills: JavaReactSpring FrameworkTypescriptVuejs

    What you need to know about the San Francisco Tech Scene

    San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

    Key Facts About San Francisco Tech

    • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
    • Major Tech Employers: Google, Apple, Salesforce, Meta
    • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
    • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
    • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
    • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
    By clicking Apply you agree to share your profile information with the hiring company.

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account