Kumo Logo

Kumo

Software Engineer - Infrastructure

Job Posted 14 Days Ago Reposted 14 Days Ago
Hybrid
Mountain View, CA
Mid level
Hybrid
Mountain View, CA
Mid level
Design and build scalable training and inference systems for machine learning platforms focusing on data lakehouses. Engage with customers and iterate on the product.
The summary above was generated by AI

Come and change the world of AI with the Kumo team!


Companies spend millions of dollars to store terabytes of data in data lakehouses, but only leverage a fraction of it for predictive tasks. This is because traditional machine learning is slow and time consuming, taking months to perform feature engineering, build training pipelines, and achieve acceptable performance.


At Kumo, we are building a machine learning platform for data lakehouses, enabling data scientists to train powerful Graph Neural Net models directly on their relational data, with only a few lines of declarative syntax known as Predictive Query Language. The Kumo platform enables users to build models a dozen times faster, and achieve better model accuracy than traditional approaches.


In this role, you will help us design and build the core of our system working alongside seasoned engineering leaders with decades of experience and some of the best researchers from Stanford and other top universities. We are adapting and inventing cutting edge ML techniques to fit well with learning over data warehouses. The technical challenge here is to create scalable storage and compute systems that will fit well with both the data warehouse and the ML training and inference systems and provide for scalability, reliability, restart and security of cloud-first applications. The final product will follow a customer first approach and be extremely easy to use from both ML and cloud deployment.

The Value You'll Add:

  • Design the core of our training and inference systems
  • Design and build systems to scale and integrate
  • Design APIs between the system components to decouple them and make independent development easy
  • Produce designs that can be iterated over time to achieve more scalability
  • Implement the first version of the product
  • Engage with customers, iterate on the product

Your Foundation:

  • BS (preferred MS or Ph.D.) in Computer Science or related technical discipline, or related practical experience
  • Role will be focused on Systems/Scalability/Integrations.
  • 2+ years experience in software design, development, and algorithm-related solutions
  • 2+ years experience programming in OOP languages like Java, Python, or C++

Your Extra Special Sauce:

  • Knowledge of Cloud distributed storage/databases, file systems, and distributed storage.
  • Ideal candidate would have the knowledge to integrate systems and distributed data processing technologies
  • Experience building Micro-services & Cloud Platforms on AWS, and Azure.
  • Experience with industry, open-source projects, and/or academic research in large-data, parallel, and distributed systems.
  • Experience in using and/or contributing to an inference serving technology, PyTorch, TensorFlow, etc.
  • Experience in building machine learning platforms at large scale
  • Understand the fundamentals of machine learning, ideally in both academic and industry environments

Benefits

  • Stock
  • Competitive Salaries
  • Medical Insurance
  • Dental Insurance 

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Top Skills

AWS
Azure
C++
Java
Python
PyTorch
TensorFlow
HQ

Kumo Mountain View, California, USA Office

357 Castro St, Suite 200, Mountain View, CA, United States, 94041

Similar Jobs

2 Days Ago
4 Locations
Senior level
Senior level
Cloud • Information Technology • Machine Learning
As a Software Engineer in Infrastructure, design and implement scalable solutions for managing a global hardware fleet, focusing on automation, backend services, and Kubernetes custom resources.
Top Skills: AnsibleArgoChefFluxGithub ActionsGoGrafanaGrpcKubernetesLinuxPrometheusPuppetSalt
4 Days Ago
Hybrid
San Francisco, CA, USA
Senior level
Senior level
Cloud • Healthtech • Social Impact • Software • Biotech
As an Infrastructure Engineer, design, build, and maintain core infrastructure on AWS, focusing on reliability, security, and automation for biotech applications.
Top Skills: Amazon EcsAnsibleAWSBuildkiteChefGoJavaJenkinsKubernetesPythonRubyTerraform
5 Days Ago
Hybrid
4 Locations
195K-343K Annually
Expert/Leader
195K-343K Annually
Expert/Leader
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
The Staff Software Engineer will design and optimize ML Infrastructure, develop high-performance inference systems, and collaborate with ML engineers on production models.
Top Skills: C++DockerGoJavaKubernetesNvidia TritonPyTorchTensor RtTensorFlowTensorflow ServingVllm

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account