Fireworks AI Logo

Fireworks AI

AI Infrastructure Engineer

Job Posted 9 Days Ago Posted 9 Days Ago
Be an Early Applicant
Redwood City, CA
Mid level
Redwood City, CA
Mid level
This role involves designing and managing AI infrastructure, optimizing systems for efficiency, and collaborating with teams for AI projects. Requires experience in ML infrastructure and software design.
The summary above was generated by AI


Job Duties:
Design core, backend software components. Interface with other teams to incorporate their innovations and vice versa. Conduct design and code reviews. Analyze and improve efficiency, scalability, and stability of various system resources. Design and implement the hardware and software infrastructure required for AI projects. Procure, configure, and manage servers, GPUs, TPUs, and other hardware resources. Set up cloud-based environments (e.g., AWS, Azure, GCP) for AI workloads. Deploy and manage distributed computing clusters (e.g., Kubernetes) for AI model training and inference. Optimize cluster performance and resource allocation for AI workloads. Monitor cluster health and troubleshoot issues as they arise. Architect and maintain data storage solutions (e.g., data lakes, databases) for AI datasets. Ensure data security, access controls, and data versioning. Implement data pipelines for efficient data ingestion and preprocessing. Develop and maintain automation scripts and tools for infrastructure provisioning and scaling. Implement continuous integration and continuous deployment (CI/CD) pipelines for AI models. Orchestrate workflows for training, evaluation, and deployment of AI models. Optimize infrastructure to handle large-scale AI workloads efficiently. Monitor and analyze system performance, making adjustments as needed. Implement load balancing and scaling strategies to meet demand. Implement security best practices to protect AI infrastructure and data. Stay up-to-date with security vulnerabilities and apply patches and updates. Ensure compliance with relevant data privacy and regulatory requirements. Collaborate with data scientists and AI engineers to understand their infrastructure needs. Provide technical support and troubleshooting assistance for AI infrastructure issues. Train and educate team members on best practices for using AI infrastructure.
Minimum Education & Experience Required:
Must have Bachelor’s degree or the equivalent in Computer Science, Computer Engineering or a related field, plus three (3) years of experience with ML infrastructure (PyTorch, Vertex AI, and Sagemaker) or related experience.


Minimum Skills Required:
Must have experience with: Experience with one or more search engine, recommendations, natural language processing, personalization, or similar applied ML domain. Experience with building, scaling, and optimizing distributed enterprise-grade Machine Learning systems. Experience with architectural patterns of large-scale software applications. Experience with publishing papers in machine learning and/or computer vision conferences and journals. Experience with large-scale machine learning techniques like semi-supervised learning, weakly-supervised learning, and online adaptation of ML models. Experience with publishing machine learning domains such as computer vision and natural language processing.

How to Apply:
Submit resume and apply online at http://www.fireworks.ai/careers and search for job by title.

Top Skills

AWS
Azure
GCP
Kubernetes
PyTorch
Sagemaker
Vertex Ai
HQ

Fireworks AI Redwood, California, USA Office

Redwood, CA, United States, 94063

Similar Jobs

3 Days Ago
Hybrid
San Mateo, CA, USA
239K-392K Annually
Senior level
239K-392K Annually
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
The AI Platform Engineer will design and build large-scale ML data infrastructure, focusing on Knowledge Graphs and Feature Store, to support AI-driven experiences across Roblox.
Top Skills: Ai/MlAws NeptuneDynamoDBFeastFeature StoreFlinkKafkaKnowledge GraphsNeo4JRayRedisSparkTectonTigergraphVertex Ai Feature Store
195K-343K Annually
Expert/Leader
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
The Staff Software Engineer will develop scalable machine learning workflows, implement cloud-based infrastructure, and support ML applications integration at Snap Inc.
Top Skills: Caffe2Google Kubernetes EnginePyTorchSagemakerScikit-LearnSpark MlTensorFlowVertex Ai
3 Days Ago
San Jose, CA, USA
Mid level
Mid level
Artificial Intelligence • Robotics • Automation • Manufacturing
The role focuses on managing and enhancing training infrastructure for AI humanoid robots, collaborating with researchers to implement large-scale deep learning solutions.
Top Skills: AnsibleAWSAzureChefGCPKubernetesLsfPuppetPythonPyTorchSlurmTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account