Profluent

Bioinformatics Engineer

Reposted 6 Days Ago

2 Locations

150K-200K Annually

Mid level

The Bioinformatics Engineer will design cloud infrastructure for genomic data processing and analysis, expanding the world's largest protein sequence database and building scalable systems for data management.

Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine. Based in Emeryville, CA, we are backed by leading investors including Spark Capital, Insight Partners, Air Street Capital, AIX Ventures, and Convergent Ventures.

Here at Profluent, data is our lifeline. Our generative models learn the blueprint of life by modeling large-scale evolutionary data, enabling us to engineer and write biology in unprecedented ways. As we continue to push the boundaries of what is possible, the volume of available genomic data is growing exponentially. Managing and extracting insights from this ever-expanding data is at the core of this position.

We're seeking a Bioinformatics Engineer to design and build cutting-edge data and cloud infrastructure capable of handling the immense scale and complexity of our genomic datasets. This role is vital to ensuring that we can efficiently process, store, and analyze petabytes of data, unlocking the full potential of our models and driving discoveries across the life sciences.

As an early team member, you’ll play a pivotal role in building the foundations of our data and cloud architecture. You will have the autonomy to make critical decisions and directly influence the success of our mission to harness the power of data for machine learning.

This is an excellent opportunity to shape the future of AI-driven protein design and to work cross-functionally with a diverse team of experts across machine learning, protein engineering, cell biology, and gene editing.

Responsibilities

Maintain and expand the world’s largest database of protein sequences
Deploy cloud-based pipelines to process and search large-scale genomic datasets
Build cloud databases for scalable storage and fast retrieval of terabases of genomic data, including genomes, genes, proteins, and structures
Clearly document code and communicate outcomes to colleagues

Qualifications

BS, MS, or PhD in Bioinformatics, Genomics, Computer Science, or a related quantitative bioscience field
3+ years of industry or postdoc experience
Experience working with Google Cloud Platform (GCP) or other cloud-based compute services (e.g., AWS)
Experience building cloud pipelines with workflow tools (Snakemake, Nextflow) and containerized applications (Docker)
Experience with highly parallelized cloud-based computing platforms (Batch or Kubernetes)
Experience with scalable databases (BigQuery, Bigtable) and proficiency in database programming (SQL)
Fluency in Python data analysis tools (NumPy, pandas, Jupyter Notebook, Biopython)
Experience with Linux environments and version control (Git)

Preferred (but not required)

Experience with bioinformatics tools for sequence and structure analysis
Experience working with next-generation sequencing data
Familiarity with public repositories like UniProt, EBI, JGI, NCBI, and SRA
Familiarity with concepts in molecular biology, biochemistry, and structural biology
Knowledge of prokaryotic gene and genome structure
Publications in major scientific journals or conferences

Work Authorization Requirement

Applicants must have ongoing work authorization in the United States that does not require employer sponsorship. Sponsorship will not be provided now or at any time in the future for this position.

Employment Eligibility Verification

Legal authorization to work in the United States is required. In compliance with federal law, all persons hired must verify their identity and work eligibility and complete the required employment verification form upon hire.

Hiring Salary Range

$150,000–$200,000 USD

What we offer at Profluent

A high-growth opportunity with meaningful impact
Competitive compensation package
Health insurance (health/dental/vision)
Generous paid time off (PTO) policy
Commitment to physical and mental well-being
More benefits and perks to be added!

Profluent Bio, Inc. is an equal opportunity employer promoting diversity and inclusion in the workplace. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical conditions, veteran status, sexual orientation, gender (including gender identity and gender expression), sex (which includes pregnancy, childbirth, and breastfeeding), genetic information, taking or requesting statutorily protected leave, or any other basis protected by law.

Top Skills

AWS, Batch, BigQuery, Bigtable, Biopython, Docker, Google Cloud Platform, Jupyter Notebook, Kubernetes, Nextflow, NumPy, pandas, Python, Snakemake, SQL

Berkeley, California, United States
