Top San Francisco Bay Area, CA Database Companies (113)
The database market is big. How big? Well, according to IDC, it’ll reach $153 billion by 2027. And MongoDB is at the forefront of that innovation with thousands of customers across the globe. We empower developers and businesses to build and deploy the applications they want, wherever they want.
Datavant is a healthcare data firm that aims to eliminate siloed healthcare information to improve medical research and patient care.
SourceScrub is the world's leading data service for firms looking to research, find and connect with privately held companies. Our Private Company Intelligence platform allows deal teams to take a data-driven approach in a traditionally opaque segment of the market. We combine state-of-the-art technology with an unmatched QA process resulting in the freshest and most accurate data set available.
Founded by the leaders that built data teams at LinkedIn and Airbnb, Acryl Data enables you to take back control of your fragmented data stack. We do this by driving the #1 open source Metadata Platform DataHub, which has a community of 8,000+ data practitioners and is deployed in 1,000+ companies. Acryl DataHub is a third-generation streaming metadata platform that integrates...
Snowflake powers the end-to-end data lifecycle – from ingesting and processing data to analyzing and modeling it, to building and sharing data and AI applications – helping engineers, analysts, and leaders innovate faster and achieve more with their data. We're on a mission to empower every enterprise to achieve its full potential through data and AI.
Olive connects to your database & third party services to help you build admin dashboards and support tools in minutes.
Founded in January 2021 and headquartered in San Francisco, RisingWave Labs is an early-stage start-up that innovates database systems. We develop RisingWave, a distributed SQL database for stream processing. We've raised over $40 million from some of the world's top investors. Our vision is to augment enterprise data platforms by delivering timely, reliable, and cost-efficient processing of event data in real...
We Structure the World's Knowledge. Diffbot is a world-class group of AI engineers building a universal database of structured information, to provide knowledge as a service to all intelligent applications. Whether you are building an app that uses web content, an enterprise business application, or a smart robotic assistant, we've got you covered. Thousands of leading companies rely on Diffbot data...
Swiftly is the leading transit data platform for agencies to share real-time passenger information, manage day-to-day operations, and improve service performance. Today, over 140 transit agencies in 8 countries – including LA Metro, MARTA, SEPTA, MBTA, and WMATA – rely on Swiftly to improve on-time performance by up to 40%, increase passenger information accuracy by up to 50%, and analyze...
DagsHub is where people build data science projects. Leverage popular open source tools to version datasets & models, track experiments, label data, and visualize results
Coefficient is a no-code solution that allows business teams to work with real-time data directly from their spreadsheets, automate workflows, and use AI to build for them. You can sync your Google Sheets to your company systems such as Salesforce, Hubspot, Google Analytics, Looker, MySQL, Redshift, Slack and more. This empowers anyone in the company to build reports and analyses...
Greenplum Database® is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Database® project is released under the Apache 2 license. We want...
The exponential increase in the amount of data being communicated and processed around the globe is driving the energy consumption of datacenters and communications networks to 17% of total electricity demand worldwide by 2030(1), dramatically increasing pollution, carbon emissions and cost. Empower Semiconductor was founded with the mission to “minimize the energy footprint of the digital economy” by developing novel...
Etleap ETL removes the headaches experienced building data pipelines. A cloud-native platform that seamlessly integrates with AWS infrastructure, Etleap ETL consolidates data without the need for coding. Our purpose is to unlock the power of data and enable analytics teams to be more productive by removing the need for internal IT resources or knowledge of complex scripting languages, reducing the time...
Zilliz is a leading vector database company for production-ready AI. Built by the engineers who created Milvus, the world's most popular open-source vector database, Zilliz is on a mission to unleash data insights with AI. The company builds next-generation database technologies to help organizations rapidly create AI/ML applications, and unlock the potential of unstructured data. By taking the burden of...
At Ascentt, we build AI that truly works for enterprises—secure, intelligent, and built to scale. With over 15 years of expertise in AI, data, and analytics, we empower businesses to transform complexity into clarity. Our AI solutions don’t just process data; they unlock insights, automate workflows, and accelerate decision-making—seamlessly fitting into enterprise systems while keeping security at the forefront. We believe AI...
Arterys was founded to facilitate the global advancement of medicine through data, artificial intelligence and technology. Because a significant proportion of the world's medical data resides in medical images, Arterys set out to tackle several issues around the space, including the enormous workloads radiologists face, the lack of accuracy with many of today's tools, and the need for increased consistency...
Democratizing AI on the Modern Data Stack! The team behind PyG (PyG.org) is working on a turn-key solution for AI over large scale data warehouses. We believe the future of ML is a seamless integration between modern cloud data warehouses and AI algorithms. Our ML infrastructure massively simplifies the training and deployment of ML models on complex data. With over 40,000...
BeeHero maximizes crop yields through precision pollination, combining sophisticated machine learning algorithms with low-cost sensors to stimulate full output potential during peak pollination cycles. By tracking and optimizing pollination in real-time, BeeHero ensures hyper-efficient pollinators that can increase crop yields by 30% on average. Beehero’s platform already enables commercial growers to optimize crop-yield for 70% of major commercial crops. Based...
Work Your Passion. Live Your Purpose.




































