Home     |     Startup Lists
Top Data Startups

Top Data Startups to Watch (and Work for) in 2023

August 2, 2023
      |      Kieran Ryan
engineering code on a laptop screen

My Background

Hi, I'm Kieran! I worked in startups for five years at On Deck and Wefunder. Some of my favorite things that I've worked on during my career include:

  • Wefunder Workaway: 12 founders working on a new idea, 7 days, 1 mansion in Hawaii. One of the teams we picked raised a Series A from Andreessen Horowitz, and two people who met during the trip got married...
  • On Deck: As employee #15, I contributed to the startup's growth from no revenue to a $20 million run rate in less than a year. I led the team that built the admission infrastructure to support and scale ~20 fellowships for founders, capital allocators, operators, and creators.
  • XX: Worked on a pre-YC accelerator program that invested $125K on an MFN SAFE into 8-10 teams per cohort.
  • ODX: Worked on a $100 million accelerator program that invested $125K for 7% into 100s of startups. We partnered with leaders like Flexport to build vertical industry tracks around supply chain & logistics, etc.
  • Ryan Marketing: Created an influencer marketing startup (2 million+ Instagram followers) during college with my twin brother and realized early that monetization for creators was broken. I learned a lot, and it paid enough to cover my schooling, rent, and meals.

The Methodology

I considered several data points to arrive at the startups selected on this list, including company traction, funding raised, valuation, and investors. From my initial list, I then referenced each startup with my personal network to narrow down the list to the one you see below. I take my credibility seriously, so I do not take any money from startups who want to be included on this list to remain impartial. If you want to understand my analysis of startups and prefer long-form content, read my research on the top startups of 2023 afterward.

Databricks Logo

Databricks

Data lakehouse architecture and AI company.

Headquarters

San Francisco, California

Industry

Data, Cloud

Founded

2013

Funding

$3.5 billion

Valuation

$38 billion

Traction

  • Databricks has over 7,000 customers, including Adobe, Rolls Royce, Comcast, Condé Nast, H&M, AT&T, Shell, Walgreens, Air Canada, Toyota, and T-Mobile.
  • Over 40% of the Fortune 500 relies on the Databricks Lakehouse Platform to unify their data.
  • Databricks has hundreds of global partners, including Microsoft, Amazon, Tableau, and Booz Allen Hamilton.

orange X

dbt Labs

Transform data in your warehouse.

Headquarters

Philadelphia, Pennsylvania

Industry

Dev Tools, Data

Founded

2008

Funding

$414.4 million

Valuation

$4.2 billion

Traction

  • 16,000 companies use dbt Labs, including Hubspot, Zip, JetBlue, and Kickstarter.
  • dbt Labs acquired Transform, a platform that allows organizations to work on the metric layer in today’s modern data stack.
Census Logo

Census

Data activation platform built on your warehouse.

Headquarters

San Francisco, California

Industry

Data

Founded

2018

Funding

$80.3 million

Valuation

$630 million

Traction

  • Census has a couple of hundred customers, including Notion, MasterClass, Intercom, ClickUp, Sonos, Canva, Zip, Smartify, Figma, Apollo.io, Guru, Fivetran, Loom, and Mixpanel.
  • Census supports 150+ integrations, including Snowflake, Databricks, Google Sheets, Postgres, BigQuery, MySQL, Azure Synapse, S3, Redshift, and Microsoft SQL Server.
Bigeye logo

Bigeye

Data quality automation platform.

Headquarters

San Francisco, California

Industry

Data

Founded

2019

Funding

$66 million

Valuation

Undisclosed

Traction

  • From Q4 2020 to Q3 2021, Bigeye achieved four consecutive quarters of doubling usage and new customers.
  • Bigeye has tracked 50 million data check runs on its platform.
  • Bigeye customers include Instacart, Udacity, Recharge, Clubhouse, Scale, Docker, SignalFire, Crux, and Mayan.
Domino Data Lab Logo

Domino Data Lab

Unleash data science at scale.

Headquarters

San Francisco, California

Industry

Data

Founded

2013

Funding

$223.6 million

Valuation

Undisclosed

Traction

  • More than 20% of the Fortune 100 are customers of Domino Data Lab, including Lockheed Martin, Johnson & Johnson, Moody’s Analytics, Topdanmark, AES, Bayer, Allstate, BNP Paribas Cardif, Eli Lilly, Ford, Red Hat, GAP, and New York Life.
  • Domino Data Lab typical customer results are 542% ROI, 80% faster model deployment, 50% shorter model lifecycle, and 75% faster onboarding.
"O" with a black dot in the center

Observable

Collaborative data platform and canvas.

Headquarters

San Francisco, California

Industry

Data

Founded

2016

Funding

$46.1 million

Valuation

Undisclosed

Traction

  • As of January 2022, the Observable community surpassed 5 million people.
  • As of May 2023, the Observable community has created 200,000+ data visualization examples to use as a starting point for your company’s visualization.
  • Observable’s customers include thousands of data scientists, data analysts, and engineers at organizations like OpenAI, Stitch Fix, Twitter, The Washington Post, The New York Times, The Economist, NBC News, MIT, McKinsey & Company, and Trase.

Fivetran Logo

Fivetran

Automated data movement platform.

Headquarters

Oakland, California

Industry

Data, SaaS

Founded

2012

Funding

$728.1 million

Valuation

$5.6 billion

Traction

  • As of February 2023, Fivetran reported a $200 million annual revenue run rate and 50% YoY growth.
  • As of February 2023, Fivetran syncs 2 trillion rows monthly, manages 1.3 million+ schema changes monthly, manages 175,000+ connectors, and has a 99.9% guaranteed data pipeline uptime.
  • As of May 2023, Fivetran has 5,000+ customers, including Morgan Stanley, Intercom, Asics, Blend, Forever 21, DocuSign, Everlane, Okta, Optimizely, Square, GoodRx, and Lionsgate.
Anomalo Logo

Anomalo

Complete data quality platform.

Headquarters

Palo Alto, California

Industry

Data

Founded

2018

Funding

$39 million

Valuation

Undisclosed

Traction

  • In Q2 2021, Anomalo surpassed $1 million in annualized recurring revenue, 3x over Q1 2021.
  • Anomalo customers include BuzzFeed, Included Health, Discover Financial Services, Substack, Carta, Faire, Block, Aritzia, Fandom, and Notion.
Metaplane Logo

Metaplane

Observability for modern data teams.

Headquarters

Boston, Massachusetts

Industry

Dev Tools, Data

Founded

2019

Funding

$8.4 million

Valuation

Undisclosed

Traction

  • As of January 2023, Metaplane has 140+ customers, including Imperfect Foods, Reforge, Pipe, Vivian Health, Clearbit, Gorgias, SpotOn, Car and Classic, Vendr, and Mux.
ClickHouse Logo

ClickHouse

A fast open-source column-oriented database management system.

Headquarters

Palo Alto, California

Industry

Data, Dev Tools

Founded

2009

Funding

$300 million

Valuation

$2 billion

Traction

  • As of July 2023, ClickHouse has 100,000+ developers on its platform.
  • As of July 2023, ClickHouse’s GitHub has 29.8K stars, 5.9K forks, and 1,276 contributors, up from 20K stars and 800 contributors in October 2021.
  • As of July 2023, ClickHouse has 4,025 developers in its Slack community.
  • As of July 2023, developers have made 32,000+ pull requests, up from 20,000 in October 2021.
  • ClickHouse customers include Cloudflare, eBay, Uber, Comcast, and Cisco.
Airbyte Logo

Airbyte

Open-source data integration platform.

Headquarters

San Francisco, California

Industry

Data

Founded

2020

Funding

$181.2 million

Valuation

$1.5 billion

Traction

  • As of July 2023, Airbyte has 40,000+ data practitioners on its platform, up from 4,500 in December 2021.
  • As of July 2023, Airbyte has 4,000+ daily active companies on the platform, including Monday, Unity, Calendly, BetterUp, Petco, Monday.com, and Anker.
  • As of July 2023, Airbyte reported syncing 1 petabyte of data and processing 2 billion rows monthly.
  • In 2021, 9,000+ companies synced data on Airbyte.
Atlan Logo

Atlan

Third-gen data catalog.

Headquarters

New York, New York

Industry

Cloud, Data, Artificial Intelligence (AI)

Founded

2019

Funding

$71 million

Valuation

$450 million

Traction

  • Atlan customers include Unilever, Autodesk, Plaid, Monster, Ralph Lauren, Juniper Networks, Elastic, and Nasdaq.
  • In 2021, Atlan reported 10x customer and revenue growth.
Cockroach Labs Logo

Cockroach Labs

A distributed database with standard SQL for cloud applications.

Headquarters

New York, New York

Industry

Cloud, Data

Founded

2015

Funding

$633 million

Valuation

$5 billion

Traction

  • As of March 2023, Cockroach Labs averages 139% annual growth.
  • As of December 2021, Cockroach Labs reported 200 customers, including LaunchDarkly, DigitalOcean, Comcast, Booksy, AllSaints, and Storj.
  • In December 2021, Cockroach Labs reported 3x growth in ARR.
  • In Q3 2021, Cockroach Labs reported a 500% growth in cloud revenue.
Collibra Logo

Collibra

Data catalog, data governance, and data quality.

Headquarters

Brussels, Belgium

Industry

Cloud, Data

Founded

2008

Funding

$640 million

Valuation

$5.25 billion

Traction

  • As of November 2021, Collibra has 500+ customers, including Equifax, Cox Automotive, L'Oréal, Cambia Health Solutions, and Northern Trust.
DragonflyDB Logo

DragonflyDB

The fastest in-memory data store.

Headquarters

Tel Aviv, Israel

Industry

Cloud, Data

Founded

2022

Funding

$21 million

Valuation

Undisclosed

Traction

  • As of July 2023, DragonflyDB claims its software has 25x more QPS and 12x faster snapshotting than Redis.
  • As of July 2023, DragonflyDB claims it offers 30% less memory usage than other market offerings.
Dremio Logo

Dremio

The easy and open data lakehouse platform.

Headquarters

Santa Clara, California

Industry

Cloud, Data

Founded

2015

Funding

$407 million

Valuation

$2 billion

Traction

  • As of July 2023, Dremio customers include Unilever, Samsung, Nokia, Microsoft, Blackrock, Bentley, Abbot, Deloitte, Amazon, and DB Cargo.
Hex Logo

Hex

Do more with data together.

Headquarters

San Francisco, California

Industry

Cloud, Data

Founded

2019

Funding

$102 million

Valuation

Undisclosed

Traction

  • As of March 2023, Hex has 450+ customers, including OpenSea, Fivetran, Glossier, Brex, Noom, Notion, Anthropic, AngelList, Algolia, and Glean.
  • As of March 2023, Hex reported 4x growth in revenue and the size of their business in the previous 12-month period.
Hightouch Logo

Hightouch

Get fresh, accurate customer data in all your tools.

Headquarters

San Francisco, California

Industry

Cloud, Data

Founded

2019

Funding

$90 million

Valuation

$615 million

Traction

  • As of July 2023, Hightouch has 500+ customers, including Spotify, Betterment, Ramp, Calendly, Plaid, Retool, Checkr, GitLab, Tripadvisor, GameStop, and the NBA.
MinIO Logo

MinIO

High-performance Kubernetes native object storage.

Headquarters

Redwood City, California

Industry

Cloud, Data, Artificial Intelligence (AI)

Founded

2014

Funding

$126 million

Valuation

$1 billion

Traction

  • As of July 2023, MinIO counts PayPal, Kayak, Solvinity, Mavenir, Rubrik, Iodine, Hansen Software, and Banregio as customers.
  • As of July 2023, MinIO has 1.25 billion+ Docker pulls, 40K GitHub stars, 25.4K Slack members, and 1.1K contributors.
  • In 2021, MinIO reported 201% ARR growth and 208% customer growth over the past year.
Materialize Logo

Materialize

The streaming database.

Headquarters

New York, New York

Industry

Cloud, Data

Founded

2019

Funding

$101 million

Valuation

Undisclosed

Traction

  • As of July 2023, Materialize counts Ramp, Drizly, Density, Onward, Pluralsight, Centerfield, and Kepler Cheuvreux.
Monte Carlo Logo

Monte Carlo

Data reliability delivered.

Headquarters

San Francisco, California

Industry

Cloud, Data, Artificial Intelligence (AI)

Founded

2019

Funding

$240 million

Valuation

$1.6 billion

Traction

  • As of July 2023, Monte Carlo counts CNN, Fox, GoodRx, Intercom, jetBlue, OpenTable, PagerDuty, Pepsico, SeatGeek, SoFi, Sonos, Vimeo, and Affirm as customers.
  • In May 2022, Monte Carlo reported 2x revenue growth every quarter and an 800% increase in YoY revenue since it last raised money in August 2021.
  • Between Summer 2020 and Summer 2021, Monte Carlo reported 8x in ARR growth.
MotherDuck Logo

MotherDuck

Serverless data analytics.

Headquarters

Seattle, Washington

Industry

Cloud, Data

Founded

2022

Funding

$100 million

Valuation

$400 million

Traction

  • As of September 2023, MotherDuck reported ~2000 users.
  • In June 2023, MotherDuck announced its product is now available to customers.
Neon Logo

Neon

Serverless Postgres.

Headquarters

San Francisco, California

Industry

Cloud, Data

Founded

2021

Funding

$54 million

Valuation

Undisclosed

Traction

  • As of July 2022, Neon counted 700 registered users and 5,000 people on their waitlist.
Redis Labs Logo

Redis Labs

The open-source, in-memory data store.

Headquarters

Mountain View, California

Industry

Cloud, Data

Founded

2011

Funding

$357 million

Valuation

$2 billion

Traction

  • As of July 2023, Redis Labs reported 4 billion+ Docker pulls, 50K+ GitHub stars, and 50+ supported programming languages.
  • As of April 2021, Redis Labs has 8,000+ customers, including 31 Fortune 100 customers.
  • As of April 2021, Redis Labs reported 54% CAGR over the 3-years ended 1/31/21.
  • As of April 2021, Redis Labs reported a 120% net revenue retention rate.
PlanetScale Logo

PlanetScale

The world’s most advanced database platform.

Headquarters

San Francisco, California

Industry

Cloud, Data

Founded

2018

Funding

$105 million

Valuation

Undisclosed

Traction

  • As of July 2023, PlanetScale counts Square, Etsy, Community, MyFitnessPal, Solana, Attentive, Kick, and Barstool Sports as customers.
Supabase Logo

Supabase

The open-source firebase alternative.

Headquarters

San Francisco, California

Industry

Cloud, Data

Founded

2020

Funding

$116 million

Valuation

Undisclosed

Traction

  • As of July 2023, Supabase customers include Mendable, Markprompt, Berri AI, HappyTeams, Xendit, and Mobbin.
  • As of August 2022, Supabase has 110,000 developers on the platform.
  • In May 2022, Supabase reported 80,000 developers created 100,000+ databases on its platform, a 1900% growth over the previous 12-month period.
Tabular Logo

Tabular

Data made simple.

Headquarters

San Jose, California

Industry

Cloud, Data

Founded

2021

Funding

$11 million

Valuation

Undisclosed

Traction

  • As of July 2023, Tabular claims its central table store can reduce cloud storage costs by up to 50%.
Dovetail Logo

Dovetail

Customer insights hub.

Headquarters

Sydney, Australia

Industry

Data

Founded

2017

Funding

$57 million

Valuation

Undisclosed

Traction

  • As of July 2023, Dovetail has 10,000+ users from companies including Deloitte, Datadog, Shopify, GitLab, Atlassian, and the Nielsen Norman Group.
  • In 2021, Dovetail reported 3x revenue and customer growth.
Triple Whale Logo

Triple Whale

The AI data platform for e-commerce.

Headquarters

Columbus, Ohio

Industry

E-commerce, Data

Founded

2021

Funding

$51 million

Valuation

Undisclosed

Traction

  • As of July 2023, Triple Whale customers include the Miami Heat, Moodi, Sene, Outway, Milk, Iron Neck, and True Classic.

Explore More Startups Lists

computer logo with a bunch of icons coming out of it

Top Dev Tools Startups

OpenAI logo with psychadelic color background

Top AI Startups

Truck with yellow remora carbon technology attached to it driving in the desert

Top Climate Startups

Revolut bank card swiping to pay

Top Banking Startups

Two ladies smiling in front of a front desk from "Almond"

Top Healthcare Startups

Sanctuary AI human-like robot

Top Robotics Startups

Painted ladies in SF

Top SF Startups

Lady with cup of coffee looking at vertical farming warehouse

Top NY Startups