LEAD GEN

Wellfound Startup Database

Startup profiles with funding rounds, team size, tech stack, and hiring status. Built for a VC firm tracking early-stage companies across specific verticals.

30K startupsPostgreSQL + Weekly Diff Reports (CSV)Ongoing weekly

The challenge.

A seed-stage VC firm needed to systematically track early-stage startups across fintech, healthtech, and climate verticals. Wellfound (formerly AngelList) has the best startup data, but no bulk export and aggressive rate limiting. The firm's analysts were manually reviewing profiles one by one.

The approach.

01

Scrapy Crawl Architecture

Built a Scrapy spider with depth-first crawling through Wellfound's category and location filters. Used rotating datacenter proxies with per-request delays calibrated to stay under rate limit thresholds.

02

Funding & Team Extraction

Parsed startup profiles for founding date, funding stage, total raised, last round size, team headcount, key hires, and active job postings. Captured investor names and board members when listed.

03

Tech Stack & Signal Analysis

Extracted listed tech stacks and job posting requirements to infer actual technology usage. Built a scoring model that flags startups showing growth signals: rapid hiring, new funding, tech stack expansion.

04

VC-Ready Database

Stored everything in PostgreSQL with weekly diff reports highlighting new startups, funding events, and team changes. Built custom views for each target vertical with configurable alert thresholds.

Sample output.

wellfound-startup-database.json
{
  "company_name": "Optera Climate",
  "vertical": "Climate Tech",
  "founded_year": 2023,
  "funding_stage": "Seed",
  "total_raised_usd": 3200000,
  "team_size": 18
}

The results.

30K

Startups tracked

3

Target verticals

840

Funding events captured

12

Investments sourced from data

Tech stack.

PythonScrapyPostgreSQLVC ResearchWeekly Updates

Ready to get your data?

Book a 30-minute call and I’ll scope your project live. No commitment required.

Or reach out directly:

hello@sidb.work