PUBLIC RECORDS

Form 5500 data scraping service.

Employee benefit plan filings, compliance data, and plan details from DOL Form 5500. For retirement plan advisors, benefits consultants, and insurance brokers.

The problem.

WHY THIS IS HARDER THAN IT LOOKS

Retirement plan advisors, benefits consultants, and insurance brokers use Form 5500 data to prospect employers, benchmark plans, and identify compliance risks. The DOL publishes Form 5500 data, but the search tools are limited and the data requires significant processing to be useful for prospecting and analysis.

For retirement plan advisors, Form 5500 is the most reliable prospecting dataset that exists. Most employer-sponsored retirement plans must file annually with the DOL, and that filing discloses the plan's asset total, participant count, current service providers, and whether the plan had a late filing or compliance issue in the reporting year. An advisor prospecting for rollovers or plan conversions can filter Form 5500 data to find 401(k) plans in a target asset range within a geographic market, then identify which of those plans use a service provider the advisor can compete against on cost or service model. The DOL's own search interface is not built for this kind of prospecting workflow. The data I deliver is cleaned, normalized, and formatted so an advisor can work with it in a CRM or a spreadsheet without needing a data team to make it usable.

Compliance consultants use the same dataset differently. A plan that filed a late return, reported a loan default, or disclosed a prohibited transaction in a prior year is a signal that the plan sponsor may need help with compliance infrastructure. Filtering for plans with recent compliance disclosures in a target geography or industry lets consultants running compliance-focused practices find sponsors who already know they have a problem.

This service delivers structured, queryable Form 5500 data. Search by sponsor name, plan type, participant count, or asset range and receive clean records with plan details, service provider information, and filing history. The ScrapeBase API at scrapebase.io has Form 5500 endpoints for self-serve access.

Is this right for you?

GOOD FIT IF ANY OF THESE SOUND LIKE YOU

You are a retirement plan advisor prospecting employers by plan size and asset level

You are a benefits consultant benchmarking client plans against industry peers

You are an insurance broker targeting self-funded health plans for stop-loss coverage

You need Form 5500 data in a structured, queryable format rather than raw DOL search results

You are a compliance consultant prospecting plan sponsors with recent late filings or compliance disclosures

What you receive.

EXACT FIELDS, DELIVERED IN YOUR FORMAT

Field               Type    Description
plan_name           string  Official plan name from the filing.
sponsor_name        string  Plan sponsor (employer).
ein                 string  Employer Identification Number.
plan_type           string  Defined Contribution, Defined Benefit, Health, etc.
total_participants  number  Active participants in the plan.
total_assets        number  Total plan assets.
plan_year           number  Reporting year.
administrator       string  Plan administrator.

Sample record.

form-5500.sample.json
{
  "plan_name": "Walmart 401(k) Plan",
  "sponsor_name": "Walmart Inc.",
  "ein": "71-0415188",
  "plan_type": "Defined Contribution",
  "total_participants": 1480000,
  "total_assets": 28400000000,
  "plan_year": 2024,
  "administrator": "Merrill Lynch",
  "extracted_at": "2026-04-14T10:00:00Z"
}
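As a rough illustration of working with a delivered record, the sketch below loads the sample above into a typed Python structure and derives assets per participant. The `PlanRecord` class and `parse_plan_record` helper are hypothetical names chosen for this example; they are not part of any published client library.

```python
import json
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class PlanRecord:
    """Hypothetical container mirroring the fields in the sample record."""
    plan_name: str
    sponsor_name: str
    ein: str
    plan_type: str
    total_participants: int
    total_assets: int
    plan_year: int
    administrator: str
    extracted_at: datetime


def parse_plan_record(raw: str) -> PlanRecord:
    """Parse one JSON record; raises TypeError if a field is missing or extra."""
    data = json.loads(raw)
    # The timestamp ends in "Z"; swap it for an explicit UTC offset so that
    # datetime.fromisoformat also accepts it on Python versions before 3.11.
    data["extracted_at"] = datetime.fromisoformat(
        data["extracted_at"].replace("Z", "+00:00")
    )
    return PlanRecord(**data)


SAMPLE = """
{
  "plan_name": "Walmart 401(k) Plan",
  "sponsor_name": "Walmart Inc.",
  "ein": "71-0415188",
  "plan_type": "Defined Contribution",
  "total_participants": 1480000,
  "total_assets": 28400000000,
  "plan_year": 2024,
  "administrator": "Merrill Lynch",
  "extracted_at": "2026-04-14T10:00:00Z"
}
"""

record = parse_plan_record(SAMPLE)
assets_per_participant = record.total_assets / record.total_participants
print(f"{record.sponsor_name}: ${assets_per_participant:,.0f} per participant")
```

Assets per participant is a common first-pass benchmarking figure; the same pattern extends to any ratio built from the numeric fields.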

Straightforward pricing.

SCALE DETERMINES PRICE · NO HIDDEN FEES

Plan search

from $199

One-time search by sponsor, size, or plan type. 2-4 days.

  • Up to 5,000 plans
  • Full filing data
  • CSV or Google Sheet
Get a quote →

Prospecting pipeline

from $499/mo

Monthly refresh with new filing detection.

  • Filtered to your targets
  • New plan alerts
  • CRM write-back
Get a quote →

Custom

Custom

Historical analysis, service provider mapping, benchmarking.

  • Full DOL dataset
  • Custom classification
  • Scoping call required
Get a quote →

Frequently asked questions.

EVERYTHING YOU NEED TO KNOW

What can I search by?

Sponsor name, EIN, plan type, participant count range, asset range, state, and filing year. The most common search is 401(k) plans above a certain asset threshold for prospecting. For advisors focused on a specific geography, filtering by state and asset range returns a manageable list of target plans that can be loaded directly into a CRM for outreach sequencing.
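For anyone filtering a delivered file themselves, a minimal sketch of that kind of search might look like the following. It assumes records are plain dictionaries keyed by the documented field names; the `filter_plans` function and the "Example Co" rows are illustrative placeholders, not real filings.

```python
from typing import Iterable, Optional


def filter_plans(
    plans: Iterable[dict],
    *,
    plan_type: Optional[str] = None,
    min_assets: Optional[int] = None,
    max_assets: Optional[int] = None,
    min_participants: Optional[int] = None,
    plan_year: Optional[int] = None,
) -> list[dict]:
    """Return plans matching every supplied criterion, largest assets first."""
    matches = []
    for plan in plans:
        if plan_type is not None and plan["plan_type"] != plan_type:
            continue
        if min_assets is not None and plan["total_assets"] < min_assets:
            continue
        if max_assets is not None and plan["total_assets"] > max_assets:
            continue
        if min_participants is not None and plan["total_participants"] < min_participants:
            continue
        if plan_year is not None and plan["plan_year"] != plan_year:
            continue
        matches.append(plan)
    return sorted(matches, key=lambda p: p["total_assets"], reverse=True)


# Placeholder rows for illustration only.
PLANS = [
    {"sponsor_name": "Example Co A", "plan_type": "Defined Contribution",
     "total_participants": 320, "total_assets": 18_500_000, "plan_year": 2024},
    {"sponsor_name": "Example Co B", "plan_type": "Defined Contribution",
     "total_participants": 45, "total_assets": 2_100_000, "plan_year": 2024},
    {"sponsor_name": "Example Co C", "plan_type": "Defined Benefit",
     "total_participants": 900, "total_assets": 60_000_000, "plan_year": 2024},
    {"sponsor_name": "Example Co D", "plan_type": "Defined Contribution",
     "total_participants": 1200, "total_assets": 95_000_000, "plan_year": 2023},
]

targets = filter_plans(
    PLANS,
    plan_type="Defined Contribution",
    min_assets=10_000_000,
    max_assets=100_000_000,
    plan_year=2024,
)
for plan in targets:
    print(plan["sponsor_name"], plan["total_assets"])
```

Every criterion is optional, so the same function covers a broad pull (no arguments) and a narrow prospect list.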

Can you identify self-funded health plans?

Yes. Self-funded health plans are identifiable through Form 5500 filings and are a common target for stop-loss insurance brokers and TPA service providers. The data includes participant count and plan year, which allows brokers to segment self-funded plans by employee count and identify prospects that match their underwriting appetite for stop-loss or administrative services.
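The segmentation step can be sketched in a few lines. This example assumes the input has already been narrowed to self-funded health plans; the band edges and labels are arbitrary values picked for illustration, and a broker would substitute their own underwriting tiers.

```python
from bisect import bisect_right
from collections import defaultdict

# Hypothetical participant-count bands; adjust to your own appetite.
BAND_EDGES = [100, 500, 1000, 5000]
BAND_LABELS = ["<100", "100-499", "500-999", "1,000-4,999", "5,000+"]


def participant_band(count: int) -> str:
    """Map a participant count to its band label."""
    return BAND_LABELS[bisect_right(BAND_EDGES, count)]


def segment_by_participants(plans: list[dict]) -> dict[str, list[dict]]:
    """Group plans by participant band."""
    segments: dict[str, list[dict]] = defaultdict(list)
    for plan in plans:
        segments[participant_band(plan["total_participants"])].append(plan)
    return dict(segments)


# Placeholder rows for illustration only.
HEALTH_PLANS = [
    {"sponsor_name": "Example Co A", "plan_type": "Health", "total_participants": 80},
    {"sponsor_name": "Example Co B", "plan_type": "Health", "total_participants": 250},
    {"sponsor_name": "Example Co C", "plan_type": "Health", "total_participants": 499},
    {"sponsor_name": "Example Co D", "plan_type": "Health", "total_participants": 5000},
]

segments = segment_by_participants(HEALTH_PLANS)
for label, plans in segments.items():
    print(label, [p["sponsor_name"] for p in plans])
```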

How current is the data?

The DOL publishes Form 5500 data on a rolling basis. The pipeline processes new filings as they become available. For monthly monitoring, new filings are captured each cycle.
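One simple way to think about new filing detection is as a set difference between two refresh cycles. The sketch below is an assumption about how such a comparison could be done on delivered records, not a description of the pipeline's internals; in particular, keying a filing on `(ein, plan_name, plan_year)` is a choice made for this example.

```python
def filing_key(record: dict) -> tuple:
    """Identify a filing by sponsor EIN, plan name, and reporting year (assumed key)."""
    return (record["ein"], record["plan_name"], record["plan_year"])


def detect_new_filings(previous: list[dict], current: list[dict]) -> list[dict]:
    """Return records in the current cycle that were absent from the previous one."""
    seen = {filing_key(r) for r in previous}
    return [r for r in current if filing_key(r) not in seen]


# Placeholder rows for illustration only.
last_cycle = [
    {"ein": "00-0000001", "plan_name": "Example Co A 401(k) Plan", "plan_year": 2023},
]
this_cycle = [
    {"ein": "00-0000001", "plan_name": "Example Co A 401(k) Plan", "plan_year": 2023},
    {"ein": "00-0000001", "plan_name": "Example Co A 401(k) Plan", "plan_year": 2024},
    {"ein": "00-0000002", "plan_name": "Example Co B 401(k) Plan", "plan_year": 2024},
]

new_filings = detect_new_filings(last_cycle, this_cycle)
for filing in new_filings:
    print(filing["plan_name"], filing["plan_year"])
```

Including the plan year in the key means a sponsor's next annual filing counts as new, which is usually what an alert should surface.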

Does the data include current service providers?

Yes. Form 5500 Schedule C discloses service providers receiving fees above a reporting threshold, which typically includes the recordkeeper, third-party administrator, investment advisor, and auditor. The pipeline extracts this service provider data so advisors can filter plans by current provider relationship.

Can you pull multiple years of filings?

Yes. The historical analysis tier pulls multiple years of filings for a plan or a set of plans, showing participant count growth, asset growth, and changes in service providers over time. This is used by advisors to identify plans that have grown into a higher service tier or recently changed recordkeepers.
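To make the year-over-year comparison concrete, here is a minimal sketch that walks one plan's filings in order and reports the change between consecutive years. The `yearly_changes` function is hypothetical, the sample rows are placeholders, and the `administrator` field stands in for a provider change only because it is the provider field shown in the sample record.

```python
def yearly_changes(filings: list[dict]) -> list[dict]:
    """Compare each filing with the prior year's filing for the same plan."""
    ordered = sorted(filings, key=lambda f: f["plan_year"])
    changes = []
    for prev, curr in zip(ordered, ordered[1:]):
        if prev["total_assets"]:
            growth = round(
                (curr["total_assets"] - prev["total_assets"])
                / prev["total_assets"] * 100,
                1,
            )
        else:
            growth = None  # growth is undefined when the prior year reported zero assets
        changes.append({
            "plan_year": curr["plan_year"],
            "asset_growth_pct": growth,
            "participant_change": curr["total_participants"] - prev["total_participants"],
            "administrator_changed": curr["administrator"] != prev["administrator"],
        })
    return changes


# Placeholder rows for illustration only.
HISTORY = [
    {"plan_year": 2023, "total_assets": 12_000_000, "total_participants": 240,
     "administrator": "Provider X"},
    {"plan_year": 2022, "total_assets": 10_000_000, "total_participants": 200,
     "administrator": "Provider X"},
    {"plan_year": 2024, "total_assets": 15_000_000, "total_participants": 310,
     "administrator": "Provider Y"},
]

changes = yearly_changes(HISTORY)
for row in changes:
    print(row)
```

Sorting by `plan_year` first means the input does not need to arrive in chronological order.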

How much does it cost?

Plan search starts at $199 for up to 5,000 plans. Prospecting pipeline starts at $499 per month.

Ready to get Form 5500 data?

Book a 30-minute call and I’ll scope it live.