Updated:
Protege
Protege connects proprietary real-world data to AI labs — a16z backed its $30M Series A extension to build infrastructure that turns hospital records and…
Protege
Be Heard. Get Discovered. See our official page at www.linkedin.com/company/join-protege
General information
Firm type
Asset Manager
Year founded
—
AUM
Undisclosed
Location
Region
North America
Country
United States
City
New York
Corporate office
New York, NY, United States
Principals
Bobby Samuels
Co-Founder & CEO
Travis May
Co-Founder & Chairman
Engy Ziedan
Co-Founder & Chief Scientific Officer
Richard Ho
Co-Founder & CTO
Sector focus
Frequently asked questions
Who runs Protege, and what is their background?
Protege was co-founded by Bobby Samuels (CEO), Travis May (Chairman), Engy Ziedan (Chief Scientific Officer), and Richard Ho (CTO). The firm describes its leadership as combining experienced startup operators with depth across both data engineering and scientific research, though specific prior career details are not publicly detailed on the firm's materials.
How does Protege source the data it provides to AI model builders?
Protege does not scrape data from the open web. It negotiates directly with data holders across industries — including hospitals, media archives, and industrial operators — and then curates, de-identifies, and structures that proprietary material into AI-ready datasets. The firm acts as an intermediary that aggregates supply from hundreds of sources and delivers it in formats tailored to each stage of model development.
Does Protege simply license data, or does it also prepare and structure it for specific model training stages?
Protege is not a raw-data broker. Its 'DataLab' team provides domain-specific curation, de-identification, and quality checks so that datasets arrive matched to a builder's specific use case — pre-training corpora, supervised fine-tuning examples, or uncontaminated evaluation benchmarks — rather than as generic data feeds.
What are the primary industry verticals Protege covers?
The firm's public materials emphasize healthcare proprietary data, video archives, audio libraries, motion capture, and what it refers to as 'agentic data.' Healthcare appears to be the most developed vertical, with multiple announced evaluation benchmarks and partner collaborations in clinical documentation and medical billing.
Is Protege a venture-backed startup or a traditional family office investment vehicle?
Protege is a venture-backed operating company, not a family office. It has raised equity financing rounds including a $25 million Series A closed in August 2025 and a $30 million Series A extension led by Andreessen Horowitz announced in January 2026.
Profile maintained by Altss using OSINT (open-source intelligence), regulatory filings, licensed data partners, and verified direct submissions. Read the methodology. Last updated: . Continuous refresh with full update cycles at least every 30 days.
Need institutional-grade insight on asset managers?
Altss delivers:
Prefer a guided tour?
We’ll walk you through: