Asset Manager

Updated:

Protege

Protege connects proprietary real-world data to AI labs — a16z backed its $30M Series A extension to build infrastructure that turns hospital records and…

Protege logo

Protege

Be Heard. Get Discovered. See our official page at www.linkedin.com/company/join-protege

General information

Firm type

Asset Manager

Year founded

AUM

Undisclosed

Location

Region

North America

Country

United States

City

New York

Corporate office

New York, NY, United States

Principals

Bobby Samuels

Co-Founder & CEO

Travis May

Co-Founder & Chairman

Engy Ziedan

Co-Founder & Chief Scientific Officer

Richard Ho

Co-Founder & CTO

Sector focus

AI/MLDigital HealthEnterprise SoftwareMedia & Entertainment

Frequently asked questions

Who runs Protege, and what is their background?

Protege was co-founded by Bobby Samuels (CEO), Travis May (Chairman), Engy Ziedan (Chief Scientific Officer), and Richard Ho (CTO). The firm describes its leadership as combining experienced startup operators with depth across both data engineering and scientific research, though specific prior career details are not publicly detailed on the firm's materials.

How does Protege source the data it provides to AI model builders?

Protege does not scrape data from the open web. It negotiates directly with data holders across industries — including hospitals, media archives, and industrial operators — and then curates, de-identifies, and structures that proprietary material into AI-ready datasets. The firm acts as an intermediary that aggregates supply from hundreds of sources and delivers it in formats tailored to each stage of model development.

Does Protege simply license data, or does it also prepare and structure it for specific model training stages?

Protege is not a raw-data broker. Its 'DataLab' team provides domain-specific curation, de-identification, and quality checks so that datasets arrive matched to a builder's specific use case — pre-training corpora, supervised fine-tuning examples, or uncontaminated evaluation benchmarks — rather than as generic data feeds.

What are the primary industry verticals Protege covers?

The firm's public materials emphasize healthcare proprietary data, video archives, audio libraries, motion capture, and what it refers to as 'agentic data.' Healthcare appears to be the most developed vertical, with multiple announced evaluation benchmarks and partner collaborations in clinical documentation and medical billing.

Is Protege a venture-backed startup or a traditional family office investment vehicle?

Protege is a venture-backed operating company, not a family office. It has raised equity financing rounds including a $25 million Series A closed in August 2025 and a $30 million Series A extension led by Andreessen Horowitz announced in January 2026.

Profile maintained by using OSINT (open-source intelligence), regulatory filings, licensed data partners, and verified direct submissions. Read the methodology. Last updated: . Continuous refresh with full update cycles at least every 30 days.

Need institutional-grade insight on asset managers?

Altss delivers:

Principals with verified direct contactsAllocation history by asset classOSINT-derived deal signals
Book a demo

Prefer a guided tour?

We’ll walk you through:

Interactive funding timelinesCustom mandate & allocation filters
Book a demo

More New York Asset Manager profiles