Marketplace

Off-the-shelf Datasets

Carries turnkey datasets across the highest-priority capability areas — giving labs and product teams immediate licensing access to expert-validated training data without the months-long lead time of a custom build.

American English Full-Duplex Conversational Dataset

Two-speaker American English conversations captured in full-duplex stereo, covering everyday topics with overlapping speech, backchannels, and natural disfluencies preserved.

Languages

English

Countries

United States

French Full-Duplex Conversational Dataset

Naturalistic French conversations between native speakers, captured in full-duplex stereo with overlapping speech and authentic turn-taking.

Languages

French

Countries

FranceBelgiumCanada

Mandarin Full-Duplex Conversational Dataset

Native-speaker Mandarin Chinese conversations recorded in full-duplex stereo across mainland and overseas dialect regions.

Languages

Chinese Mandarin

Countries

ChinaTaiwan

Spanish Full-Duplex Conversational Dataset

Two-speaker Spanish conversations spanning Latin American and European dialects, captured in stereo full-duplex with natural overlap.

Languages

Spanish

Countries

SpainMexicoColombiaArgentina

Vietnamese Full-Duplex Conversational Dataset

Native Vietnamese conversations captured in full-duplex stereo, with North-Central-South dialect coverage and natural turn-taking.

Languages

Vietnamese

Countries

Vietnam

Indonesian Full-Duplex Conversational Dataset

Bahasa Indonesia conversations between native speakers, captured in full-duplex stereo across Java, Sumatra, and Sulawesi.

Languages

Indonesian

Countries

Indonesia
Ready to bring AI into the real world?