Globalme offers pre-packaged or custom-collected conversational data solutions to help power your conversational interfaces. Our pre-packaged phone conversation data sets include:
Need data in another language, accent, or dialect? No problem. We offer Conversational Data Collection in 35+ languages.
This sample data set contains 5 minutes of audio files in each language. The files are in the .WAV audio format with corresponding .JSON transcription files.
This data was initially collected to train a conversational Automatic Speech Recognition (ASR) system on phone call data. Participants held phone conversations with friends and family members through our custom SIP platform. Conversations range in length from 9 to 180 minutes, averaging 30 minutes each.
Transcription was done in timestamped segments by human transcribers without the assistance of ASR, and with a high emphasis placed on accuracy and quality.
This sample is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.