Speech to Text Report 
for 2020

Surveyed
2,744
active users

Represents
9
industries

Respondents from
47+
countries

Rev Compiled a Research Report on How Businesses Use Speech to Text Services

There are several types of text transcription services, from real time transcription to AI transcribed text and human-transcribed audio files. There are also several unique use cases for speech recognition technology, including voice commands, deep learning, call centers, and more. Learn more about how industries in 2020 use speech recognition.

Key Takeaways

1

The use of speech to text services is growing

As organizations continue to produce more content, there is a growing need for services that make content more digestible for internal and external audiences. Organizations convert audio to text to create this content.

54%

of respondents saw an increase in the use of speech to text services this past year

42%

of respondents expect the amount they spend on speech to text services to increase in the next 2-3 years

2

Speech to text services are becoming a critical part of respondents’ workflows

Users rely on speech to text services for many different reasons. These include sorting through hours of video, identifying specific people in audio with multiple speakers, or making content accessible to all customers.

of respondents say speech to text services are a critical part of their current workflow

of respondents have seen an increase in productivity by using speech to text services

3

The main benefits users are experiencing are time & cost savings

By off-loading these tasks to a vendor, users can focus on activities that are crucial to their business, which in turn saves them time and money.

79%

of respondents selected Time Savings as a benefit from using speech to text services

40%

of respondents expect the amount they spend on speech to text services to increase in the next 2-3 years

4

Speech to text users are more likely to use two or more services

An individual user can produce several types of audio and video content. Therefore, different users often need multiple services to meet different needs. These can include a combination of captioning, human transcription, and automatic transcription.

2/3 of respondents use more than one service
The most popular combination is human transcription & automatic transcription

Get the full report

For deeper insights into speech to text 
trends and user attitudes download the full report.