Automated transcription services versus professional human writers

by JodeneAntoniou, September 25, 2017, in General posts, Transcription services. Tags: automated transcription services, professional transcriptionists, typing services, voice recognition

At Capital Captions, we always do our utmost to keep up to date with the best and newest innovations in the transcription world. We believe transcription, subtitling and translation require genuine skill and ability, and we therefore believe in the value of human writers. Whether you’re looking into subtitling, closed captioning, translation or transcription services, there’s too much involved in high quality transcription for software to keep up with. However, more and more companies are offering automated, voice-recognition-based alternatives to professional transcription services. We frequently publish information on why we think human transcriptions are always superior to automated transcription. Today, however, we’re going to set the record straight once and for all by putting both to the test.

Our Take on Why Professional Transcription Always Beats Voice Recognition Software

Audio Typist Listening Skills and Experience

Professional transcription requires a high level of listening ability and linguistic experience. Even the best audio typists can struggle to understand different accents without a certain level of experience. In contrast, it’s possible to ‘teach’ some voice recognition software packages to transcribe better through constant correction and user feedback. This can be a useful option for single speaker dictations. However, typical audio transcriptions contain multiple speakers, each with different accents, talking speeds and tones. Voice recognition software just isn’t as adaptable as real human audio typists.

Transcriptionist Writing Ability

Perfect grammar and punctuation can make the difference between a transcript that is flawlessly professional and one that is downright incoherent. Good grammar requires more than just following algorithms dictating that a full stop should be inserted after a long pause, or that a certain chain of words can be preceded by a colon. It requires a true understanding of what is being said. Normal speech is unpredictable and often grammatically incorrect. Therefore, it can’t be accurately represented through algorithmic conventions. Professional transcriptionists actively engage with the audio available to construct good sentences. They use intelligence, something which voice recognition will continue to lack for a very long time.
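As a rough illustration of the kind of pause-based rule described above, here is a minimal Python sketch. It is not taken from any real voice recognition product: the function name `punctuate_by_pause`, the 0.7 second threshold and the sample timings are all assumptions made for the example.

```python
# Hypothetical input: (word, pause_after_in_seconds) pairs from a recogniser.
def punctuate_by_pause(words, full_stop_pause=0.7):
    """Naive rule: end a sentence whenever the pause after a word is long."""
    sentences = []
    current = []
    for word, pause in words:
        current.append(word)
        if pause >= full_stop_pause:
            # Long pause: close the sentence, whether or not it is finished.
            sentences.append(" ".join(current).capitalize() + ".")
            current = []
    if current:
        # Flush any words left over at the end of the audio.
        sentences.append(" ".join(current).capitalize() + ".")
    return " ".join(sentences)


# A speaker hesitates mid-sentence after "results" (made-up timings).
sample = [("so", 0.1), ("the", 0.1), ("results", 0.9), ("were", 0.1), ("good", 0.2)]
print(punctuate_by_pause(sample))  # So the results. Were good.
```

The rule splits one sentence in two simply because the speaker hesitated, which is the sort of mistake a human typist would not make.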

Human vs Artificial Intelligence

Decision making is also an important aspect of professional audio transcription. Speakers use filler words, they mumble ahs and erms, and oftentimes they mispronounce or abbreviate things. Especially in medical or legal transcription, where abbreviations are common, voice recognition software may attempt to transcribe a word where in fact the speaker intends an abbreviation. For instance, FTSE 100 is often pronounced as ‘footsie’ 100 in financial transcription. An experienced financial transcriptionist would know to transcribe the abbreviation, whereas voice recognition would likely make a phonetic guess. Similarly, in intelligent verbatim transcription, typists will often decide to leave out excessive filler words such as ‘you know’ and ‘sort of’. For this reason, even perfect automated transcription services could only really be used in verbatim transcriptions.
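To make the intelligent verbatim point concrete, the following Python sketch applies two fixed rules: it swaps a spoken form for its written abbreviation and strips listed filler phrases. The lists `FILLERS` and `SPOKEN_TO_WRITTEN` and the function `intelligent_verbatim` are hypothetical and purely illustrative; a real house style would define its own.

```python
import re

# Illustrative lists only, assumed for this example.
FILLERS = ["you know", "sort of", "erm", "ah"]
SPOKEN_TO_WRITTEN = {"footsie": "FTSE"}


def intelligent_verbatim(text):
    # Replace spoken forms with their written abbreviation.
    for spoken, written in SPOKEN_TO_WRITTEN.items():
        text = re.sub(rf"\b{re.escape(spoken)}\b", written, text, flags=re.IGNORECASE)
    # Remove filler phrases along with any commas directly around them.
    for filler in FILLERS:
        text = re.sub(rf",?\s*\b{re.escape(filler)}\b,?", "", text, flags=re.IGNORECASE)
    # Collapse any leftover runs of whitespace.
    return re.sub(r"\s+", " ", text).strip()


print(intelligent_verbatim("The footsie, you know, sort of fell today"))
# The FTSE fell today
```

The same rule would also turn "What sort of account" into "What account", because it cannot tell a filler from a meaningful phrase. That judgement is exactly what a typist brings.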

The Struggles for Automated Transcription Services

The points above are just a few examples of what voice recognition software struggles with when it comes to transcription.

Voice recognition software will struggle with:

Poor quality audio for transcription
Foreign or regional accents
Technical abbreviations and jargon
Unusual names of people, places or companies
Audio recordings with multiple people speaking simultaneously (over speaking)
Identifying speakers
The use of grammar and punctuation
Whispers, shouts and other potential speech distortions, e.g. echoes
Complex transcript formats, templates and house styles
Client specifications around anonymising speaker names or highlighting key terms

Putting Automated Transcription to the Test

We have a video file which contains a brief summary of our audio transcription services. It also includes an outline of why professionally written transcription services are always superior to voice recognition and automated transcription. We’ve tried to include a few of the elements above to really put the software to the test. Below, the first PDF contains a professional audio transcript of the video. We created the second transcript using a well-known, respected voice recognition software brand (no naming names!). Watch the video, see what you think and, if you’re brave, share your thoughts using the comments section at the bottom of the page.

VIEW THE TRANSCRIPTS HERE
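For readers who would like a number to go with their impressions, one common way to compare an automated transcript against a professional one is word error rate: the count of word substitutions, insertions and deletions divided by the length of the reference transcript. The post itself does not use this measure. The Python sketch below is only an illustration, and the function name and sample sentences are assumptions.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # Dynamic-programming edit distance, keeping one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref) if ref else 0.0


# Made-up example: human transcript versus a phonetic machine guess.
print(word_error_rate("the FTSE fell today", "the footsie fell to day"))  # 0.75
```

A lower score means the automated transcript is closer to the professional one. Note that this simple version ignores punctuation quality and speaker labels, which matter just as much in a finished transcript.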

For more information on our transcription services, check out our transcription sectors. Alternatively, get your video subtitling, closed captioning, translation or transcription quote today!
