Electronic Technology:OCR, TTS, and STT

Optical Character Recognition (OCR), Text to Speech (TTS), and Speech Recognition Software (Speech-to-Text, STT)

These are three technologies that can be used to manipulate the printed word to make it more accessible to the visually impaired and the blind.

Optical Character Recognition

This technology is the combination of a software program with either an camera or a  scanner. The captured image or scan of text presented to the software appears  like a bunch of lines and curves.  The software recognizes these characters and  reformates the text into a computer language, and then presents it as the letters, numbers, symbols, and graphs that can now be recognized and manipulated by the user.  

Once the text is converted by the OCR, with the help of additional software, the text can now be magnified, edited, exported, or transferred to another format, like text-to-speech.

Text-to-Speech (also known as Speech Synthesis)

This is not a new technology. It has been available  to the blind, visually impaired, and  those who are dyslexic  as  the reading machines.  These are text-to-speech hardware specifically designed to give those who are print disabled access to the printed word.  The print is captured by either a camera or a scanner, and then is read out loud. 

Sara:Scanning and Reading Appliance
Sara:Scanning and Reading Appliance

 

 

Eye Pal Ace. Portable scanner and reader
Eye Pal Ace. Portable scanner and reader
The kurzweil National Federation of the Blind Portable Reader
The kurzweil National Federation of the Blind Portable Reader

Software programs for computers and mobile technologies are familiar to us as ‘screen readers.’ This is often included as part of a computer or smartphone accessibility features.

TTS software can also be purchased and downloaded as part of an OCR/TTS software package. Is there an app for that? Of course there is.  There are smartphone apps that you can snap a picture of some text, then activate the reader to have it read to you.

 

Speech-to-Text ( also known as Speech or Voice Recognition)

This is the talk-to-text software feature of computers and mobile devices which allows the user to bypass the keyboard and mouse. It serves two functions:

  1. Instruct the computer to perform tasks by voice commands, such as “Start…”, “Scroll…”, “Apps…” etc. There is a defined set of commands for each operating system, which requires a little training.
  2. This software will takes dictation. It can convert dictated words into word processing documents, emails, and fill in online forms. The spoken words appear on the screen, which can be saved, edited, or exported.

    Dragon Naturally Speaking, best known third party STT software.
    Dragon Naturally Speaking, best known third party STT software.

Most operating systems include speech recognition software as part of their suite of accessibility features. If you are someone who needs a workhorse  dictation program for work or creative writing, there are voice recognition software programs that can be purchased. While operating system programs can get the job done, third-party voice recognition software tends to be faster and more accurate.  That means that when proofreading your document, there should be fewer errors.  If you have used the microphone on your smartphone to dictate a text messages, you know that there are frequently some laughable word errors.

Leave a Reply

Your email address will not be published. Required fields are marked *