24.12.2012 Views

Thoracic Imaging 2003 - Society of Thoracic Radiology

Thoracic Imaging 2003 - Society of Thoracic Radiology

Thoracic Imaging 2003 - Society of Thoracic Radiology

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Voice Recognition<br />

Theresa C. McLoud, M.D.<br />

Associate Radiologist-in-Chief, Director <strong>of</strong> Education; <strong>Thoracic</strong> Radiologist, Massachusetts General Hospital<br />

Pr<strong>of</strong>essor <strong>of</strong> <strong>Radiology</strong>, Harvard Medical School, Boston, Massachusetts<br />

Voice recognition technology allows radiologists to achieve<br />

efficiency goals and become more competitive in the digital<br />

environment. It provides a link that can improve the speed <strong>of</strong><br />

communication between radiologists and their referring physicians.<br />

Speech recognition systems were introduced into main<br />

stream large radiology departments in the United States in the<br />

mid 1990’s. Most systems require limited amounts <strong>of</strong> learning<br />

and adaptation compared to other transcription systems and<br />

methods. Most s<strong>of</strong>tware packages allow easy integration into<br />

existing radiology and hospital information systems. The major<br />

disadvantage <strong>of</strong> voice recognition comes from radiologists’<br />

resistance to change and fear <strong>of</strong> technology. In addition, voice<br />

recognition does require more radiologist time and secretarial<br />

skills. However, the benefits to the hospital, referring clinicians<br />

and patients, are impressive because radiology reports are<br />

immediately available on the radiology and hospital information<br />

systems.<br />

Voice recognition s<strong>of</strong>tware at the human interface level comprises<br />

four core technologies; 1) the recognition <strong>of</strong> spoken<br />

human speech, 2) the synthesis <strong>of</strong> the spoken speech into readable<br />

characters, 3) the identification <strong>of</strong> the speaker and author<br />

verification, and 3) the understanding <strong>of</strong> the recognized word.<br />

These technologies are <strong>of</strong>ten referred to as speech recognition<br />

or speech-to-text; speech synthesis or text-to-speech; speaker<br />

identification and verification; and natural language understanding.<br />

Most voice recognition systems have certain hardware<br />

requirements. These include specific processor and speeds, usually<br />

a Pentium 200 MHz chip as well as an operating system<br />

usually a Windows NT platform. In addition to individual<br />

workstations, the system requires integration into a network.<br />

The network allows integration into the radiology information<br />

system and ultimately into PACS and the hospital information<br />

system. Most systems also require the use <strong>of</strong> high quality<br />

microphones and sound cards to achieve high accuracy rates.<br />

Integration <strong>of</strong> a voice recognition system into a radiology<br />

department is not a trivial process. Most systems <strong>of</strong>fer several<br />

key features. These include an RIS interface, i.e. an interface<br />

with links to the radiology information system, the hospital<br />

information system, PACS and the billing system. The second<br />

component consists <strong>of</strong> standardized reports. These allow radiologists<br />

to create pre-defined reports for individual radiologists or<br />

the institution. The third components are templates/macros.<br />

Most systems allow the creation <strong>of</strong> not only standard reports but<br />

customizable templates and fields. The template capability<br />

gives the radiologist the flexibility to create a form with blank<br />

areas that vary with each dictation. Feature filled macros allow<br />

standard phrases or components <strong>of</strong> report to be easily inserted in<br />

the text. These functions allow greater efficiency. The fourth<br />

component is customizable fields. Most packages permit customized<br />

definitions by the institution and multiple fields associated<br />

with the report. These fields may include ICD-9, CPT, BI-<br />

RADS, ACR /NEMA codes or ACR pathology identifiers. The<br />

fifth component includes bar code interfaces. Most voice recognition<br />

s<strong>of</strong>tware packages support the use <strong>of</strong> a bar code laser<br />

reader or integrated microphone laser reader into the system.<br />

The final component is a system for security.<br />

The most important feature <strong>of</strong> speech recognition is report<br />

turnaround. In our own institution, the previously utilized dictation<br />

system required an average <strong>of</strong> over three days to complete.<br />

This included transcription, review by the radiologist and<br />

the resident. The introduction <strong>of</strong> voice recognition eliminates<br />

the transcription and correction steps and a report by a staff<br />

radiologist becomes immediately finalized after being dictated<br />

and edited. The turnaround time in our own department has<br />

now dropped to 0.4 days.<br />

REFERENCES:<br />

Mehta A. Voice Recognition. In: PACS. A Guide to the Digital<br />

Revolution.<br />

Eds: Dreyer KJ, Mehta A, Thrall JH. Springer-Verlag New York,<br />

Inc., 2002.<br />

Chapter 11, pages 281-302.<br />

179<br />

TUESDAY

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!