WO1996003741 - SYSTEM AND METHOD FOR FACILITATING SPEECH TRANSCRIPTION

Publication Number WO/1996/003741
Publication Date 08.02.1996
International Application No. PCT/US1995/009130
International Filing Date 19.07.1995
Chapter 2 Demand Filed 05.02.1996
IPC
G10L 15/02 2006.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
02 Feature extraction for speech recognition; Selection of recognition unit
G10L 15/05 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
04 Segmentation; Word boundary detection
05 Word boundary detection
G10L 15/26 2006.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
26 Speech to text systems
CPC
G10L 15/02
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
02 Feature extraction for speech recognition; Selection of recognition unit
G10L 15/05
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
04 Segmentation; Word boundary detection
05 Word boundary detection
G10L 2015/025
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
02 Feature extraction for speech recognition; Selection of recognition unit
025 Phonemes, fenemes or fenones being the recognition units
G10L 2015/228
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
22 Procedures used during a speech recognition process, e.g. man-machine dialogue
226 Taking into account non-speech characteristics
228 of application context
Applicants
  • INTERNATIONAL META SYSTEMS, INC. [US/US]; 6th floor 100 North Sepulveda Boulevard El Segundo, CA 90245, US
Inventors
  • PFISTER, Henry, L.; US
  • SMITH, George, W.; US
  • TSUCHIYA, Masahiro; US
Agents
  • STEFFIN, William, C.; Lyon & Lyon First Interstate World Center Suite 4700 633 West Fifth Street Los Angeles, CA 90071-2066, US
Priority Data
08/278,266 21.07.1994 US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) SYSTEM AND METHOD FOR FACILITATING SPEECH TRANSCRIPTION
(FR) SYSTEME ET PROCEDES FACILITANT LA TRANSCRIPTION DE LA PAROLE
Abstract
(EN)
The invention provides a system and method for facilitating speech transcription which accepts continuous speech from any of a variety of conventional devices capable of converting spoken words to electromagnetic signals, including microphones or telephones and, if the input signal is analog, converts the input signal from analog to digital format. The digitized signal is then processed in the time and frequency domains to extract spectral speech features which are used to match the input speech with associated phonemes. According to the invention, possible word choices may be extrapolated from the associated phonemes, and visually displayed in textual representations. The visually displayed text may then be edited and processed into final form. Systems and methods for facilitating the invention are also disclosed.
(FR)
Un système et un procédé facilitant la transcription de la parole, dans lesquels sont acceptés des signaux vocaux continus provenant d'un quelconque dispositif classique parmi une pluralité de dispositifs classiques capables de convertir des mots énoncés en signaux électromagnétiques, microphones et téléphones compris, et qui, si le signal d'entrée est analogique, convertissent le signal d'entrée analogique en format numérique. Le signal numérisé est ensuite traité dans les domaines temporel et fréquentiel pour extraire les caractéristiques vocales spectrales qui sont utilisées pour faire correspondre les signaux vocaux d'entrée aux phonèmes associés. Selon cette invention, des choix de mots possibles peuvent être extrapolés à partir des phonèmes associés et affichés visuellement dans des représentations textuelles. Le texte affiché visuellement peut ensuite être mis en forme et traité sous sa forme finale. Des systèmes et des procédés de mise en application de l'invention sont également décrits.
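The processing chain summarized in the abstract (digitized signal, time- and frequency-domain processing, spectral features, phoneme matching) can be illustrated with a toy sketch. This is not the method disclosed in the application: the framing parameters, the Hann window, the naive DFT, the nearest-template matching rule, and the labels "/a/" and "/i/" are all assumptions chosen for illustration, and the "phoneme" templates are synthetic pure tones rather than real acoustic data.

```python
import cmath
import math


def tone(freq_hz, n_samples, sample_rate):
    """Synthetic stand-in for a digitized input signal (hypothetical test data)."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n_samples)]


def frame_signal(samples, frame_len, hop):
    """Time domain: split the signal into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def spectral_features(frame):
    """Frequency domain: unit-normalized magnitude spectrum of a Hann-windowed frame."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    # Naive DFT over bins 0..n/2; adequate for a short illustrative frame.
    mags = [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(windowed)))
            for k in range(n // 2 + 1)]
    norm = math.sqrt(sum(m * m for m in mags)) or 1.0
    return [m / norm for m in mags]


def nearest_phoneme(features, templates):
    """Return the label whose template is closest in Euclidean distance."""
    return min(templates, key=lambda label: math.dist(features, templates[label]))


SAMPLE_RATE = 8000  # assumed sampling rate in Hz
FRAME_LEN = 64      # assumed frame length in samples
HOP = 32            # assumed hop between frame starts

# Hypothetical templates: one reference spectrum per placeholder label.
TEMPLATES = {
    "/a/": spectral_features(tone(500, FRAME_LEN, SAMPLE_RATE)),
    "/i/": spectral_features(tone(1500, FRAME_LEN, SAMPLE_RATE)),
}

if __name__ == "__main__":
    signal = tone(520, 256, SAMPLE_RATE)  # input close to the "/a/" template
    labels = [nearest_phoneme(spectral_features(f), TEMPLATES)
              for f in frame_signal(signal, FRAME_LEN, HOP)]
    print(labels)
```

A practical recognizer would use richer features and a statistical model rather than a single nearest-template rule; the sketch only shows how per-frame spectral features can drive a phoneme label decision.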