1. WO2020068332 - VOCAL TRIGGERING OF PRESENTATION TRANSITIONS

Publication Number WO/2020/068332
Publication Date 02.04.2020
International Application No. PCT/US2019/048243
International Filing Date 27.08.2019
IPC
G06F 3/16 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
3 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
16 Sound input; Sound output
G10L 15/00 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
CPC
G06F 16/3331
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
16 Information retrieval; Database structures therefor; File system structures therefor
30 of unstructured textual data
33 Querying
3331 Query processing
G06F 3/167
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
3 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
16 Sound input; Sound output
167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
G10L 15/05
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
04 Segmentation; Word boundary detection
05 Word boundary detection
G10L 15/083
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
08 Speech classification or search
083 Recognition networks
G10L 15/22
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
22 Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 2015/088
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
08 Speech classification or search
088 Word spotting
Applicants
  • DISH NETWORK L.L.C. [US]/[US]
Inventors
  • MESHRAM, Shruti
Agents
  • SAAB, Karam
  • GRAY, Charles
  • SWEHLA, Aaron
  • SHERWINTER, Daniel J.
  • KILPATRICK TOWNSEND & STOCKTON LLP
Priority Data
16/146,270 28.09.2018 US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) VOCAL TRIGGERING OF PRESENTATION TRANSITIONS
(FR) DÉCLENCHEMENT VOCAL DE TRANSITIONS DE PRÉSENTATION
Abstract
(EN)
Various arrangements for triggering transitions within a slide-based presentation are presented. An audio-based trigger system receives a plurality of trigger words. A database is created that maps trigger words to slide transitions. A voice-based request is received to initiate audio control of the slide-based presentation being output by a presentation system. An audio stream is monitored for trigger words. When a trigger word is recognized, the database is accessed to identify the slide transition to be performed. A slide transition request is transmitted to the presentation system that indicates the slide to which a transition should occur. The presentation system transitions to the slide based on the received slide transition request. A training process for speaker recognition discriminates the voice of the presenter from the voices of the audience, so that only the presenter's voice in the audio stream is monitored for the trigger words.
(FR)
L'invention concerne diverses dispositions visant à déclencher des transitions à l'intérieur d'une présentation basée sur des diapositives. Un système déclencheur sur base audio reçoit une pluralité de mots déclencheurs. Une base de données est créée qui associe des mots déclencheurs à des transitions de diapositives. Une demande vocale est reçue pour amorcer la commande audio de la présentation basée sur des diapositives qui est délivrée par le système de présentation. Un flux audio est surveillé à la recherche de mots déclencheurs. En fonction d'un accès à une base de données, une transition de diapositive à effectuer est identifiée d'après un mot déclencheur reconnu. Une demande de transition de diapositive est transmise à un système de présentation, indiquant une diapositive vers laquelle une transition doit avoir lieu. Le système de présentation effectue la transition vers la diapositive d'après la demande reçue de transition de diapositive. L'invention comprend un processus d'apprentissage pour la reconnaissance du locuteur, afin de distinguer la voix du présentateur de la voix de l'auditoire pour surveiller uniquement la voix du présentateur dans le flux audio à la recherche des mots déclencheurs.
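The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the applicant's implementation. It assumes that speech recognition and speaker identification have already produced labeled text segments, and every name in it (`SlideTransitionRequest`, `build_trigger_database`, `monitor_stream`, the `start_phrase` default) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, Tuple


@dataclass(frozen=True)
class SlideTransitionRequest:
    """Hypothetical message sent to the presentation system."""
    target_slide: int
    trigger_word: str


def build_trigger_database(pairs: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Map each trigger word (lowercased) to the slide it should transition to."""
    return {word.lower(): slide for word, slide in pairs}


def monitor_stream(
    segments: Iterable[Tuple[str, str]],
    trigger_db: Dict[str, int],
    presenter_id: str,
    start_phrase: str = "start audio control",  # assumed activation phrase
) -> Iterator[SlideTransitionRequest]:
    """Yield one transition request per trigger word spoken by the presenter.

    `segments` stands in for a recognizer with speaker labels (an assumption):
    each item is (speaker_id, recognized_text).
    """
    active = False
    for speaker, text in segments:
        if speaker != presenter_id:
            continue  # only the presenter's voice is monitored
        normalized = text.lower()
        if not active:
            # Wait for the voice-based request that initiates audio control.
            active = start_phrase in normalized
            continue
        for word in normalized.split():
            word = word.strip(".,!?;:")
            if word in trigger_db:
                yield SlideTransitionRequest(trigger_db[word], word)


db = build_trigger_database([("Revenue", 3), ("roadmap", 5)])
segments = [
    ("presenter", "Revenue is up"),               # ignored: control not yet active
    ("presenter", "Please start audio control"),  # activates monitoring
    ("audience", "What about the roadmap"),       # ignored: not the presenter
    ("presenter", "Let's look at revenue, then the roadmap."),
]
requests = list(monitor_stream(segments, db, "presenter"))
```

In this sketch the trigger database is a plain dictionary and the transition request is a small data object. A real system would replace the labeled segments with live audio processing and would transmit the request to the presentation system rather than yielding it.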