1. WO2022076062 - DYNAMIC EXPANSION OF ACRONYMS IN AUDIO CONTENT

Publication Number WO/2022/076062
Publication Date 14.04.2022
International Application No. PCT/US2021/043809
International Filing Date 30.07.2021
IPC
G10L 21/0364 2013.1
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02 Speech enhancement, e.g. noise reduction or echo cancellation
0316 by changing the amplitude
0364 for improving intelligibility
G06F 40/40 2020.1
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
40 Handling natural language data
40 Processing or translation of natural language
G10L 15/26 2006.1
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
26 Speech to text systems
CPC
G06F 40/242
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
40 Handling natural language data
20 Natural language analysis
237 Lexical tools
242 Dictionaries
G10L 15/197
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
08 Speech classification or search
18 using natural language modelling
183 using context dependencies, e.g. language models
19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
197 Probabilistic grammars, e.g. word n-grams
G10L 15/26
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
26 Speech to text systems
G10L 21/0364
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02 Speech enhancement, e.g. noise reduction or echo cancellation
0316 by changing the amplitude
0364 for improving intelligibility
Applicants
  • MICROSOFT TECHNOLOGY LICENSING, LLC [US]/[US]
Inventors
  • BATEMAN, Amelia
Agents
  • SWAIN, Cassandra T.
  • BARKER, Doug
  • CHATTERJEE, Aaron C.
  • CHEN, Wei-Chen Nicholas
  • CHOI, Daniel
  • CHURNA, Timothy
  • DINH, Phong
  • EVANS, Patrick
  • GABRYJELSKI, Henry
  • GUPTA, Anand
  • HWANG, William C.
  • JARDINE, John S.
  • LEE, Sunah
  • LEMMON, Marcus
  • MARQUIS, Thomas
  • MEYERS, Jessica
  • ROPER, Brandon
  • SPELLMAN, Steven
  • SULLIVAN, Kevin
  • WALKER, Matt
  • WIGHT, Stephen A.
  • WISDOM, Gregg
  • WONG, Ellen
  • WONG, Thomas S.
  • ZHANG, Hannah
  • AKHTER, Julia
  • KADOURA, Judy M.
  • NIU, Bo
  • OLANIRAN, Qudus
  • BROWN, Renee
  • TRAN, Kimberly
Priority Data
17/063,159 05.10.2020 US
Publication Language English (en)
Filing Language English (EN)
Designated States
Title
(EN) DYNAMIC EXPANSION OF ACRONYMS IN AUDIO CONTENT
(FR) EXTENSION DYNAMIQUE D’ACRONYMES DANS DU CONTENU AUDIO
Abstract
(EN) Examples of the present disclosure describe systems and methods for dynamically expanding acronyms in audio content. In aspects, a user access of an audio resource may be detected. The audio content of the audio resource may be evaluated to identify acronyms. One or more of the identified acronyms may be evaluated based on a user-specific context of the user and/or a global context associated with the user. Based on the evaluated context(s), expansion candidates and corresponding confidence scores may be determined for each identified acronym. Based on the confidence scores, an expansion candidate may be selected and used to replace the identified acronym when the audio content is consumed by the user.
(FR) Des exemples de la présente invention décrivent des systèmes et des procédés destinés à étendre dynamiquement des acronymes dans du contenu audio. Selon des aspects, un accès d’utilisateur d’une ressource audio peut être détecté. Le contenu audio de la ressource audio peut être évalué pour identifier des acronymes. Un ou plusieurs des acronymes identifiés peuvent être évalués sur la base d’un contexte spécifique de l’utilisateur et/ou d’un contexte global associé à l’utilisateur. Sur la base du ou des contextes évalués, des candidats d’extension et des scores de confiance correspondants peuvent être déterminés pour chaque acronyme identifié. Sur la base des scores de confiance, un candidat d’extension peut être sélectionné et utilisé pour remplacer l’acronyme identifié lorsque le contenu audio est consommé par l’utilisateur.
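The flow outlined in the abstract (identify acronyms, score expansion candidates against a user-specific and a global context, then substitute the best candidate) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the application: the sketch works on a text transcript rather than on audio, the candidate table, the keyword-overlap scoring, the 0.7 user weight, and the 0.6 threshold are all hypothetical, and the names `CANDIDATES`, `score_candidates`, and `expand_acronyms` are invented.

```python
import re

# Hypothetical candidate table: acronym -> {expansion: context keywords}.
# Invented for illustration; the application does not specify this structure.
CANDIDATES = {
    "PM": {
        "project manager": {"sprint", "roadmap", "team"},
        "prime minister": {"parliament", "election", "government"},
    },
}

# Assumption: an acronym is a run of two or more capital letters.
ACRONYM_RE = re.compile(r"\b[A-Z]{2,}\b")


def score_candidates(acronym, user_context, global_context, user_weight=0.7):
    """Return {expansion: confidence}; confidences are normalised to sum to 1."""
    raw = {}
    for expansion, keywords in CANDIDATES.get(acronym, {}).items():
        user_hits = len(keywords & user_context)
        global_hits = len(keywords & global_context)
        raw[expansion] = user_weight * user_hits + (1 - user_weight) * global_hits
    total = sum(raw.values())
    if total == 0:
        return {}  # no contextual evidence for any candidate
    return {expansion: value / total for expansion, value in raw.items()}


def expand_acronyms(transcript, user_context, global_context, threshold=0.6):
    """Replace each acronym whose best candidate meets the confidence threshold."""
    def replace(match):
        scores = score_candidates(match.group(0), user_context, global_context)
        if not scores:
            return match.group(0)  # leave the acronym unexpanded
        best = max(scores, key=scores.get)
        return best if scores[best] >= threshold else match.group(0)

    return ACRONYM_RE.sub(replace, transcript)


if __name__ == "__main__":
    user_ctx = {"sprint", "roadmap"}   # e.g. terms from the user's own documents
    global_ctx = {"election"}          # e.g. terms from broader sources
    print(expand_acronyms("The PM approved the plan.", user_ctx, global_ctx))
```

In this sketch the user-specific context is weighted more heavily than the global context, so an acronym is expanded only when the evidence clearly favours one candidate; otherwise it is left as spoken. A real system would need speech-to-text before this step and speech synthesis after it, neither of which is shown here.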