Processing

Please wait...

Settings

Settings

Goto Application

Offices all Languages en Stemming true Single Family Member false Include NPL false
RSS feed can only be generated if you have a WIPO account

Save query

A private query is only visible to you when you are logged-in and can not be used in RSS feeds

Query Tree

Refine Options

Offices
All
Specify the language of your search keywords
Stemming reduces inflected words to their stem or root form.
For example the words fishing, fished,fish, and fisher are reduced to the root word,fish,
so a search for fisher returns all the different variations
Returns only one member of a family of patents
Include Non-Patent literature in results

Full Query

AIfunctionalapplicationsSpeechProcessingSpeechSynthesis

Side-by-side view shortcuts

General
Go to Search input
CTRL + SHIFT +
Go to Results (selected record)
CTRL + SHIFT +
Go to Detail (selected tab)
CTRL + SHIFT +
Go to Next page
CTRL +
Go to Previous page
CTRL +
Results (First, do 'Go to Results')
Go to Next record / image
/
Go to Previous record / image
/
Scroll Up
Page Up
Scroll Down
Page Down
Scroll to Top
CTRL + Home
Scroll to Bottom
CTRL + End
Detail (First, do 'Go to Detail')
Go to Next tab
Go to Previous tab

Analysis

1.20200342307Swarm fair deep reinforcement learning
US 29.10.2020
Int.Class G06N 3/08
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computing arrangements based on biological models
02Neural networks
08Learning methods
Appl.No 16395187 Applicant International Business Machines Corporation Inventor Aaron K. Baughman

Fair deep reinforcement learning is provided. A microstate of an environment and reaction of items in a plurality of microstates within the environment are observed after an agent performs an action in the environment. Semi-supervised training is utilized to determine bias weights corresponding to the action for the microstate of the environment and the reaction of the items in the plurality of microstates within the environment. The bias weights from the semi-supervised training are merged with non-bias weights using an artificial neural network. Over time, it is determined where bias is occurring in the semi-supervised training based on merging the bias weights with the non-bias weights in the artificial neural network. A deep reinforcement learning model that decreases reliance on the bias weights is generated based on determined bias to increase fairness.

2.12112752Cohort determination in natural language processing
US 08.10.2024
Int.Class G10L 15/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialog
Appl.No 17688279 Applicant Amazon Technologies, Inc. Inventor Rahul Gupta

Devices and techniques are generally described for cohort determination in natural language processing. In various examples, a first natural language input to a natural language processing system may be determined. The first natural language input may be associated with a first account identifier. A first machine learning model may determine first data representing one or more words of the first natural language input. A second machine learning model may determine second data representing one or more acoustic characteristics of the first natural language input. Third data may be determined, the third data including a predicted performance for processing the first natural language input by the natural language processing system. The third data may be determined based on the first data representation and the second data representation.

3.20210183392PHONEME-BASED NATURAL LANGUAGE PROCESSING
US 17.06.2021
Int.Class G10L 15/26
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
Appl.No 17028361 Applicant LG ELECTRONICS INC. Inventor Kwangyong LEE

A natural language processing method and apparatus are disclosed. A natural language processing method according to an embodiment of the present disclosure includes extracting a phoneme string from a text corpus labeled with recognition information including at least one of one named entity (NE) or speech intention, generating a phoneme-based training data set by labeling the recognition information in the extracted phoneme string, and generating an artificial neural network-based learning model (LM) using the generated training data set. The natural language processing method of the present disclosure may be associated with an artificial intelligence module, a drone (Unmanned Aerial Vehicle, UAV), a robot, an AR (Augmented Reality) device, a VR (Virtual Reality) device, a device associated with 5G services, etc.

4.20140180975INSTANCE WEIGHTED LEARNING MACHINE LEARNING MODEL
US 26.06.2014
Int.Class G06N 99/00
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
99Subject matter not provided for in other groups of this subclass
Appl.No 13725653 Applicant INSIDESALES.COM, INC. Inventor Martinez Tony Ramon

An instance weighted learning (IWL) machine learning model. In one example embodiment, a method of employing an IWL machine learning model to train a classifier may include determining a quality value that should be associated with each machine learning training instance in a temporal sequence of reinforcement learning machine learning training instances, associating the corresponding determined quality value with each of the machine learning training instances, and training a classifier using each of the machine learning training instances. Each of the machine learning training instances includes a state-action pair and is weighted during the training based on its associated quality value using a weighting factor that weights different quality values differently such that the classifier learns more from a machine learning training instance with a higher quality value than from a machine learning training instance with a lower quality value.

5.2013364041Instance weighted learning machine learning model
AU 09.07.2015
Int.Class G06F 15/18
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
15Digital computers in general; Data processing equipment in general
18in which a program is changed according to experience gained by the computer itself during a complete run; Learning machines
Appl.No 2013364041 Applicant InsideSales.com, Inc. Inventor Martinez, Tony Ramon
An instance weighted learning (IWL) machine learning model. In one example embodiment, a method of employing an IWL machine learning model to train a classifier may include determining a quality value that should be associated with each machine learning training instance in a temporal sequence of reinforcement learning machine learning training instances, associating the corresponding determined quality value with each of the machine learning training instances, and training a classifier using each of the machine learning training instances. Each of the machine learning training instances includes a state-action pair and is weighted during the training based on its associated quality value using a weighting factor that weights different quality values differently such that the classifier learns more from a machine learning training instance with a higher quality value than from a machine learning training instance with a lower quality value.
6.20210304769Generating and using text-to-speech data for speech recognition models
US 30.09.2021
Int.Class G10L 15/26
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
Appl.No 15931788 Applicant MICROSOFT TECHNOLOGY LICENSING, LLC Inventor Guoli Ye

Systems, methods, and devices are provided for generating and using text-to-speech (TTS) data for improved speech recognition models. A main model is trained with keyword independent baseline training data. In some instances, acoustic and language model sub-components of the main model are modified with new TTS training data. In some instances, the new TTS training is obtained from a multi-speaker neural TTS system for a keyword that is underrepresented in the baseline training data. In some instances, the new TTS training data is used for pronunciation learning and normalization of keyword dependent confidence scores in keyword spotting (KWS) applications. In some instances, the new TTS training data is used for rapid speaker adaptation in speech recognition models.

7.20210280195Infrastructure automation platform to assist in performing actions in response to tasks
US 09.09.2021
Int.Class G10L 17/00
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
17Speaker identification or verification techniques
Appl.No 16809078 Applicant Accenture Global Solutions Limited Inventor Madhan Kumar Srinivasan

A device may receive user personalized data and user activity data identifying tasks and actions performed by a user, and may perform natural language processing on the user personalized data and the user activity data to generate processed textual data. The device may train machine learning models based on the processed textual data to generate trained machine learning models, and may receive, from a client device, a command identifying a particular task to be performed. The device may process the command and the user activity data, with the trained machine learning models, to determine whether a particular action in the user activity data correlates with the particular task. The device may perform actions when the particular action correlates with the particular task.

8.20140188462System and method for analyzing ambiguities in language for natural language processing
US 03.07.2014
Int.Class G06F 17/00
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
Appl.No 14201974 Applicant Zadeh Lotfi A. Inventor Zadeh Lotfi A.

Specification covers new algorithms, methods, and systems for artificial intelligence, soft computing, and deep learning/recognition, e.g., image recognition (e.g., for action, gesture, emotion, expression, biometrics, fingerprint, facial, OCR (text), background, relationship, position, pattern, and object), large number of images (“Big Data”) analytics, machine learning, training schemes, crowd-sourcing (using experts or humans), feature space, clustering, classification, similarity measures, optimization, search engine, ranking, question-answering system, soft (fuzzy or unsharp) boundaries/impreciseness/ambiguities/fuzziness in language, Natural Language Processing (NLP), Computing-with-Words (CWW), parsing, machine translation, sound and speech recognition, video search and analysis (e.g. tracking), image annotation, geometrical abstraction, image correction, semantic web, context analysis, data reliability (e.g., using Z-number (e.g., “About 45 minutes; Very sure”)), rules engine, control system, autonomous vehicle, self-diagnosis and self-repair robots, system diagnosis, medical diagnosis, biomedicine, data mining, event prediction, financial forecasting, economics, risk assessment, e-mail management, database management, indexing and join operation, memory management, and data compression.

9.20100312562Hidden Markov model based text to speech systems employing rope-jumping algorithm
US 09.12.2010
Int.Class G10L 13/00
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
13Speech synthesis; Text to speech systems
Appl.No 12478342 Applicant Microsoft Corporation Inventor Wang Wenlin

A rope-jumping algorithm is employed in a Hidden Markov Model based text to speech system to determine start and end models and to modify the start and end models by setting small co-variances. Disordered acoustic parameters due to violation of parameter constraints are avoided through the modification and result in stable line frequency spectrum for the generated speech.

10.3937170SPEECH ANALYSIS FOR MONITORING OR DIAGNOSIS OF A HEALTH CONDITION
EP 12.01.2022
Int.Class G10L 25/66
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
48specially adapted for particular use
51for comparison or discrimination
66for extracting parameters related to health condition
Appl.No 20185364 Applicant NOVOIC LTD Inventor WESTON JACK
The invention relates to a computer-implemented method of training a machine learning model for performing speech analysis for monitoring or diagnosis of a health condition. The method uses training data comprising audio speech data and comprises obtaining one or more linguistic representations that each encode a sub-word, word, or multiple word sequence, of the audio speech data; obtaining one or more audio representations that each encode audio content of a segment of the audio speech data; combining the linguistic representations and audio representations into an input sequence comprising: linguistic representations of a sequence of one or more words or sub-words of the audio speech data; and audio representations of segments of the audio speech data, where the segments together contain the sequence of the one or more words or sub-words. The method further includes training a machine learning model using unsupervised learning to map the input sequence to a target output to learn combined audio-linguistic representations of the audio speech data for use in speech analysis for monitoring or diagnosis of a health condition.