1. US20200392178 - PROTEIN-TARGETED DRUG COMPOUND IDENTIFICATION

Office United States of America
Application Number 17004104
Application Date 27.08.2020
Publication Number 20200392178
Publication Date 17.12.2020
Publication Kind A1
IPC
C07K 1/04
C CHEMISTRY; METALLURGY
07 ORGANIC CHEMISTRY
K PEPTIDES
1 General processes for the preparation of peptides
04 on carriers
G16C 20/50
G PHYSICS
16 INFORMATION AND COMMUNICATION TECHNOLOGY SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
C COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
20 Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
50 Molecular design, e.g. of drugs
G16B 5/00
G PHYSICS
16 INFORMATION AND COMMUNICATION TECHNOLOGY SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
5 ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
CPC
G16B 5/00
G PHYSICS
16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
5 ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
C07K 1/047
C CHEMISTRY; METALLURGY
07 ORGANIC CHEMISTRY
K PEPTIDES
1 General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
04 on carriers
047 Simultaneous synthesis of different peptide species; Peptide libraries
G16C 20/50
G PHYSICS
16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
C COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
20 Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
50 Molecular design, e.g. of drugs
Applicants INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors Matteo Manica
Maria Rodriguez Martinez
Jannis Born
Joris Cadow
Title
(EN) PROTEIN-TARGETED DRUG COMPOUND IDENTIFICATION
Abstract
(EN)

Methods and systems are provided for identifying drug compounds for targeting proteins in tissue cells. Such a method includes providing a neural network model which comprises an attention-based protein encoder and a molecular decoder. The protein encoder is pretrained in an autoencoder architecture to encode an input protein sequence into an output vector in a latent space representing proteins. The molecular decoder is pretrained in an autoencoder architecture to generate compound data, defining a compound molecule, from an input vector in a latent space representing molecules. The protein encoder and molecular decoder are coupled such that the input vector of the molecular decoder is dependent on the output vector of the protein encoder for an input protein sequence.
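
The coupling described in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not the implementation disclosed in the application: the class names, the linear `bridge` matrix that maps the protein latent space to the molecule latent space, the toy token vocabulary, and the random (not pretrained) weights are all assumptions made for illustration. The encoder uses a single learned query vector as a simple stand-in for attention, and the decoder greedily emits tokens from its input vector.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Hypothetical toy vocabulary; real models use a full SMILES tokenizer.
TOKENS = ["C", "N", "O", "c", "1", "(", ")", "=", "<eos>"]


def make_matrix(rows, cols, rng):
    """Random weights standing in for pretrained parameters (assumption)."""
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class ProteinEncoder:
    """Attention-pooling encoder: protein sequence -> protein latent vector."""

    def __init__(self, dim, rng):
        self.dim = dim
        self.embed = {aa: [rng.uniform(-1, 1) for _ in range(dim)]
                      for aa in AMINO_ACIDS}
        self.query = [rng.uniform(-1, 1) for _ in range(dim)]

    def encode(self, sequence):
        vecs = [self.embed[aa] for aa in sequence]
        # One attention score per residue, normalised over the sequence.
        weights = softmax([sum(q * x for q, x in zip(self.query, v))
                           for v in vecs])
        # Latent vector = attention-weighted sum of residue embeddings.
        return [sum(w * v[i] for w, v in zip(weights, vecs))
                for i in range(self.dim)]


class MolecularDecoder:
    """Greedy decoder: molecule latent vector -> token string."""

    def __init__(self, dim, rng, max_len=12):
        self.out = make_matrix(len(TOKENS), dim, rng)
        self.step = make_matrix(dim, dim, rng)
        self.max_len = max_len

    def decode(self, latent):
        state, tokens = list(latent), []
        for _ in range(self.max_len):
            logits = matvec(self.out, state)
            token = TOKENS[logits.index(max(logits))]
            if token == "<eos>":
                break
            tokens.append(token)
            state = [math.tanh(x) for x in matvec(self.step, state)]
        return "".join(tokens)


def generate_for_protein(sequence, encoder, bridge, decoder):
    """Couple the two models: the decoder input depends on the encoder output."""
    z_protein = encoder.encode(sequence)
    z_molecule = matvec(bridge, z_protein)  # hypothetical linear coupling
    return decoder.decode(z_molecule)


rng = random.Random(0)
encoder = ProteinEncoder(8, rng)
decoder = MolecularDecoder(8, rng)
bridge = make_matrix(8, 8, rng)
candidate = generate_for_protein("MKTAYIAKQR", encoder, bridge, decoder)
```

Because the weights are random, `candidate` is only a string of vocabulary tokens, not a chemically valid molecule. In a trained system the encoder and decoder would each come from a pretrained autoencoder, and the coupling between their latent spaces would be learned rather than fixed.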