1. WO2020095155 - LARGE MODEL SUPPORT IN DEEP LEARNING

Publication Number WO/2020/095155
Publication Date 14.05.2020
International Application No. PCT/IB2019/059294
International Filing Date 30.10.2019
IPC
G06F 15/16 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
15 Digital computers in general; Data processing equipment in general
16 Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
CPC
G06F 13/4282
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
13 Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
38 Information transfer, e.g. on bus
42 Bus transfer protocol, e.g. handshake; Synchronisation
4282 on a serial bus, e.g. I2C bus, SPI bus
G06N 3/04
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3 Computer systems based on biological models
02 using neural network models
04 Architectures, e.g. interconnection topology
G06N 3/084
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3 Computer systems based on biological models
02 using neural network models
08 Learning methods
084 Back-propagation
Applicants
  • INTERNATIONAL BUSINESS MACHINES CORPORATION [US]/[US]
  • IBM UNITED KINGDOM LIMITED [GB]/[GB] (MG)
  • IBM (CHINA) INVESTMENT COMPANY LIMITED [CN]/[CN] (MG)
Inventors
  • CHO, Minsik
  • FINKLER, Ulrich, Alfons
  • ZOLOTOV, Vladimir
  • KUNG, David
Agents
  • FOURNIER, Kevin
Priority Data
16/180,864 05.11.2018 US
Publication Language English (EN)
Filing Language English (EN)
Title
(EN) LARGE MODEL SUPPORT IN DEEP LEARNING
(FR) PRISE EN CHARGE DE GRANDS MODÈLES EN APPRENTISSAGE PROFOND
Abstract
(EN)
Techniques that facilitate model support in deep learning are provided. In one example, a system includes a graphics processing unit and a central processing unit memory. The graphics processing unit processes data to train a deep neural network. The central processing unit memory stores a portion of the data to train the deep neural network. The graphics processing unit provides, during a forward pass process of the deep neural network that traverses through a set of layers for the deep neural network from a first layer of the set of layers to a last layer of the set of layers that provides a set of outputs for the deep neural network, input data for a layer from the set of layers for the deep neural network to the central processing unit memory.
(FR)
L'invention concerne des techniques qui facilitent la prise en charge de modèles en apprentissage profond. Dans un exemple, un système comprend une unité de traitement graphique et une mémoire d'unité centrale de traitement. L'unité de traitement graphique traite des données destinées à entraîner un réseau neuronal profond. La mémoire d'unité centrale de traitement stocke une partie des données destinées à entraîner le réseau neuronal profond. Pendant un processus de passage direct du réseau neuronal profond, qui traverse un ensemble de couches pour le réseau neuronal profond, d'une première couche de l'ensemble de couches à une dernière couche de l'ensemble de couches qui fournit un ensemble de sorties pour le réseau neuronal profond, l'unité de traitement graphique fournit, à la mémoire d'unité centrale de traitement, des données d'entrée pour une couche de l'ensemble de couches pour le réseau neuronal profond.
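
The forward-pass behaviour described in the abstract can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not the applicants' implementation: plain Python dictionaries stand in for graphics processing unit memory and central processing unit memory, the layers are simple list transforms, and the names `forward_with_offload`, `scale`, and `shift` are hypothetical. The abstract does not say what happens to the stored inputs afterwards, so the sketch stops at the end of the forward pass.

```python
# Toy simulation: dicts stand in for GPU and CPU memory (an assumption for
# illustration only; no real device transfers happen here).

def forward_with_offload(layers, x):
    """Traverse layers from first to last, handing each layer's input
    to the CPU-memory stand-in during the forward pass."""
    cpu_memory = {}    # stand-in for central processing unit memory
    gpu_resident = {}  # stand-in for data currently held on the GPU
    for index, layer in enumerate(layers):
        gpu_resident[index] = x           # input is on the "GPU" for compute
        y = layer(x)                      # compute this layer's output
        # Offload: move this layer's input from the "GPU" to "CPU" memory.
        cpu_memory[index] = gpu_resident.pop(index)
        x = y                             # output feeds the next layer
    return x, cpu_memory, gpu_resident


def scale(factor):
    """Hypothetical layer: multiply every value by a constant."""
    return lambda values: [factor * v for v in values]


def shift(offset):
    """Hypothetical layer: add a constant to every value."""
    return lambda values: [v + offset for v in values]


layers = [scale(2.0), shift(1.0), scale(0.5)]
outputs, cpu_memory, gpu_resident = forward_with_offload(layers, [1.0, 2.0])
print(outputs)       # outputs of the last layer
print(cpu_memory)    # per-layer inputs now held in the CPU-memory stand-in
print(gpu_resident)  # nothing is left in the GPU stand-in
```

In this sketch every layer's input is offloaded; a real system would presumably choose which layers to offload and would overlap transfers with computation, details the abstract does not specify.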