1. WO2020192868 - EVENT DETECTION

Publication Number WO/2020/192868
Publication Date 01.10.2020
International Application No. PCT/EP2019/057258
International Filing Date 22.03.2019
IPC
G06K 9/00 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G06K 9/46 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
46 Extraction of features or characteristics of the image
G06K 9/62 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62 Methods or arrangements for recognition using electronic means
CPC
G06K 2009/00738
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00624 Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
00711 Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
00738 Event detection
G06K 9/00718
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00624 Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
00711 Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
00718 Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
G06K 9/4628
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
46 Extraction of features or characteristics of the image
4604 Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
4609 by matching or filtering
4619 Biologically-inspired filters, e.g. receptive fields
4623 with interaction between the responses of different filters
4628 Integrating the filters into a hierarchical structure
G06K 9/6202
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62 Methods or arrangements for recognition using electronic means
6201 Matching; Proximity measures
6202 Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
G06K 9/6271
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62 Methods or arrangements for recognition using electronic means
6267 Classification techniques
6268 relating to the classification paradigm, e.g. parametric or non-parametric approaches
627 based on distances between the pattern to be recognised and training or reference patterns
6271 based on distances to prototypes
G06K 9/6292
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62 Methods or arrangements for recognition using electronic means
6288 Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion
6292 of classification results, e.g. of classification results related to same input data
Applicants
  • HUAWEI TECHNOLOGIES CO., LTD. [CN]/[CN]
  • REDZIC, Milan [RS]/[DE] (US)
Inventors
  • REDZIC, Milan
  • LIU, Shaoqing
  • YUAN, Peng
Agents
  • KREUZ, Georg
Priority Data
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) EVENT DETECTION
(FR) DÉTECTION D'ÉVÉNEMENT
Abstract
(EN)
A neural network-based video processing system for determining a correlation between two time-spaced images from a video stream, the system comprising: an image feature map extractor comprising a neural network, the image feature map extractor being configured to determine, for each of the images, a feature map comprising a plurality of channels and a plurality of locations of pixels in the image, the feature map representing the response of the respective image over each of the plurality of channels and at each of the plurality of locations of pixels in the respective image; and a feature map aggregator configured to form an aggregated feature map by weighting each of the values in the feature map by (i) a factor representing the total channel response at the location corresponding to the respective value in the feature map normalised with respect to the total channel response over the respective image and (ii) a factor that indicates the extent to which the feature map indicates the image's response over the channel corresponding to the respective value in the feature map; the system being configured to determine the correlation by comparing the aggregated feature maps for each of the images.
(FR)
L'invention concerne un système de traitement vidéo basé sur un réseau neuronal pour déterminer une corrélation entre deux images espacées dans le temps à partir d'un flux vidéo, le système comprenant : un extracteur de carte de caractéristiques d'image comprenant un réseau neuronal, l'extracteur de carte de caractéristiques d'image étant configuré pour déterminer, pour chacune des images, une carte de caractéristiques comprenant une pluralité de canaux et une pluralité d'emplacements de pixels dans l'image, la carte de caractéristiques représentant la réponse de l'image respective sur chacun de la pluralité de canaux et à chacun de la pluralité d'emplacements de pixels dans l'image respective ; et un agrégateur de cartes de caractéristiques configuré pour former une carte de caractéristiques agrégée en pondérant chacune des valeurs dans la carte de caractéristiques par (i) un facteur représentant la réponse de canal totale à l'emplacement correspondant à la valeur respective dans la carte de caractéristiques normalisée par rapport à la réponse de canal totale sur l'image respective et (ii) un facteur qui indique l'étendue à laquelle la carte de caractéristiques indique la réponse de l'image sur le canal correspondant à la valeur respective dans la carte de caractéristiques ; le système étant configuré pour déterminer la corrélation par comparaison des cartes de caractéristiques agrégées pour chacune des images.
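The weighting described in the abstract can be illustrated with a minimal sketch. This is not the applicants' implementation. The abstract does not define factor (ii) or the comparison step precisely, so the sketch assumes that factor (ii) is each channel's share of the total response and that the aggregated maps are compared by cosine similarity. The function names aggregate and correlation are hypothetical, and the feature map is taken to be a nested list of non-negative values indexed as channel, row, column.

```python
import math


def aggregate(feature_map):
    """Weight every value of a (channels x rows x cols) feature map.

    Values are assumed non-negative (e.g. post-ReLU activations).
    """
    total = sum(v for ch in feature_map for row in ch for v in row)
    if total == 0:
        # No response anywhere: return an all-zero map of the same shape.
        return [[[0.0 for _ in row] for row in ch] for ch in feature_map]

    n_rows = len(feature_map[0])
    n_cols = len(feature_map[0][0])

    # Factor (i): total channel response at each location, normalised by
    # the total response over the whole image.
    spatial = [
        [sum(ch[y][x] for ch in feature_map) / total for x in range(n_cols)]
        for y in range(n_rows)
    ]

    # Factor (ii): ASSUMPTION. Each channel's share of the total response.
    # The abstract only says the factor indicates the extent of the image's
    # response over the channel.
    channel = [sum(v for row in ch for v in row) / total for ch in feature_map]

    return [
        [
            [ch[y][x] * spatial[y][x] * channel[c] for x in range(n_cols)]
            for y in range(n_rows)
        ]
        for c, ch in enumerate(feature_map)
    ]


def correlation(map_a, map_b):
    """ASSUMPTION: compare two aggregated maps by cosine similarity."""
    a = [v for ch in map_a for row in ch for v in row]
    b = [v for ch in map_b for row in ch for v in row]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Two toy "frames" with 2 channels and 2 x 2 locations each.
frame_a = [[[1.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
frame_b = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 1.0]]]

same = correlation(aggregate(frame_a), aggregate(frame_a))
different = correlation(aggregate(frame_a), aggregate(frame_b))
```

In this toy case the two frames respond in different channels and at different locations, so their aggregated maps do not overlap, while a frame compared with itself gives the maximum similarity. In the system the abstract describes, the feature maps come from a neural network rather than from hand-written lists.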