Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020087713 - VIDEO QUALITY INSPECTION METHOD AND APPARATUS, COMPUTER DEVICE AND STORAGE MEDIUM

Publication Number WO/2020/087713
Publication Date 07.05.2020
International Application No. PCT/CN2018/123132
International Filing Date 24.12.2018
IPC
H04N 21/44 2011.01
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
NPICTORIAL COMMUNICATION, e.g. TELEVISION
21Selective content distribution, e.g. interactive television or video on demand
40Client devices specifically adapted for the reception of, or interaction with, content, e.g. STB ; Operations thereof
43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronizing decoder's clock; Client middleware
44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs
G06Q 10/06 2012.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
10Administration; Management
06Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
G06K 9/00 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G10L 15/26 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
CPC
G06K 9/00268
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
00268Feature extraction; Face representation
G06K 9/00744
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00624Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
00711Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
00744Extracting features from the video content, e.g. video "fingerprints", or characteristics, e.g. by automatic extraction of representative shots or key frames
G06Q 10/06395
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
10Administration; Management
06Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
063Operations research or analysis
0639Performance analysis
06395Quality analysis or management
G06Q 40/08
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
40Finance; Insurance; Tax strategies; Processing of corporate or income taxes
08Insurance, e.g. risk analysis or pensions
G10L 15/26
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
Applicants
  • 深圳壹账通智能科技有限公司 ONE CONNECT SMART TECHNOLOGY CO., LTD. (SHENZHEN) [CN]/[CN]
Inventors
  • 付舒婷 FU, Shuting
Agents
  • 深圳众鼎专利商标代理事务所(普通合伙) SHENZHEN ZHONGDING INTELLECTUAL PROPERTY AGENCY
Priority Data
201811301549.902.11.2018CN
Publication Language Chinese (ZH)
Filing Language Chinese (ZH)
Designated States
Title
(EN) VIDEO QUALITY INSPECTION METHOD AND APPARATUS, COMPUTER DEVICE AND STORAGE MEDIUM
(FR) PROCÉDÉ ET APPAREIL D'INSPECTION DE QUALITÉ VIDÉO, DISPOSITIF INFORMATIQUE, ET SUPPORT DE STOCKAGE
(ZH) 视频质检方法、装置、计算机设备及存储介质
Abstract
(EN)
Disclosed is a video quality inspection method, which is used for solving the problem of low timeliness of video quality inspection. The method provided by the present application comprises: extracting frames from a target video to obtain each video picture; carrying out facial recognition on each video picture, and detecting whether the face of a designated person is included in each video picture, so as to obtain a first detection result corresponding to each video picture; carrying out speech recognition on speech of the target video to obtain a target text; calculating a required reading rate of a required reading text according to the target text and the preset required reading text; calculating an unreadability rate of an unreadable text according to the target text and the preset unreadable text; detecting whether the required reading rate is higher than a preset first threshold value and the unreadability rate is lower than a preset second threshold value, so as to obtain a second detection result; if so, determining that the target video passes the quality inspection; and if not, determining that the target video does not pass the quality inspection. Also provided in the present application are a video quality inspection apparatus, a computer device and a storage medium.
(FR)
L'invention concerne un procédé d'inspection de qualité vidéo, qui est utilisé pour résoudre le problème de faible rapidité d'inspection de qualité vidéo. Le procédé selon la présente invention comprend les étapes consistant à : extraire des trames d'une vidéo cible pour obtenir chaque image vidéo ; effectuer une reconnaissance faciale sur chaque image vidéo, et détecter si le visage d'une personne désignée est inclus dans chaque image vidéo, de façon à obtenir un premier résultat de détection correspondant à chaque image vidéo ; effectuer une reconnaissance vocale sur la parole de la vidéo cible pour obtenir un texte cible ; calculer un taux de lecture requis d'un texte de lecture requis selon le texte cible et le texte de lecture requis prédéfini ; calculer un taux de non-lisibilité d'un texte non lisible selon le texte cible et le texte non lisible prédéfini ; détecter si le taux de lecture requis est supérieur à une première valeur seuil prédéfinie et le taux de non-lisibilité est inférieur à une seconde valeur seuil prédéfinie, de façon à obtenir un second résultat de détection ; si tel est le cas, déterminer que la vidéo cible réussit l'inspection de qualité ; et si tel n'est pas le cas, déterminer que la vidéo cible échoue à l'inspection de qualité. La présente invention concerne également un appareil d'inspection de qualité vidéo, un dispositif informatique et un support de stockage.
(ZH)
本申请公开了一种视频质检方法,用于解决视频质检时效性低的问题。本申请提供的方法包括:对目标视频进行抽帧处理,得到各个视频图片;对各个视频图片进行人脸识别,检测各个视频图片中是否包括指定人员的人脸,得到各个视频图片对应的第一检测结果;对目标视频的语音进行语音识别处理,得到目标文本;根据目标文本和预设的必读文本计算必读文本的必读率;根据目标文本和预设的不可读读文本计算不可读文本的不可读率;检测是否必读率高于预设第一阈值且不可读率低于预设第二阈值,得到第二检测结果;若均为是,则确定目标视频质检通过;反之,则确定目标视频质检不通过。本申请还提供视频质检装置、计算机设备及存储介质。
Also published as
Latest bibliographic data on file with the International Bureau