(EN) The invention discloses a real-time binocular stereo matching method based on an adaptive candidate parallax prediction network. The method comprises the following steps: firstly, carrying out multi-scale feature extraction on a three-dimensional image pair by utilizing a two-dimensional convolutional neural network to obtain high-resolution and low-resolution feature maps; then, in the first stage of parallax estimation, using a low-resolution feature map is used to perform parallax coarse estimation; after the adaptive candidate parallax is predicted by using the rough estimation result andthe left image information, in the second-stage parallax estimation, carrying out fine parallax estimation by using the prediction result and the high-resolution feature map; and finally, performing hierarchical refinement on the disparity map to obtain a full-size disparity map. Compared with an existing coarse-to-fine stereo matching neural network, the invention has the advantages that more accurate dynamic offset can be predicted for a fine parallax estimation stage so as to meet different parallax correction requirements of various targets in the image. Due to the effectiveness of dynamicprediction, a two-stage processing structure is designed to greatly improve the calculation precision and speed of the real-time binocular stereo matching network.
(ZH) 本发明公开了一种基于自适应候选视差预测网络的实时双目立体匹配方法。该方法首先利用二维卷积神经网络对立体图像对进行多尺度特征提取,得到高、低分辨率的特征图。然后,第一阶段视差估计利用低分辨率特征图进行视差粗估计。在利用粗估计结果和左图信息预测得到自适应候选视差以后,第二阶段视差估计则利用预测结果和高分辨率特征图进行精细视差估计。最后,对视差图进行层次化精修得到全尺寸视差图。与现有的由粗到精立体匹配神经网络相比,本发明可以为精细视差估计阶段预测更准确的动态偏移量,以满足图像中各种目标不同的视差校正需求。由于动态预测的有效性,本发明设计了两级处理结构以大幅提高实时双目立体匹配网络的计算精度和速度。