(EN) An apparatus and method for displaying multimedia data described according to a MusicPhotoVideo (MPV) format. In the apparatus, it is checked whether an asset selected by a user is comprised of single photo data and one or more video data, reference information needed for displaying the photo data and the one or more video data is extracted, and the photo data and the one or more video data are extracted using the extracted reference information and displayed sequentially displayed using a predetermined displaying method.