Certains contenus de cette application ne sont pas disponibles pour le moment.
Si cette situation persiste, veuillez nous contacter àObservations et contact
1. (WO2019046065) MÉTADONNÉES DE NAVIGATION AVEC SENSIBILISATION AU CONTENU MULTIMÉDIA
Note: Texte fondé sur des processus automatiques de reconnaissance optique de caractères. Seule la version PDF a une valeur juridique

CLAIMS

1. A method of processing media content comprising video content and associated audio content, the method comprising:

receiving the video content and the associated audio content;

analyzing the associated audio content;

determining, based on the analysis, one or more navigation points for enabling navigation of the media content, the one or more navigation points indicating points of interest in the associated audio content for short-term rewinding;

embedding the one or more navigation points into metadata for the media content; and outputting the video content, the associated audio content, and the metadata;

wherein analyzing the audio content involves applying speech detection to the audio content;

wherein the one or more navigation points are placed at respective starting points of spoken utterances included in the associated audio content.

2. The method of claim 1, wherein the one or more navigation points indicate respective offsets from a starting point of a respective current frame.

3. The method of claim 1 or 2, wherein the metadata is aligned with the associated audio content.

4. The method of any one of claims 1 to 3, wherein the metadata enables content-aware navigation of the media content.

5. The method of any one of claims 1 to 4, wherein the method is performed at an encoder for encoding the media content; and

the method further comprises receiving an input of one or more additional navigation points.

6. The method of any one of claims 1 to 5, further comprising:

generating an audio- visual representation of the media content based on the video content, the associated audio content, and the metadata.

7. The method of claim 6, further comprising:

modifying and replaying the media content with improved intelligibility and/or coherence in response to a user instruction instructing replay from one of the one or more navigation points.

8. The method of claim 6 or 7, further comprising:

setting a scan rate for scanning through the media content at least in part based on a density of the one or more navigation points over time.

9. The method of any one of claims 6 to 8, further comprising:

setting a correspondence between points on a visual representation of a scan bar and points in time in the video content at least in part based on a density of the one or more navigation points over time.

10. The method of any one of claims 6 to 9, further comprising:

providing a fast-forward replay mode in which respective portions of the media content are replayed starting from respective ones of the one or more navigation points.

11. The method of any one of claims 6 to 10, further comprising:

resuming playback after a pause of the replay at a timing indicated by a most recent one of the one or more navigation points.

12. An encoder comprising a processor and a memory storing instructions for causing the processor to perform the operations of any one of claims 1 to 11.

13. A decoder comprising a processor and a memory storing instructions for causing the processor to perform the operations of any one of claims 1 to 11.

14. A program for causing a computer to perform the operations of any one of claims 1 to 11 when performed on the computer.

15. A computer-readable storage medium storing a program for causing a computer to perform the operations of any one of claims 1 to 11 when performed on the computer.