1. (WO2019046065) MEDIA-AWARE NAVIGATION METADATA
Note: This text was produced by automatic optical character recognition (OCR). For legal purposes, use the PDF version.

CLAIMS

1. A method of processing media content comprising video content and associated audio content, the method comprising:

receiving the video content and the associated audio content;

analyzing the associated audio content;

determining, based on the analysis, one or more navigation points for enabling navigation of the media content, the one or more navigation points indicating points of interest in the associated audio content for short-term rewinding;

embedding the one or more navigation points into metadata for the media content; and

outputting the video content, the associated audio content, and the metadata;

wherein analyzing the audio content involves applying speech detection to the audio content;

wherein the one or more navigation points are placed at respective starting points of spoken utterances included in the associated audio content.
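
By way of illustration only, the analysis and embedding steps of claim 1 may be sketched as follows. The sketch assumes a simple energy-threshold speech detector, which the claims do not specify; the function names, the threshold values, and the `navigation_points` metadata key are hypothetical and chosen for this example.

```python
from typing import List, Sequence


def detect_utterance_starts(
    samples: Sequence[float],
    sample_rate: int,
    window_s: float = 0.02,
    energy_threshold: float = 0.01,
    min_silence_s: float = 0.3,
) -> List[float]:
    """Return start times (in seconds) of spoken utterances.

    Assumption: a window counts as speech when its mean squared amplitude
    exceeds energy_threshold. An utterance starts at the first speech
    window that follows at least min_silence_s of non-speech, or at the
    first speech window of the signal.
    """
    window = max(1, int(window_s * sample_rate))
    min_silence_windows = max(1, round(min_silence_s / window_s))
    starts: List[float] = []
    # Treat the beginning of the signal as if it were preceded by silence.
    silent_run = min_silence_windows
    for i in range(0, len(samples), window):
        chunk = samples[i:i + window]
        energy = sum(x * x for x in chunk) / len(chunk)
        if energy > energy_threshold:
            if silent_run >= min_silence_windows:
                starts.append(i / sample_rate)
            silent_run = 0
        else:
            silent_run += 1
    return starts


def embed_navigation_points(metadata: dict, starts: List[float]) -> dict:
    """Return a copy of the metadata with the navigation points added."""
    out = dict(metadata)
    out["navigation_points"] = list(starts)  # hypothetical key name
    return out
```

A practical implementation would likely replace the energy threshold with a trained speech detector; the placement of navigation points at utterance starts would remain the same.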

2. The method of claim 1, wherein the one or more navigation points indicate respective offsets from a starting point of a respective current frame.
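
By way of illustration only, one possible reading of the offsets of claim 2 is a sample offset measured from the first sample of the frame that contains the navigation point. The sketch below assumes fixed-size frames counted in audio samples; the function names and the frame size used in the example are hypothetical.

```python
from typing import Tuple


def to_frame_offset(nav_sample: int, frame_size: int) -> Tuple[int, int]:
    """Split an absolute sample position into (frame index, offset in frame)."""
    if frame_size <= 0:
        raise ValueError("frame_size must be positive")
    return divmod(nav_sample, frame_size)


def from_frame_offset(frame_index: int, offset: int, frame_size: int) -> int:
    """Recover the absolute sample position from a frame-relative offset."""
    return frame_index * frame_size + offset
```

For example, with an assumed frame size of 1024 samples, a navigation point at sample 2500 falls in frame 2 at offset 452.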

3. The method of claim 1 or 2, wherein the metadata is aligned with the associated audio content.

4. The method of any one of claims 1 to 3, wherein the metadata enables content-aware navigation of the media content.

5. The method of any one of claims 1 to 4, wherein the method is performed at an encoder for encoding the media content; and

the method further comprises receiving an input of one or more additional navigation points.

6. The method of any one of claims 1 to 5, further comprising:

generating an audio-visual representation of the media content based on the video content, the associated audio content, and the metadata.

7. The method of claim 6, further comprising:

modifying and replaying the media content with improved intelligibility and/or coherence in response to a user instruction instructing replay from one of the one or more navigation points.

8. The method of claim 6 or 7, further comprising:

setting a scan rate for scanning through the media content at least in part based on a density of the one or more navigation points over time.
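
By way of illustration only, a density-dependent scan rate as in claim 8 may be sketched as follows. The claim does not state how the rate depends on density; the sketch assumes that scanning slows down where navigation points are dense. The window length, the rate limits, and the `gain` parameter are hypothetical.

```python
from bisect import bisect_left, bisect_right
from typing import Sequence


def local_density(nav_points: Sequence[float], t: float,
                  window_s: float = 10.0) -> float:
    """Navigation points per second in a window centred on time t.

    nav_points must be sorted in ascending order (times in seconds).
    """
    lo = bisect_left(nav_points, t - window_s / 2)
    hi = bisect_right(nav_points, t + window_s / 2)
    return (hi - lo) / window_s


def scan_rate(nav_points: Sequence[float], t: float,
              max_rate: float = 16.0, min_rate: float = 2.0,
              gain: float = 10.0, window_s: float = 10.0) -> float:
    """Playback-speed multiplier for scanning at time t.

    Assumption: the rate equals max_rate where no navigation points are
    nearby and decreases towards min_rate as the local density grows.
    """
    density = local_density(nav_points, t, window_s)
    return max(min_rate, max_rate / (1.0 + gain * density))
```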

9. The method of any one of claims 6 to 8, further comprising:

setting a correspondence between points on a visual representation of a scan bar and points in time in the video content at least in part based on a density of the one or more navigation points over time.
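
By way of illustration only, the correspondence of claim 9 may be sketched as a piecewise-linear map from scan-bar position to media time. The sketch assumes that each interval between neighbouring navigation points receives an equal share of the bar, blended with an ordinary linear time scale by a hypothetical `weight` parameter, so that regions with many navigation points occupy more of the bar. None of these names or choices is taken from the claims.

```python
from bisect import bisect_right
from typing import List, Sequence, Tuple


def build_scan_bar_map(nav_points: Sequence[float], duration: float,
                       weight: float = 0.5) -> Tuple[List[float], List[float]]:
    """Return (bar, times): breakpoints of a map from bar position to time.

    bar positions lie in [0, 1]; times lie in [0, duration]. With
    weight = 0 the map is linear in time; with weight = 1 every interval
    between neighbouring navigation points gets the same bar width.
    """
    pts = sorted({p for p in nav_points if 0.0 < p < duration})
    times = [0.0] + pts + [duration]
    segments = len(times) - 1
    bar = [(1.0 - weight) * t / duration + weight * k / segments
           for k, t in enumerate(times)]
    return bar, times


def bar_to_time(x: float, bar: Sequence[float],
                times: Sequence[float]) -> float:
    """Map a scan-bar position x in [0, 1] to a time in the media content."""
    x = min(max(x, 0.0), 1.0)
    # Index of the right-hand breakpoint of the segment containing x.
    i = max(1, min(bisect_right(bar, x), len(bar) - 1))
    x0, x1 = bar[i - 1], bar[i]
    t0, t1 = times[i - 1], times[i]
    return t0 + (t1 - t0) * (x - x0) / (x1 - x0)
```

With navigation points at 10 s, 11 s, and 12 s in a 100 s programme and `weight = 1.0`, the middle of the bar maps to 11 s rather than 50 s, so the two seconds of dense speech span half of the bar.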

10. The method of any one of claims 6 to 9, further comprising:

providing a fast-forward replay mode in which respective portions of the media content are replayed starting from respective ones of the one or more navigation points.
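
By way of illustration only, the fast-forward replay mode of claim 10 may be sketched as a list of short segments, each starting at a navigation point. The snippet length and the rule that a segment ends at the next navigation point are assumptions of this sketch, and the function name is hypothetical.

```python
from typing import List, Sequence, Tuple


def fast_forward_segments(nav_points: Sequence[float], duration: float,
                          snippet_s: float = 2.0,
                          position: float = 0.0) -> List[Tuple[float, float]]:
    """Return (start, end) playback segments for fast-forward replay.

    Each segment starts at a navigation point at or after the current
    position and lasts at most snippet_s seconds, ending early at the
    next navigation point or at the end of the content.
    """
    pts = sorted(p for p in nav_points if position <= p < duration)
    segments = []
    for i, start in enumerate(pts):
        limit = pts[i + 1] if i + 1 < len(pts) else duration
        segments.append((start, min(start + snippet_s, limit)))
    return segments
```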

11. The method of any one of claims 6 to 10, further comprising:

resuming playback after a pause of the replay at a timing indicated by a most recent one of the one or more navigation points.
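
By way of illustration only, the resume behaviour of claim 11 may be sketched as a lookup of the most recent navigation point at or before the pause time. The function name and the fallback to the pause time when no earlier navigation point exists are assumptions of this sketch.

```python
from bisect import bisect_right
from typing import Sequence


def resume_position(nav_points: Sequence[float], pause_time: float) -> float:
    """Return the time at which playback resumes after a pause.

    nav_points must be sorted in ascending order. Playback resumes at the
    latest navigation point that is not after pause_time; if there is
    none, it resumes at pause_time itself.
    """
    i = bisect_right(nav_points, pause_time)
    return nav_points[i - 1] if i > 0 else pause_time
```

For example, with navigation points at 1.0 s, 4.0 s, and 9.0 s, a pause at 6.5 s resumes at 4.0 s, the start of the utterance that was interrupted.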

12. An encoder comprising a processor and a memory storing instructions for causing the processor to perform the operations of any one of claims 1 to 11.

13. A decoder comprising a processor and a memory storing instructions for causing the processor to perform the operations of any one of claims 1 to 11.

14. A program for causing a computer to perform the operations of any one of claims 1 to 11 when performed on the computer.

15. A computer-readable storage medium storing a program for causing a computer to perform the operations of any one of claims 1 to 11 when performed on the computer.