Processing

Please wait...

PATENTSCOPE will be unavailable a few hours for maintenance reason on Saturday 31.10.2020 at 2:00 PM JST
Settings

Settings

Goto Application

1. WO2020033239 - AUDIO STREAM MIXING SYSTEM AND METHOD

Note: Text based on automatic Optical Character Recognition processes. Please use the PDF version for legal matters

[ EN ]

What is claimed is:

1. A method of mixing a second audio stream with a first audio stream in an audio output device, the method comprising:

buffering the first audio stream;

determining an insertion spot within the first audio stream;

outputting the buffered first audio stream, including modifying output of at least a portion of the first audio stream to accommodate output of the second audio stream at the insertion spot to minimize or avoid content degradation or loss of the first audio stream;

outputting the second audio stream at the insertion spot; and

continuing output of the buffered first audio stream after completion of the second audio stream.

2. The method of claim 1, wherein the second audio stream is received during output of the first audio stream.

3. The method of claim 1, wherein the buffering of the first audio stream occurs in response to receipt of the second audio stream.

4. The method of claim 1, wherein the first audio stream is a radio stream.

5. The method of claim 1, wherein the first audio stream is playback of content from a tangible storage medium local to the audio output device.

6. The method of claim 5, wherein the tangible storage medium is a compact disc, unified serial bus medium, hard drive, or a computer memory.

7. The method of claim 1, wherein the second audio stream is received by the audio output device with an urgency or maximum delay indicator.

8 The method of claim 7, wherein the method includes:

identifying a maximum delay for output of the second audio stream based on the urgency or maximum delay indicator.

9. The method of claim 7, further comprising:

determining the insertion spot based, at least in part, on the urgency or maximum delay indicator.

10. The method of claim 1, wherein determining the insertion spot includes finding a gap or pause within the first audio stream.

11. The method of claim 1, wherein determining the insertion spot includes analyzing the first audio stream using one or more speech analysis techniques to find ends of sentences, phrases, words, or other natural points of interruption.

12. The method of claim 11, wherein the one or more speech analysis techniques includes at least one of Voice Activity Detection (VAD), Automatic Speech Recognition (ASR), and Natural Language Understanding (NLU).

13. The method of claim 1, wherein determining the insertion spot includes finding a low volume level within the first audio stream.

14. The method of claim 1, wherein modifying the first audio stream at the insertion spot includes pausing the first audio stream at the insertion spot.

15. The method of claim 1, wherein modifying the first audio stream at the insertion spot includes modifying a time-frequency structure of the first audio stream.

16. The method of claim 1, wherein modifying the first audio stream at the insertion spot includes warping the first audio stream at or near the insertion spot.

17. The method of claim 1, wherein outputting the second audio stream at the insertion spot includes modifying the second audio stream.

18. The method of claim 17, wherein modifying the second audio stream includes modifying a time-frequency structure of the second audio stream.

19. The method of claim 17, wherein modifying the second audio stream includes modifying a voice style of the second audio stream to improve intelligibility relative to the first audio stream.

20. The method of claim 1, wherein the audio output device is or forms part of a vehicle head unit.

21. The method of claim 20, wherein the second audio stream is an announcement or an alert from a vehicle navigation system, vehicle monitoring system, or a text to speech system.

22. The method of claim 1, wherein the second audio stream is an announcement or an alert from an advertising system.

23. The method of claim 1, wherein the audio output device is a wireless portable device comprising a mobile phone, tablet, or phablet.

24. An audio stream mixing system, comprising:

one or more processors coupled to one or more computer storage devices, one or more first audio stream sources, one or more second audio stream sources, and one or more audio output devices, wherein the one or more processors are configured to:

buffer the first audio stream from a first audio stream source in the one or more computer storage devices;

determine an insertion spot within the first audio stream;

output the buffered first audio stream via the one or more audio output devices, wherein the one or more processors are configured to modify output of at least a portion of the first audio stream to accommodate output of a second audio stream at the insertion spot to minimize or avoid content degradation or loss of the first audio stream, the second audio stream received from a second audio stream source;

output the second audio stream via the one or more audio output devices at the insertion spot; and

continue output of the buffered first audio stream via the one or more audio output devices after completion of the second audio stream.

25. The system of claim 24, wherein the second audio stream is received during output of the first audio stream.

26. The system of claim 24, wherein the one or more processors are configured to buffer the first audio stream in response to receipt of the second audio stream.

27. The system of claim 24, wherein the first audio stream is a radio stream.

28. The system of claim 24, wherein the first audio stream is playback of content from a tangible storage medium local to the audio output device.

29. The system of claim 28, wherein the tangible storage medium is a compact disc, unified serial bus medium, hard drive, or a computer memory.

30. The system of claim 24, wherein the second audio stream is received by the audio output device with an urgency or maximum delay indicator.

31. The system of claim 30, wherein the one or more processors are configured to: identify a maximum delay for output of the second audio stream based on the urgency or maximum delay indicator.

32. The system of claim 30, wherein the one or more processors are configured to: determine the insertion spot based, at least in part, on the urgency or maximum delay indicator.

33. The system of claim 24, wherein the one or more processors are configured to find a gap or pause within the first audio stream to determine the insertion spot.

34. The system of claim 24, wherein the one or more processors are configured to analyze the first audio stream using one or more speech analysis techniques to find ends of sentences, phrases, words, or other natural points of interruption to determine the insertion spot.

35. The system of claim 34, wherein the one or more speech analysis techniques includes at least one of Voice Activity Detection (VAD), Automatic Speech Recognition (ASR), and Natural Language Understanding (NLU).

36. The system of claim 24, wherein the one or more processors are configured to find a low volume level within the first audio stream to determine the insertion spot.

37. The system of claim 24, wherein the one or more processors are configured to pause the first audio stream at the insertion spot as a modification of the first audio stream.

38. The system of claim 24, wherein the one or more processors are configured to modify a time-frequency structure of the first audio stream as a modification of the first audio stream.

39. The system of claim 24, wherein the one or more processors are configured to warp the first audio stream at or near the insertion spot as a modification to the first audio stream.

40. The system of claim 24, wherein the one or more processors are configured to modify the second audio stream and output the second audio stream as a modified second audio stream at the insertion spot.

41. The system of claim 40, wherein the modified second audio stream includes a modified time-frequency structure.

42. The system of claim 40, wherein the one or more processors are configured to modify a voice style of the second audio stream to improve intelligibility relative to the first audio stream.

43. The system of claim 24, wherein the audio output device is or forms part of a vehicle head unit.

44. The system of claim 43, wherein the second audio stream is an announcement or an alert from a vehicle navigation system, vehicle monitoring system, or a text to speech system.

45. The system of claim 24, wherein the second audio stream is an announcement or an alert from an advertising system.

46. The system of claim 24, wherein the audio output device is a wireless portable device comprising a mobile phone, tablet, or phablet.