Sinusoidal Loudness Estimation

Description

In recent years, the proliferation of internet streaming applications has brought about the need for low-bit rate speech and audio coding methods. Several parametric designs such as the sinusoids+transients+noise model have been somewhat successful for speech and audio synthesis. Low-bit rate and streaming applications are restricted by only being able to transmit a limited number of parameters for loudness estimation, resulting in reduced audio quality. Current techniques for selecting relevant parameters are based on either Signal to Mask Ratio (SMR) or loudness patterns. Despite their popularity, an SMR focus has a tendency to neglect perceptually relevant sinusoids, and the loudness focused methods are computationally costly and aren?t proficient in delay?critical applications.

Researchers at Arizona State University have created a novel and efficient technique for estimating relevant perceptual quantities like loudness patterns of individual sinusoids. This routine is helpful for accurate parameter selection in low-bit rate speech and audio applications.

Potential Applications

  • Speech and Audio applications
    • MPEG-4 HVXC speech coder
    • MPEG-4 HILN audio coder
  • Low-Bit Rate applications
    • Digital Audio Broadcasting
    • Internet Streaming
  • Volume Control
  • Hearing Aid Technologies

Benefits and Advantages

  • 90% faster CPU time for algorithm execution
  • Sufficient computational efficiency for real-time use
  • Less than 1/2 the loudness error of common method as number of sinusoidal components selected increases

Case ID:
M09-118P
Published:
01-25-2011
Last Updated:
12-05-2018

Patent Information

App Type:
Non-Provisional
Serial No.:
12/822,875
Patent No.:
9,055,374
File Date:
06-24-2010
Issue Date:
06-09-2015
Expire Date:
12-07-2033

For More Information, Contact