SIMD signal processing with NEON
SUMMARY In a previous article we demonstrated how we use CortexM0 and CortexM4 to implement signal processing algorithms where power and memory optimization are the key success criteria. We detail here one high-performance algorithm of our catalog optimized for CortexA-ARMv8. IIR/BiQuad filters are a key building block used in digital signal processing. We describe a floating-point filter implementation using advanced SIMD NEON. The algorithm is optimized to consume less than 4 CPU cycles per sample. PROBLEM Time-to-market is a key business success criterion for the design of consumer products. Here are some reasons to go fast go straight with ARM processors and advanced SIMD NEON: Here we want to design an IIR/BiQuad filter processing non-interleaved audio samples. IIR filtering is a challenge for the firmware designer because the pipeline depth gives a limit to the maximum data throughput, due to the recursive loops: you need to wait the computation of the recursive path before saving the next samples. The longer the pipeline depth and the longer it takes to compute the filtered audio samples. There is quite a high number of audio channels in multimedia audio systems like the one found in cars. Each channel of the original 5.1 format is processed through a cascade of IIR filters to compensate the frequency response of each loudspeakers, and to give some specific user experience depending on use-cases mixing (telephony, GPS voice, alarm, music, …). Consequently, the IIR filter must be implemented with optimized codes for power and latency reasons. SOLUTION At Firmware-Developments we have cumulated years of expertise in firmware optimization topics, both on the problems of signal quality, standards, patents and low foot-print fixed-point implementations. You can contact us to tune for you this IIR program which has below characteristics: CONTACTS Firmware Developments email : contact @ firmware-developments.com Phone Number +33 698 846 090 Address : “Les Alcyons”, 5b Av. de l’Ilette, 06600 Antibes, France. Websites : firmware-developments.com and twitter.com/fw_devs REFERENCES CortexA72 Software Optimization Guide. ARM Architecture Reference Manual ARMv8, for ARMv8-A architecture profile. ARM CortexA Series - Programmer’s Guide for ARMv8-A. Choosing the Best Processor for your Audio DSP Application. AES137 (L. A. 2014)