[1] |
COCHRAN W T, COOLEY J W, FAVIN D L, et al. What is the fast Fourier tranform?[J]. Proceedings of the IEEE, 1967, 55(10): 1664-1674.
DOI
URL
|
[2] |
李焱, 张云泉. 异构平台上性能自适应FFT框架[J]. 计算机研究与发展, 2014, 51(3):637-649.
|
|
LI Y, ZHANG Y Q. An automatic performance tuning framework for FFT on heterogenous platforms[J]. Journal of Computer Research and Development, 2014, 51(3): 637-649.
|
[3] |
陈暾, 李志豪, 贾海鹏, 等. 基于ARMv8平台的多维FFT实现与优化研究[J]. 计算机学报, 2019, 42(11):2384-2402.
|
|
CHEN T, LI Z H, JIA H P, et al. Multi-dimensional FFT implementation and optimization on ARMv8 platform[J]. Chinese Journal of Computers, 2019, 42(11): 2384-2402.
|
[4] |
ARM. ARM performance libraries (ARMPL) 19.2.0[EB/OL]. [2020-09-10]. https://static.docs.arm.com/101004/1920/arm_ performance_libraries_reference_101004_1920_00_en.pdf
|
[5] |
WANG E, ZHANG Q, SHEN B, et al. Intel math kernel library[M]// High-Performance Computing on the Intel® Xeon PhiTM. Berlin: Springer, 2014.
|
[6] |
FRIGO M, JOHNSON S G. FFTW: an adaptive software architecture for the FFT[C]// Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, Seattle, May 12-15, 1998. Piscataway: IEEE, 1998: 1381-1384.
|
[7] |
DUHAMEL P, VETTERLI M. Fast Fourier transforms: a tutorial review and a state of the art[J]. Signal Processing, 1990, 19(4): 259-299.
DOI
URL
|
[8] |
COOLEY J W, TUKEY J W. An algorithm for the machine calculation of complex Fourier series[J]. Mathematics of Computation, 1965, 19(90): 297-301.
DOI
URL
|
[9] |
龚彤艳, 张广婷, 贾海鹏, 等. 一种偶数基Cooley-Tukey FFT高性能实现方法[J]. 计算机科学, 2020, 47(1):31-39.
|
|
GONG T Y, ZHANG G T, JIA H P, et al.. High-performance implementation method for even basis of Cooley-Tukey FFT[J]. Computer Science, 2020, 47(1): 31-39.
|
[10] |
WANG X, JIA H P, LI Z H, et al. Implementation and optimization of multi-dimensional real FFT on ARMv8 platform[C]// LNCS 11335: Proceedings of the 18th Interna-tional Conference on Algorithms and Architectures for Parallel Processing, Guangzhou, Nov 15-17, 2018. Cham: Springer, 2018: 338-353.
|
[11] |
LI Z H, JIA H P, ZHANG Y Q, et al. Automatic generation of high-performance FFT kernels on Arm and x86 CPUs[J]. IEEE Transactions on Parallel and Distributed Systems, 2020, 31(8): 1925-1941.
DOI
URL
|
[12] |
AMD. AOCL: AMD optimizing CPU libraries[EB/OL]. [2020-09-12]. https://developer.amd.com/wp-content/resources/AMD-CPULibrariesUserGuide_1.0.pdf
|
[13] |
NVIDIA. The NVIDIA CUDA fast Fourier transform library[EB/OL]. [2020-09-23]. https://developer.nvidia.com/cufft .
|
[14] |
FRIGO M, JOHNSON S G. The design and implementation of FFTW3[J]. Proceedings of the IEEE, 2005, 93(2): 216-231.
DOI
URL
|
[15] |
Intel. Intel math kernel library (Intel MKL) 2019 update4 [EB/OL]. [2020-09-20]. https://software.intel.com/en-us/mkl .
|