Page 40 - 《软件学报》2025年第12期
P. 40
沈莉 等: swJulia: 面向新一代神威超级计算机的 Julia 语言编译系统 5421
[2] Fu HH, Liao JF, Yang JZ, Wang LN, Song ZY, Huang XM, Yang C, Xue W, Liu FF, Qiao FL, Zhao W, Yin XQ, Hou CF, Zhang CL, Ge
W, Zhang J, Wang YG, Zhou CB, Yang GW. The Sunway TaihuLight supercomputer: System and applications. Science China
Information Sciences, 2016, 59(7): 072001. [doi: 10.1007/s11432-016-5588-7]
[3] Darema F. New software technologies for the development and runtime support of complex applications. The Int’l Journal of High
Performance Computing Applications, 1999, 13(3): 180–190. [doi: 10.1177/109434209901300302]
[4] Laros III JH, Kelly SM, Levenhagen MJ, Pedretti K. Investigating methods of supporting dynamically linked executables on high
performance computing platforms. Technical Report, SAND2009-5515. Albuquerque: Sandia National Laboratories (SNL), 2009. [doi:
10.2172/1035350]
[5] Beazley DM, Ward BD, Cooke IR. The inside story on shared libraries and dynamic loading. Computing in Science & Engineering, 2001,
3(5): 90–97. [doi: 10.1109/5992.947112]
[6] Bezanson J, Karpinski S, Shah V, Edelman A. Julia: A fast dynamic language for technical computing. arXiv:1209.5145, 2012.
[7] Bezanson J, Edelman A, Karpinski S, Shah VB. Julia: A fresh approach to numerical computing. SIAM Review, 2017, 59(1): 65–98.
[doi: 10.1137/141000671]
[8] Gao JG, Zheng F, Qi FB, Ding YJ, Li HL, Lu HS, He WQ, Wei HM, Jin LF, Liu X, Gong DY, Wang F, Zheng Y, Sun HH, Zhou Z, Liu
Y, You HT. Sunway supercomputer architecture towards exascale computing: Analysis and practice. Science China Information Sciences,
2021, 64(4): 141101. [doi: 10.1007/s11432-020-3104-7]
[9] The Julia language. 2024. https://github.com/JuliaLang/julia
[10] Jansen P. TIOBE index for April 2025. 2024. https://www.tiobe.com/tiobe-index/
[11] CUDA.jl. 2024. https://github.com/JuliaGPU/CUDA.jl
[12] Besard T, Foket C, De Sutter B. Effective extensible programming: Unleashing Julia on GPUs. IEEE Trans. on Parallel and Distributed
Systems, 2019, 30(4): 827–841. [doi: 10.1109/TPDS.2018.2872064]
[13] Rackauckas C, Nie Q. DifferentialEquations.jl —A performant and feature-rich ecosystem for solving differential equations in Julia.
Journal of Open Research Software, 2017, 5(1): 15. [doi: 10.5334/jors.151]
[14] Mogensen PK, Riseth AN. Optim: A mathematical optimization package for Julia. Journal of Open Research Software, 2018, 3(24): 615.
[doi: 10.21105/joss.00615]
[15] FinEtools: Finite element tools in Julia. 2024. https://github.com/PetrKryslUCSD/FinEtools.jl
[16] Aho J, Vuotikka AJ, Frondelius T. Introduction to JuliaFEM, an open source FEM solver. Rakenteiden Mekaniikka, 2019, 52(3):
148–159. [doi: 10.23998/rm.75103]
[17] Pawar S, San O. CFD Julia: A learning module structuring an introductory course on computational fluid dynamics. Fluids, 2019, 4(3):
159. [doi: 10.3390/fluids4030159]
[18] Innes M. Flux: Elegant machine learning with Julia. Journal of Open Source Software, 2018, 3(25): 602. [doi: 10.21105/joss.00602]
[19] Luo XZ, Liu JG, Zhang P, Wang L. Yao.jl: Extensible, efficient framework for quantum algorithm design. Quantum, 2020, 4: 341. [doi:
10.22331/q-2020-10-11-341]
[20] Liu Y, Liu X, Li F, Fu HH, Yang YL, Song JW, Zhao PP, Wang Z, Peng DJ, Chen HR, Guo C, Huang HL, Wu WZ, Chen DX. Closing
the “quantum supremacy” gap: Achieving real-time simulation of a random quantum circuit using a new Sunway supercomputer. In: Proc.
of the 2021 Int’l Conf. for High Performance Computing, Network, Storage and Analysis. Saint Louis: IEEE, 2021. 1–12. [doi: 10.1145/
3458817.3487399]
[21] Kulkarni PA. JIT compilation policy for modern machines. In: Proc. of the 2011 ACM Int’l Conf. on Object Oriented Programming
Systems Languages and Applications. Portland: ACM, 2011. 773–788. [doi; 10.1145/2048066.2048126]
[22] Shen L, Zhou WH, Wang F, Xiao Q, Wu WH, Zhang LF, An H, Qi FB. swLLVM: Optimized compiler for new generation Sunway
supercomputer. Ruan Jian Xue Bao/Journal of Software, 2023, 35(5): 2359–2378 (in Chinese with English abstract). http://www.jos.org.
cn/1000-9825/6896.htm [doi: 10.13328/j.cnki.jos.006896]
[23] Wu W, Qian H, Zhu Q, Wang J, Fan XJ. Research on full-chip programming for Sunway heterogeneous many-core processor. In: Proc. of
the 3rd World Symp. on Software Engineering. Xiamen: ACM, 2021. 174–179. [doi: 10.1145/3488838.3488868]
[24] Nuzman D, Zaks A. Outer-loop vectorization-revisited for short SIMD architectures. In: Proc. of the 2008 Int’l Conf. on Parallel
Architectures and Compilation Techniques. Toronto: IEEE, 2008. 2–11.
[25] Porpodas V. Supergraph-SLP auto-vectorization. In: Proc. of the 26th Int’l Conf. on Parallel Architectures and Compilation Techniques.
Portland: IEEE, 2017. 330–342. [doi: 10.1109/PACT.2017.21]
[26] Torlai G, Mazzola G, Carrasquilla J, Troyer M, Melko R, Carleo G. Neural-network quantum state tomography. Nature Physics, 2018,
14(5): 447–450. [doi: 10.1038/s41567-018-0048-5]

