{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T05:45:52Z","timestamp":1744263952422,"version":"3.37.3"},"reference-count":26,"publisher":"Wiley","issue":"8-9","license":[{"start":{"date-parts":[[2002,6,27]],"date-time":"2002-06-27T00:00:00Z","timestamp":1025136000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/http\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"funder":[{"name":"National Science Foundation","award":["CCR-9120008","ACI-9619020"],"award-info":[{"award-number":["CCR-9120008","ACI-9619020"]}]},{"name":"NASA-AMES","award":["NAG 2-1181"],"award-info":[{"award-number":["NAG 2-1181"]}]},{"DOI":"10.13039\/100008902","name":"Los Alamos National Laboratory","doi-asserted-by":"crossref","award":["03891-99-23"],"award-info":[{"award-number":["03891-99-23"]}],"id":[{"id":"10.13039\/100008902","id-type":"DOI","asserted-by":"crossref"}]},{"name":"DARPA and Rome Laboratory, Air Force Materiel Command, USAF","award":["F30602-96-1-0159"],"award-info":[{"award-number":["F30602-96-1-0159"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Concurrency and Computation"],"published-print":{"date-parts":[[2002,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>High\u2010Performance Fortran (HPF) was envisioned as a vehicle for modernizing legacy Fortran codes to achieve scalable parallel performance. To a large extent, today's commercially available HPF compilers have failed to deliver scalable parallel performance for a broad spectrum of applications because of insufficiently powerful compiler analysis and optimization. Substantial restructuring and hand\u2010optimization can be required to achieve acceptable performance with an HPF port of an existing Fortran application, even for regular data\u2010parallel applications. A key goal of the Rice dHPF compiler project has been to develop optimization techniques that enable a wide range of existing scientific applications to be ported easily to efficient HPF with minimal restructuring. This paper describes the challenges to effective parallelization presented by complex (but regular) data\u2010parallel applications, and then describes how the novel analysis and optimization technologies in the dHPF compiler address these challenges effectively, without major rewriting of the applications. We illustrate the techniques by describing their use for parallelizing the NAS SP and BT benchmarks. The dHPF compiler generates multipartitioned parallelizations of these codes that are approaching the scalability and efficiency of sophisticated hand\u2010coded parallelizations. Copyright \u00a9 2002 John Wiley &amp; Sons, Ltd.<\/jats:p>","DOI":"10.1002\/cpe.647","type":"journal-article","created":{"date-parts":[[2002,9,11]],"date-time":"2002-09-11T02:05:28Z","timestamp":1031709928000},"page":"741-767","source":"Crossref","is-referenced-by-count":15,"title":["Advanced optimization strategies in the Rice dHPF compiler"],"prefix":"10.1002","volume":"14","author":[{"given":"J.","family":"Mellor\u2010Crummey","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"V.","family":"Adve","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"B.","family":"Broom","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"D.","family":"Chavarr\u00eda\u2010Miranda","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"R.","family":"Fowler","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"G.","family":"Jin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"K.","family":"Kennedy","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Q.","family":"Yi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2002,6,27]]},"reference":[{"issue":"1","key":"e_1_2_1_2_2","first-page":"1","article-title":"High Performance Fortran language specification","volume":"2","year":"1993","journal-title":"Scientific Programming"},{"key":"e_1_2_1_3_2","doi-asserted-by":"crossref","DOI":"10.1063\/1.4823319","volume-title":"The High Performance Fortran Handbook","author":"Koelbel C","year":"1994"},{"unstructured":"The Portland Group. HPF versions of the NAS parallel application benchmarks.ftp:\/\/ftp.pgroup.com\/pub\/HPF\/examples\/npb.tar.gz[September1998].","key":"e_1_2_1_4_2"},{"unstructured":"BaileyD HarrisT SaphirW van der WijngaartR WooA YarrowM.The NAS parallel benchmarks 2.0.Technical Report NAS\u201095\u2010020 NASA Ames Research Center December1995.","key":"e_1_2_1_5_2"},{"key":"e_1_2_1_6_2","first-page":"102","volume-title":"Proceedings of Supercomputing 1993","author":"Van der Wijngaart RF","year":"1993"},{"volume-title":"Proceedings of the SIGPLAN '98 Conference on Programming Language Design and Implementation","year":"1998","author":"Adve V","key":"e_1_2_1_7_2"},{"volume-title":"Proceedings of SC98: High Performance Computing and Networking","year":"1998","author":"Adve V","key":"e_1_2_1_8_2"},{"doi-asserted-by":"publisher","key":"e_1_2_1_9_2","DOI":"10.1007\/3-540-40889-4_21"},{"key":"e_1_2_1_10_2","series-title":"Lecture Notes in Computer Science","volume-title":"European Conference on Parallel Computing (Euro\u2013Par)","author":"Chavarr\u00eda\u2010Miranda D","year":"2001"},{"unstructured":"DarteA Mellor\u2010CrummeyJ FowlerR Chavarr\u00eda\u2010MirandaD.On efficient parallelization of line\u2010sweep computations.9th Workshop on Compilers for Parallel Computers Edinburgh Scotland June2001.","key":"e_1_2_1_11_2"},{"key":"e_1_2_1_12_2","first-page":"613","volume-title":"Frontiers '95: The 5th Symposium on the Frontiers of Massively Parallel Computation","author":"Paleczny M","year":"1995"},{"unstructured":"AdveVikram JinGuohua Mellor\u2010CrummeyJohn KennedyKen YiQing.Design and evaluation of a computation partitioning framework for data\u2010parallel compilers.Technical Report CS\u2010TR01\u2010382 Department of Computer Science Rice University 2001.","key":"e_1_2_1_13_2"},{"volume-title":"Proceedings of the SIGPLAN '89 Conference on Programming Language Design and Implementation","year":"1989","author":"Rogers A","key":"e_1_2_1_14_2"},{"doi-asserted-by":"publisher","key":"e_1_2_1_15_2","DOI":"10.1145\/63047.63061"},{"doi-asserted-by":"publisher","key":"e_1_2_1_16_2","DOI":"10.1137\/0908060"},{"doi-asserted-by":"publisher","key":"e_1_2_1_17_2","DOI":"10.1142\/S0129053393000025"},{"key":"e_1_2_1_18_2","first-page":"31","volume-title":"Proceedings of the Fourth Symposium on Operating Systems Design and Implementation (OSDI 2000)","author":"Brown AD","year":"2000"},{"doi-asserted-by":"publisher","key":"e_1_2_1_19_2","DOI":"10.1016\/S0167-8191(97)00114-2"},{"issue":"3","key":"e_1_2_1_20_2","first-page":"5","article-title":"Compiling High Performance Fortran for distributed\u2010memory systems","volume":"7","author":"Harris J","year":"1995","journal-title":"Digital Technical Journal of Digital Equipment Corporation"},{"volume-title":"Proceedings of the SIGPLAN '93 Conference on Programming Language Design and Implementation","year":"1993","author":"Amarasinghe S","key":"e_1_2_1_21_2"},{"unstructured":"KellyW MaslovV PughW RosserE ShpeismanT WonnacottD.The Omega library interface guide.Technical Report Department of Computer Science University of Maryland College Park MD April1996.","key":"e_1_2_1_22_2"},{"doi-asserted-by":"publisher","key":"e_1_2_1_23_2","DOI":"10.1002\/cpe.4330020303"},{"doi-asserted-by":"crossref","unstructured":"Chavarr\u00eda\u2010MirandaD Mellor\u2010CrummeyJ TsarangT.Data\u2010parallel compiler support for multipartitioning.Technical Report CS\u2010TR01\u2010374 Department of Computer Science Rice University March2001.","key":"e_1_2_1_24_2","DOI":"10.1007\/3-540-44681-8_36"},{"doi-asserted-by":"publisher","key":"e_1_2_1_25_2","DOI":"10.1023\/A:1018776615791"},{"volume-title":"Proceedings of Supercomputing '97","year":"1997","author":"Parashar M","key":"e_1_2_1_26_2"},{"doi-asserted-by":"publisher","key":"e_1_2_1_27_2","DOI":"10.1006\/jpdc.2001.1724"}],"container-title":["Concurrency and Computation: Practice and Experience"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fcpe.647","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fcpe.647","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/cpe.647","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,9]],"date-time":"2024-12-09T01:05:39Z","timestamp":1733706339000},"score":1,"resource":{"primary":{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/onlinelibrary.wiley.com\/doi\/10.1002\/cpe.647"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2002,6,27]]},"references-count":26,"journal-issue":{"issue":"8-9","published-print":{"date-parts":[[2002,7]]}},"alternative-id":["10.1002\/cpe.647"],"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/doi.org\/10.1002\/cpe.647","archive":["Portico"],"relation":{},"ISSN":["1532-0626","1532-0634"],"issn-type":[{"type":"print","value":"1532-0626"},{"type":"electronic","value":"1532-0634"}],"subject":[],"published":{"date-parts":[[2002,6,27]]}}}