【矩阵乘法的GPU并行程序】,并画出规模和时间对比图(n=500,1000,1500,2000,3000,5000)

上传者: 42214698 | 上传时间: 2022-12-26 19:19:13 | 文件大小: 7.85MB | 文件类型: ZIP
题目描述 编写一个矩阵乘法的GPU并行程序,并且与对应规模的串行程序进行运行时间的比对(n=500,1000,1500,2000,3000,5000),画出规模和时间对比图。 矩阵A(n,n) 矩阵B(n,n) C = A x B 要求: 1、完成程序的开发并验证其正确性,完成一个实验报告(程序源代码、变量和语句的详细说明) 2、在实验报告中通过图表说明CPU串行和GPU并行在各种规模的运行时间; 3、在实验报告中通过图表说明GPU并行不同的数据分配在各种规模的运行时间。 设计思路 矩阵实验的代码环境为VS2019 community+CUDA 10.1,在vs2019中运行确定无问题后,用xtfp上传该cu文件,在shell中在跑一遍 自己写的作业,用学校分配的并行网络,跑出来的,实打实的结果 预览:https://img-blog.csdnimg.cn/87873b9ed0a840c3b156e1bc3faca024.png

文件下载

资源详情

[{"title":"( 39 个子文件 7.85MB ) 【矩阵乘法的GPU并行程序】,并画出规模和时间对比图(n=500,1000,1500,2000,3000,5000)","children":[{"title":"矩阵","children":[{"title":"juzhen_time.txt <span style='color:#111;'> 80B </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.sln <span style='color:#111;'> 1.07KB </span>","children":null,"spread":false},{"title":"kernel.cu <span style='color:#111;'> 5.77KB </span>","children":null,"spread":false},{"title":".vs","children":[{"title":"CUDA 10.1 Runtime1","children":[{"title":"v16","children":[{"title":"Browse.VC.db <span style='color:#111;'> 8.11MB </span>","children":null,"spread":false},{"title":".suo <span style='color:#111;'> 43.00KB </span>","children":null,"spread":false},{"title":"ipch","children":[{"title":"AutoPCH","children":[{"title":"74d8c74eb15a1bad","children":[{"title":"KERNEL.ipch <span style='color:#111;'> 5.06MB </span>","children":null,"spread":false}],"spread":true},{"title":"a07315cbf416576d","children":[{"title":"KERNEL.ipch <span style='color:#111;'> 5.06MB </span>","children":null,"spread":false}],"spread":true},{"title":"9b18a0c100c0ba63","children":[{"title":"KERNEL.ipch <span style='color:#111;'> 5.06MB </span>","children":null,"spread":false}],"spread":true},{"title":"f3ee4b89ea8952a5","children":[{"title":"KERNEL.ipch <span style='color:#111;'> 5.06MB </span>","children":null,"spread":false}],"spread":false}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true},{"title":"x64","children":[{"title":"Debug","children":[{"title":"CUDA 10.1 Runtime1.pdb <span style='color:#111;'> 1.07MB </span>","children":null,"spread":false},{"title":"kernel.cu.cache <span style='color:#111;'> 1.11KB </span>","children":null,"spread":false},{"title":"kernel.cu-136850102.deps <span style='color:#111;'> 6.88KB </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.log <span style='color:#111;'> 99B </span>","children":null,"spread":false},{"title":"CUDA 10..C6C9471B.tlog","children":[{"title":"CUDA 10.1 Runtime1.write.1u.tlog <span style='color:#111;'> 17.57KB </span>","children":null,"spread":false},{"title":"CudaCompile.write.1u.tlog <span style='color:#111;'> 128B </span>","children":null,"spread":false},{"title":"CudaCompile.read.1u.tlog <span style='color:#111;'> 13.69KB </span>","children":null,"spread":false},{"title":"link.command.1.tlog <span style='color:#111;'> 5.38KB </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.lastbuildstate <span style='color:#111;'> 159B </span>","children":null,"spread":false},{"title":"link.read.1.tlog <span style='color:#111;'> 7.55KB </span>","children":null,"spread":false},{"title":"link.write.1.tlog <span style='color:#111;'> 1.25KB </span>","children":null,"spread":false}],"spread":true},{"title":"kernel.cu280824394.deps <span style='color:#111;'> 6.83KB </span>","children":null,"spread":false},{"title":"kernel.cu1759965465.deps <span style='color:#111;'> 6.81KB </span>","children":null,"spread":false},{"title":"vc142.pdb <span style='color:#111;'> 116.00KB </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.ilk <span style='color:#111;'> 2.21MB </span>","children":null,"spread":false},{"title":"kernel.cu.obj <span style='color:#111;'> 110.83KB </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.lib <span style='color:#111;'> 1.89KB </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.exp <span style='color:#111;'> 783B </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.vcxproj.FileListAbsolute.txt <span style='color:#111;'> 279B </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.exe <span style='color:#111;'> 594.00KB </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.exe.recipe <span style='color:#111;'> 305B </span>","children":null,"spread":false}],"spread":false}],"spread":true},{"title":"CUDA 10.1 Runtime1.vcxproj <span style='color:#111;'> 4.52KB </span>","children":null,"spread":false},{"title":"juzhen_bing_differntType_time.txt <span style='color:#111;'> 719B </span>","children":null,"spread":false},{"title":"CUDA 10.1 Runtime1.vcxproj.user <span style='color:#111;'> 168B </span>","children":null,"spread":false},{"title":"cpp.hint <span style='color:#111;'> 225B </span>","children":null,"spread":false},{"title":"备用代码","children":[{"title":"juzhen_bing_differentType_time.cu <span style='color:#111;'> 5.76KB </span>","children":null,"spread":false},{"title":"我的矩阵备用代码.txt <span style='color:#111;'> 5.76KB </span>","children":null,"spread":false},{"title":"juzhen_chuan+bing_float.cu <span style='color:#111;'> 5.75KB </span>","children":null,"spread":false},{"title":"比较gpu在不同数据的时间.txt <span style='color:#111;'> 5.81KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"矩阵乘法的GPU并行程序.docx <span style='color:#111;'> 374.18KB </span>","children":null,"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明