AMATH 483代做、C++程序语言代写
AMATH 483 / 583 (roche) - HW6
Due Friday May 31, 11:59pm PT
May 24, 2024
Homework 6 (80 points, 0 EC points)
1. (+20) Complex double linear system solver. Plot both the log of the residual and the log of the
normalized error ( kbAzk2
kAk1 kzk2 ✏machine ) versus the square matrix dimensions 16,32,64,...,8192 for the following
LAPACK routine. It is supported in the OpenBLAS build on Hyak. Submit your plot, and label it
accordingly.
l a p a c k i n t LAPACKE zgesv( int matrix orde r ,
l a p a c k i n t n ,
l a p a c k i n t nrhs ,
lapack compl ex doubl e ∗ a ,
l a p a c k i n t lda ,
l a p a c k i n t ∗ ipiv ,
lapack compl ex doubl e ∗ b ,
l a p a c k i n t ldb );
Use the following snippet code to initialize your matrices and rhs vectors and note the headers I use:
#include
#include
#include
#include
#include
#include
#include
#include
#include
#include
. . .
int main () {
. . .
a =( s td : : complex∗) malloc ( s izeof ( s td : : complex) ∗ ma ∗ na ) ;
b = ( s td : : complex∗) malloc ( s izeof ( s td : : complex) ∗ ma ) ;
z = ( s td : : complex∗) malloc ( s izeof ( s td : : complex) ∗ na ) ;
. . .
s rand ( 0 );
int k =0;
for ( int j = 0 ; j < na ; j++) {
for ( int i = 0 ; i < ma ; i++) {
a [ k ] = 0 . 5 − (double ) rand () / (double )RANDMAX
+ s td : : complex(0 , 1)
∗ ( 0 . 5 − (double ) rand () / (double )RANDMAX) ;
i f ( i==j ) a [ k]∗= s tat ic cas t(ma ) ;
k++;
}
}
s rand ( 1 );
for ( int i = 0 ; i < ma; i++) {
b [ i ] = 0 . 5 − (double ) rand () / (double )RANDMAX
+ s td : : complex(0 , 1)
∗ ( 0 . 5 − (double ) rand () / (double )RANDMAX) ;
}
. . .
12. (+20) CPU-GPU data copy speed on HYAK. Write a C++ code to measure the data copy performance
between the host CPU and GPU (host to device), and between the GPU and the host CPU (device to host). Copy
8 bytes to 256MB increasing in multiples of 2. Plot the bandwidth for both directions: (bytes per second) on the
y-axis and the bu↵er size in bytes on the x-axis. Submit your plot and test code.
3. (+20) Compare FFTW to CUFFT on HYAK. Measure and plot the performance of calculating the gradient
of a 3D double complex plane wave defined on cubic lattices of dimension n3 from 163 to n = 2563, stride n⇤ = 2
for both the FFTW and CUDA FFT (CUFFT) implementations on HYAK. Let each n be measured ntrial times
and plot the average performance for each case versus n, ntrial 3. Submit your performance plot which should
have ’FLOPs’ on the y-axis (or some appropriate unit of FLOPs) and the dimension of the cubic lattices (n) on
the x-axis. You will need to estimate the operation count of computing the derivative using FFT on a lattice.
4. (+20) Fourier transforms. Evaluate the Fourier transform of the following functions by hand. Use the definitions
I provided (includes p1
2⇡ , this is common in physics but also now the default used in WolframAlpha - a powerful
math AI tool) as well as the definition for Dirac delta I used in lecture if needed.
(a) f(x) = 1
p2⇡ e
1
22 (xµ)2
(b) f(t) = sin(!0t) , !0 constant
(c) f(x) = ea|x| and a > 0
(d) (distribution) f(t) = (t)
2

热门主题

课程名

mktg2509 csci 2600 38170 lng302 csse3010 phas3226 77938 arch1162 engn4536/engn6536 acx5903 comp151101 phl245 cse12 comp9312 stat3016/6016 phas0038 comp2140 6qqmb312 xjco3011 rest0005 ematm0051 5qqmn219 lubs5062m eee8155 cege0100 eap033 artd1109 mat246 etc3430 ecmm462 mis102 inft6800 ddes9903 comp6521 comp9517 comp3331/9331 comp4337 comp6008 comp9414 bu.231.790.81 man00150m csb352h math1041 eengm4100 isys1002 08 6057cem mktg3504 mthm036 mtrx1701 mth3241 eeee3086 cmp-7038b cmp-7000a ints4010 econ2151 infs5710 fins5516 fin3309 fins5510 gsoe9340 math2007 math2036 soee5010 mark3088 infs3605 elec9714 comp2271 ma214 comp2211 infs3604 600426 sit254 acct3091 bbt405 msin0116 com107/com113 mark5826 sit120 comp9021 eco2101 eeen40700 cs253 ece3114 ecmm447 chns3000 math377 itd102 comp9444 comp(2041|9044) econ0060 econ7230 mgt001371 ecs-323 cs6250 mgdi60012 mdia2012 comm221001 comm5000 ma1008 engl642 econ241 com333 math367 mis201 nbs-7041x meek16104 econ2003 comm1190 mbas902 comp-1027 dpst1091 comp7315 eppd1033 m06 ee3025 msci231 bb113/bbs1063 fc709 comp3425 comp9417 econ42915 cb9101 math1102e chme0017 fc307 mkt60104 5522usst litr1-uc6201.200 ee1102 cosc2803 math39512 omp9727 int2067/int5051 bsb151 mgt253 fc021 babs2202 mis2002s phya21 18-213 cege0012 mdia1002 math38032 mech5125 07 cisc102 mgx3110 cs240 11175 fin3020s eco3420 ictten622 comp9727 cpt111 de114102d mgm320h5s bafi1019 math21112 efim20036 mn-3503 fins5568 110.807 bcpm000028 info6030 bma0092 bcpm0054 math20212 ce335 cs365 cenv6141 ftec5580 math2010 ec3450 comm1170 ecmt1010 csci-ua.0480-003 econ12-200 ib3960 ectb60h3f cs247—assignment tk3163 ics3u ib3j80 comp20008 comp9334 eppd1063 acct2343 cct109 isys1055/3412 math350-real math2014 eec180 stat141b econ2101 msinm014/msing014/msing014b fit2004 comp643 bu1002 cm2030
联系我们
EMail: 99515681@qq.com
QQ: 99515681
留学生作业帮-留学生的知心伴侣!
工作时间:08:00-21:00
python代写
微信客服:codinghelp
站长地图