Commit dce250b
Refactor: use less memory and optimize performance to calculate force and stress in pw base (#4047)
* Refactor: use less memory to calculate stress in pw base
* Fix: nondiagonal matrix element in PW-Stress calculation
* Fix: uspp stress calculation
* add gemm_op in gpu and add vkb_op
* delete comment
* delete comment2
* Fix: error in merging
* update ops.
* add correct vq.
* finish stress ops!
* Fix: compiling error
* Refactor: delete GlobalC used in stress_nl code
* Fix: compiling error in CUDA
* Fix: DCU error in synchronize_ptrs
* Fix: GlobalC in Sto_stress_PW
* add new stress and force in high speed.
* delete time prints.
* Fix: compiling error from merge
* Fix: compiling error from merge on GPU
* remove some if(GPU)
* finish force remove gpus.
* Memory: reduce memory allocation for hpsi and spsi in diagH_subspace_init func (GPU)
* fix: compiler error
* Memory: reduce memory allocation for hpsi and spsi in diagH_subspace_init func (GPU)
* Memory: reduce memory allocation for hpsi and spsi in diagH_subspace_init func (GPU)
* Fix: force calculation error
* Fix: error of Lcao_in_PW
* Fix: error in GPU
* Fix: force error
* Try: use one temp psi for cg
* fix: error in cg
* Fix: sdft error
* fix: compiling error
* Refactor: new class tool for pw_stress code
* fix: memory leak in stress
* Refactor: force calculation for pw code
* Fix: force error
* delete useless code of old force
* fix: compiling error on GPU/DCU
* Fix: error on GPU
* Fix: uninitialize error on force
* Fix: Makefile.objects
* update dylmr2
* Fix: compiling error
* [pre-commit.ci lite] apply automatic fixes
* Fix: force performance
* Fix: error in cuda
* Fix: compiling error in cuda
* Fix: compiling error in cuda
* [pre-commit.ci lite] apply automatic fixes
* Fix: compiling error in CUDA
* Fix: compiling error in CUDA/ROCM
* Fix: stress error
* [pre-commit.ci lite] apply automatic fixes
* update from PR comments
* [pre-commit.ci lite] apply automatic fixes
* add timer and optimize stress
* fix: g_plus_k
* [pre-commit.ci lite] apply automatic fixes
* Fix: delete pointers
* [pre-commit.ci lite] apply automatic fixes
* Fix: stress time
* [pre-commit.ci lite] apply automatic fixes
* fix: add annotations
* [pre-commit.ci lite] apply automatic fixes
---------
Co-authored-by: dyzheng <zhengdy@bjaisi.com>
Co-authored-by: Mohan Chen <mohan.chen.chen.mohan@gmail.com>
Co-authored-by: stargrys <771582678@qq.com>
Co-authored-by: wqzhou <33364058+WHUweiqingzhou@users.noreply.github.com>
Co-authored-by: Religious-J <1569978990@qq.com>
Co-authored-by: pre-commit-ci-lite[bot] <117423508+pre-commit-ci-lite[bot]@users.noreply.github.com>
1 parent 2c451de commit dce250b
File tree
40 files changed (+10831, -8828 lines changed)
- source
- module_base/module_device
- cuda
- rocm
- module_esolver
- module_hamilt_pw
- hamilt_pwdft
- kernels
- cuda
- rocm
- test
- hamilt_stodft
- module_hsolver