Modified: James.Misaka.Bourbon.Liu
Original: ZPLiu's Group (SDHuang, SCMa, ZPLiu et al.)
Last Update: 2022-09-05
Version: V1.2.5
Python 3.6+ (3.8 preferred)
NumPy, matplotlib, pandas, SciPy, multiprocessing, etc.
Miniconda or Anaconda is recommended for constructing a Python 3.8 environment
The original LASP_PythonLib used Python 2.7, which is TOTALLY out of date
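For example, a suitable environment can be created with conda (the env name lasp-py below is arbitrary):

    conda create -n lasp-py python=3.8 numpy scipy matplotlib pandas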
- SSW-NN auto-training of Machine Learning Potentials for LASP.
- Python (and shell/Fortran) scripts that can be used in LASP calculations, e.g.:
  - vasp2lasptrain.py converts VASP-labeled results to TrainStr.txt and TrainFor.txt (works standalone).
  - shiftformat.py converts between arc files and Traindata files (works standalone).
  - traindata_analysis.py prints information on TrainStr.txt and TrainFor.txt.
  - splitarc_auto.py splits a multi-structure arc file into individual input.arc (lasp.str) files or DFT-label input files.
  - pos_arc_shift.py converts between POSCAR and structure arc files.
- shell scripts in the little_script dir.
- computeQ.py calculates data for PlotQE usage.
StartfromVASP 0 # 0: start from SSW sampling with an existing NN pot; 1: start from allstr.arc-0 in the VASP dir, often used to train the first NN pot
Nbad 40 # number of structures sent to VASP in each cycle
cpupernode 96 # total CPU cores per node (running a job across 2 or more nodes is not suggested)
SSWcheckcycle 600 # SSW check interval: 600 seconds
%block cpuperjob
SSW 24 # cores per SSW job
VASP 24 # cores per VASP job
NN 0 # cores per NN training job; should be designated in jobs.sh
%endblock cpuperjob
%block prog
SSW /home10/bin/lasp-1.0-release/lasp
VASP /home10/bin/lasp-1.0-release/lasp
VASPgamma /home10/bin/lasp-1.0-release/lasp.gamma
NN /home10/bin/lasp-1.0-release/lasp
%endblock prog
%block base
O 0.0
H 0.0
%endblock base
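The input file above is plain keyword/value pairs plus %block ... %endblock sections. A minimal reader for this format might look like the following Python sketch (hypothetical, not part of this library):

    # minimal sketch (not part of this library): parse keyword/value lines
    # and %block ... %endblock sections of the auto-train input file
    def read_autoinput(path):
        params, blocks = {}, {}
        block = None
        with open(path) as f:
            for raw in f:
                line = raw.split('#', 1)[0].strip()  # drop trailing comments
                if not line:
                    continue
                if line.lower().startswith('%block'):
                    block = line.split()[1]
                    blocks[block] = {}
                elif line.lower().startswith('%endblock'):
                    block = None
                else:
                    key, *val = line.split()
                    target = blocks[block] if block else params
                    target[key] = val[0] if len(val) == 1 else val
        return params, blocks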
- make sure the name of the NN pot is correct, e.g. sed -i 's/H2O/PtOH.pot/g' jobs.sh
- modify the number of cycles (default is 100) in: for i in {1..100}
- modify the CPUs/cores required for your computing cluster (set this in jobs.sh)
In SSW/sourcedir/allstr-ini.arc:
you can build allstr-ini.arc from the example structures that you need to add and train into your pot; a sketch for merging them follows below
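A minimal sketch for assembling allstr-ini.arc from several single-structure arc files, assuming each arc file starts with a two-line "!BIOSYM archive" header that should appear only once in the merged file (verify against your own files):

    # hypothetical helper, not part of this library
    def merge_arcs(sources, target='allstr-ini.arc'):
        with open(target, 'w') as out:
            for i, src in enumerate(sources):
                with open(src) as f:
                    lines = f.readlines()
                # keep the two-line arc header only from the first file
                out.writelines(lines if i == 0 else lines[2:])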
In rootdir/NN you should prepare:
lasp.in # lasp_NNtrain input file
H2O.pot # not required if starting from scratch
H2O.input # if starting from scratch, use "newrun" for the pot
TrainStr.txt # if starting from scratch, just create an empty file (see the sketch below)
TrainFor.txt # if starting from scratch, just create an empty file
adjust_factor # optional, can be ignored
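For a from-scratch run, the empty training files can be created with a few lines of Python (a trivial sketch; file names follow the list above):

    # create empty TrainStr.txt / TrainFor.txt in rootdir/NN for a from-scratch training
    from pathlib import Path
    nn = Path('NN')
    nn.mkdir(exist_ok=True)
    for name in ('TrainStr.txt', 'TrainFor.txt'):
        (nn / name).touch()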
LASP already provides many Train*.txt files for different systems; please first download TrainStr.txt and TrainFor.txt from www.lasphub.com
You'd better have an Anaconda env on your server; otherwise you can use Intel Python by defining it in .bashrc:
export PATH=/data/apps/intel/intelpython3/bin:$PATH
- related to traindata
  - shiftformat.py: arc2train or train2arc conversion
  - vasp2lasptrain.py: converts VASP-DFT results directly to TrainStr.txt and TrainFor.txt
  - cut_traindata.py: cuts TrainStr/TrainFor by
  - traindata_analysis.py: prints statistical information on TrainStr/TrainFor
- related to arc_data
  - findGM.py: finds the top 100 (configurable) global-minimum structures from SSW results (see the sketch after this list)
  - splitarc_auto.py: splits an all-structure arc file into one-structure arc files (or a DFT-project dir)
  - nodejob.py and nodejob_coor.py: collect all structures from SSW for VASP-DFT
  - collect_vasp_label.py: collects and screens all structures from SSW for VASP-DFT (not tested)
  - screen_data.py: from auto.py, used to screen all structures from SSW results (not tested)
- related to coordination patterns
  - dft_setting dir: used by coor_verlet_sample.py
  - coor_verlet_sample.py: two-step dynamic Verlet sampling based on structure similarity described by coordination patterns
  - update_patterns.py: generates coordination patterns for all structures (and can add them to an existing database)
  - the parallel version is still being coded/refined; this version is slower
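As an illustration of the findGM.py idea only (a hypothetical sketch, not the script itself; it assumes every frame header in the SSW all-structure arc file contains a line with the word "Energy" whose last field is the total energy; check your own all.arc):

    # rank arc frames by energy and keep the n lowest (hypothetical sketch)
    def top_minima(arcfile, n=100):
        energies = []
        with open(arcfile) as f:
            for lineno, line in enumerate(f, 1):
                if 'Energy' in line:
                    energies.append((float(line.split()[-1]), lineno))
        return sorted(energies)[:n]  # (energy, line number) of the n lowest frames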
- SSW/sourcedir contains input.i, the SSW-NN input file; it may need checking
- remember to check the SSW, VASP, and NN dirs before finally running
- remember to check auto.py (SSW_choosing_mode) and jobs.sh before qsub/sbatch
- the coordination-patterns method is waiting to be used
- auto.py and the SSW-DFT-NN auto workflow still need to be tested