Appendix
This lists all possible inputs which can be listed in inp.yaml
. Some options may be omitted, in which case default values are chosen. The type
columns follow the Python typing
conventions.
Salted definition (inp.salted)
var name |
type |
usage |
saltedname |
str |
A label to identify a particular training setup |
saltedpath |
str |
Location of all files produced by SALTED |
System definition (inp.system)
var name |
type |
usage |
filename |
str |
An XYZ file consisting of input structures |
species |
List[str] |
Ordered list of element species |
average |
bool |
Whether we use averaged coefficients to set an offset. Normally this should be true, unless a delta-density is learned. |
parallel |
bool |
Whether to use MPI parallelization |
field |
bool |
Option for using external field. For predicting densities without external fields, set to False |
var name |
type |
usage |
path2qm |
str |
Location of training data |
qmcode |
Union[Literal["aims"], Literal["cp2k"], Literal["pyscf"]] |
Which ab initio software was used to generate training data. |
qmbasis |
str |
Basis set to use when generating the training data (unused by AIMS) |
dfbasis |
str |
A label for the auxiliary basis set used to expand the density |
Rascaline atomic environment parameters (inp.descriptor.rep[n])
[n]
below stands for nth local environment. E.g. rep[n]
should be rep1
and rep2
for the first and second local environment respectively. See SOAP descriptions for more details.
var name |
type |
usage |
type |
Union[Literal["rho"], Literal["V"]] |
Representation type, "rho" for atomic density and "V" for atomic potential |
rcut |
float |
Radial cutoff (Angstrom) |
nrad |
int |
Number of radial functions |
nang |
int |
Number of angular functions |
sig |
float |
Gaussian function width (Angstrom) |
neighspe |
List[str] |
Ordered list of atomic species |
Feature sparsification parameters (inp.descriptor.sparsify)
var name |
type |
usage |
nsamples |
int |
Number of structures to use for feature sparsification |
ncut |
int |
Sets maximum number of SOAP features kept. |
Prediction variables (inp.prediction)
var name |
type |
usage |
filename |
str |
An XYZ file consisting of structures whose densities we wish to predict |
predname |
str |
A label to identify a particular set of predictions |
predict_data |
str |
Path to ab initio output for prediction, relative to path2qm |
ML variables (inp.gpr)
var name |
type |
usage |
z |
float |
Kernel exponent \(\zeta\) for GPR |
Menv |
int |
Number of reference environments |
Ntrain |
int |
Number of training structures |
trainfrac |
float |
Training dataset fraction. Training dataset size is \(\mathrm{Ntrain}\times\mathrm{trainfrac}\) |
regul |
float |
Regularization parameter \(\eta\) for GPR |
eigcut |
float |
Eigenvalues cutoff for RKHS projection |
gradtol |
float |
Minimum gradient norm tolerance for CG minimization |
restart |
bool |
Whether to restart from previous minimization checkpoint |
blocksize |
int |
Divide dataset into blocks with blocksize for MPI matrix inversion |
trainsel |
Union[Literal["sequential"], Literal["random"]] |
Train at random or sequentially for MPI matrix inversion |