Structure files #73

vsumaria · 2022-01-23T22:26:25Z

Is there a possibility to use extended XYZ or ase trajectory files to read the structural energy and force data instead of OUTCAR?

In the data_processing.py, you load_structures function is creating a ase atoms object, so I think there should be a way to directly read files other than OUTCAR with ase for the E, F, Position data right?

JisuJung928 · 2022-01-24T02:37:19Z

Hello, vsumaria.

I fixed the format bug immediately so that up-to-date SIMPLE-NN can handle the various format including extended XYZ.
I am sorry for the inconvenience.

Please update your SIMPLE-NN and enjoy the training!

Best regards,
Jisu.

JisuJung928 · 2022-01-24T04:51:03Z

You should set train_model to false in input.yaml to run the code in parallel.
MPI parallelization is supported for only generate_feature and preprocess.

Please let me know if you have any problems during running.
Also, I recommend you to visit our new documentation page.

Best regards,
Jisu.

vsumaria · 2022-01-24T04:58:28Z

Yes, I went through the documentation and realized that parallelization was meant for only generate_feature and preprocess.

I have one issue in the evaluation process (using the same SiO2 example). I am now sure what I am doing wrong, but I changed the training input.yaml nodes to 15-15. But while evaluating using the "continue: checkpoint_bestmodel.pth.tar", I get the following error:

size mismatch for nets.Si.lin.lin_0.weight: copying a param with shape torch.Size([15, 70]) from checkpoint, the shape in current model is torch.Size([30, 70]).

vsumaria · 2022-01-24T05:04:17Z

Ok, I realized my mistake, I just had to add the nodes and other NN settings back in the evaluation input.yaml as well.

Thanks!

vsumaria · 2022-01-24T06:03:24Z

Few more issues. I was not trying to use for a more complex system I have been wanting to use the code for. It has 5 elements, as a result the number of SF are high. When trying to preprocess the data, it is leading to segmentation issues (memory problem).

Any suggestions on how to work with such large systems?

vsumaria · 2022-01-24T06:31:48Z

It would be good if there would be a way to estimate the memory usage for the preprocessing, that can help identify what parallel settings should be used.

JisuJung928 · 2022-01-25T00:49:31Z

The memory of preprocessed data increases linearly and squarely as the number of SFs and atoms, respectively.
Therefore, if you suffer from a memory problem, please reduce the number of atoms in the system first.
Additionally, the less number of SFs sometimes provides enough accuracy in the application (radial < 8 and angular < 16).
Please test whether your system really needs all SFs or not.
(We solved the LAMMPS memory problem that occurred at more than 5 elements before.)

JisuJung928 · 2022-01-25T00:52:04Z

I agree with your suggestion. We will consider how to estimate memory usage before preprocessing.

Thanks!

JisuJung928 closed this as completed Jan 24, 2022

JisuJung928 reopened this Jan 24, 2022

JisuJung928 added the enhancement New feature or request label Jan 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Structure files #73

Structure files #73

vsumaria commented Jan 23, 2022

JisuJung928 commented Jan 24, 2022

JisuJung928 commented Jan 24, 2022

vsumaria commented Jan 24, 2022

vsumaria commented Jan 24, 2022

vsumaria commented Jan 24, 2022

vsumaria commented Jan 24, 2022

JisuJung928 commented Jan 25, 2022

JisuJung928 commented Jan 25, 2022

Structure files #73

Structure files #73

Comments

vsumaria commented Jan 23, 2022

JisuJung928 commented Jan 24, 2022

JisuJung928 commented Jan 24, 2022

vsumaria commented Jan 24, 2022

vsumaria commented Jan 24, 2022

vsumaria commented Jan 24, 2022

vsumaria commented Jan 24, 2022

JisuJung928 commented Jan 25, 2022

JisuJung928 commented Jan 25, 2022