Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Structure files #73

Open
vsumaria opened this issue Jan 23, 2022 · 8 comments
Open

Structure files #73

vsumaria opened this issue Jan 23, 2022 · 8 comments
Labels
enhancement New feature or request

Comments

@vsumaria
Copy link

Is there a possibility to use extended XYZ or ase trajectory files to read the structural energy and force data instead of OUTCAR?

In the data_processing.py, you load_structures function is creating a ase atoms object, so I think there should be a way to directly read files other than OUTCAR with ase for the E, F, Position data right?

@JisuJung928
Copy link
Member

Hello, vsumaria.

I fixed the format bug immediately so that up-to-date SIMPLE-NN can handle the various format including extended XYZ.
I am sorry for the inconvenience.

Please update your SIMPLE-NN and enjoy the training!

Best regards,
Jisu.

@JisuJung928
Copy link
Member

You should set train_model to false in input.yaml to run the code in parallel.
MPI parallelization is supported for only generate_feature and preprocess.

Please let me know if you have any problems during running.
Also, I recommend you to visit our new documentation page.

Best regards,
Jisu.

@vsumaria
Copy link
Author

Yes, I went through the documentation and realized that parallelization was meant for only generate_feature and preprocess.

I have one issue in the evaluation process (using the same SiO2 example). I am now sure what I am doing wrong, but I changed the training input.yaml nodes to 15-15. But while evaluating using the "continue: checkpoint_bestmodel.pth.tar", I get the following error:

size mismatch for nets.Si.lin.lin_0.weight: copying a param with shape torch.Size([15, 70]) from checkpoint, the shape in current model is torch.Size([30, 70]).

@vsumaria
Copy link
Author

Ok, I realized my mistake, I just had to add the nodes and other NN settings back in the evaluation input.yaml as well.

Thanks!

@vsumaria
Copy link
Author

Few more issues. I was not trying to use for a more complex system I have been wanting to use the code for. It has 5 elements, as a result the number of SF are high. When trying to preprocess the data, it is leading to segmentation issues (memory problem).

Any suggestions on how to work with such large systems?

@JisuJung928 JisuJung928 reopened this Jan 24, 2022
@vsumaria
Copy link
Author

It would be good if there would be a way to estimate the memory usage for the preprocessing, that can help identify what parallel settings should be used.

@JisuJung928
Copy link
Member

The memory of preprocessed data increases linearly and squarely as the number of SFs and atoms, respectively.
Therefore, if you suffer from a memory problem, please reduce the number of atoms in the system first.
Additionally, the less number of SFs sometimes provides enough accuracy in the application (radial < 8 and angular < 16).
Please test whether your system really needs all SFs or not.
(We solved the LAMMPS memory problem that occurred at more than 5 elements before.)

@JisuJung928
Copy link
Member

I agree with your suggestion. We will consider how to estimate memory usage before preprocessing.

Thanks!

@JisuJung928 JisuJung928 added the enhancement New feature or request label Jan 25, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants