forked from academicpages/academicpages.github.io
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
bb185a5
commit 1eb2f9c
Showing
5 changed files
with
33 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
--- | ||
title: "CLOUD: A Scientific Foundation Model for Crystal Property Prediction" | ||
collection: talks | ||
type: "Poster" | ||
permalink: /talks/2024-04-03-Cloud | ||
venue: "MICDE Scientific Foundation Model Conference" | ||
date: 2024-04-03 | ||
location: "Ann Arbor, MI" | ||
--- | ||
|
||
|
||
**Abstract** | ||
|
||
Property prediction of crystals is crucial for material design. However, developing machine learning models for these tasks is hampered by the need for labeled data from costly experiments or Density Functional Theory (DFT), resulting in limited data size and poor generalization to new crystals. Foundation models (FMs) present a potential solution with their self-supervised pretraining on unlabeled datasets for better representation learning and transferability. Yet, applying FMs to crystals is challenging due to the sparse number of valid structures for pretraining and the inadequacy of existing representations to capture critical structural information like symmetry. Herein, We propose the CrystaL fOUnDation model (CLOUD), a Transformer-based foundation model for crystal property prediction. CLOUD utilizes a novel symmetry-aware string representation that efficiently encodes symmetry, equivalent sites, and constituting atoms, eliminating the need for coordinate information or equivariant models. Pretrained on million-scale crystal data from various databases via Masked Language Modeling (MLM), CLOUD is then fine-tuned and assessed on eight MatBench datasets. The model not only significantly outperforms structure-agnostic models and achieves near state-of-the-art results on two datasets, but also demonstrates robust scaling with data and model size. This suggests CLOUD's potential as a scalable solution for crystal foundation models, capable of learning from billions of unlabeled crystal data. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
--- | ||
title: "CLOUD: A Scientific Foundation Model for Crystal Property Prediction" | ||
collection: talks | ||
type: "Poster" | ||
permalink: /talks/2024-06-19-Cloud | ||
venue: "Molecular Machine Learning Conference" | ||
date: 2024-06-19 | ||
location: "Montreal, Quebec" | ||
--- | ||
|
||
Spotlight Paper at MoML 2024 | ||
|
||
**Abstract** | ||
|
||
Property prediction of crystals is crucial for material design. However, developing machine learning models for these tasks is hampered by the need for labeled data from costly experiments or Density Functional Theory (DFT), resulting in limited data size and poor generalization to new crystals. Foundation models (FMs) present a potential solution with their self-supervised pretraining on unlabeled datasets for better representation learning and transferability. Yet, applying FMs to crystals is challenging due to the sparse number of valid structures for pretraining and the inadequacy of existing representations to capture critical structural information like symmetry. Herein, We propose the CrystaL fOUnDation model (CLOUD), a Transformer-based foundation model for crystal property prediction. CLOUD utilizes a novel symmetry-aware string representation that efficiently encodes symmetry, equivalent sites, and constituting atoms, eliminating the need for coordinate information or equivariant models. Pretrained on million-scale crystal data from various databases via Masked Language Modeling (MLM), CLOUD is then fine-tuned and assessed on eight MatBench datasets. The model not only significantly outperforms structure-agnostic models and achieves near state-of-the-art results on two datasets, but also demonstrates robust scaling with data and model size. This suggests CLOUD's potential as a scalable solution for crystal foundation models, capable of learning from billions of unlabeled crystal data. |
Binary file not shown.