StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars

The first benchmark to test state-of-the-art time series foundation models (TSFMs) on stellar time series observations ("light curves").

A complete benchmark framework for astronomical time series. This repository includes tools for (1) preprocessing raw light curves, (2) generating embeddings (with TSFMs and Astromer), (3) engineering handcrafted features, and (4) comprehensive evaluations on clustering, classification, and out-of-distribution detection.

| 🏠Benchmark Page | 🤗Huggingface Dataset | 📖Paper |

Directory Overview

`src/datasets/`

Raw light curve preprocessing and data preparation scripts
→ See datasets/README.md for detailed preprocessing workflows

`src/model/`

Time series foundation model implementations and embedding generation

- Astromer 1 & 2: Transformer-based astronomical time series models
- Chronos: Amazon's forecasting foundation model
- Moirai: Salesforce's universal time series model
- compute_avg_embeddings.py: Generate combined embeddings from multi-band data
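As a rough illustration of what combining multi-band embeddings can look like, the sketch below takes an element-wise mean over per-band vectors. This is an assumption based on the script's name, not a description of what `compute_avg_embeddings.py` actually does; the function name `average_band_embeddings` and the band keys `"g"` and `"r"` are hypothetical.

```python
from statistics import fmean


def average_band_embeddings(band_embeddings):
    """Average per-band embedding vectors into one combined vector.

    band_embeddings: dict mapping a band name (hypothetical keys such as
    "g" or "r") to a list of floats. All vectors must share one length.
    """
    vectors = list(band_embeddings.values())
    if not vectors:
        raise ValueError("need at least one band")
    dim = len(vectors[0])
    if any(len(v) != dim for v in vectors):
        raise ValueError("all band embeddings must share one dimension")
    # zip(*vectors) groups the i-th component of every band together,
    # so each output component is the mean across bands.
    return [fmean(component) for component in zip(*vectors)]


# Toy example with two bands and 3-dimensional embeddings.
combined = average_band_embeddings({"g": [1.0, 2.0, 3.0], "r": [3.0, 4.0, 5.0]})
print(combined)  # [2.0, 3.0, 4.0]
```

The sketch uses only the standard library to stay self-contained; with real embedding arrays, an array library would be the more natural choice.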

`src/benchmark/`

Evaluation pipeline with pre-computed embeddings

- Classification: kNN, linear models, MLPs, and Random Forest, with hyperparameter optimization (HPO)
- Clustering: K-Means, hierarchical clustering, t-SNE visualization
→ See benchmark/README.md for complete evaluation workflows
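To make the idea of evaluating pre-computed embeddings concrete, here is a minimal sketch of kNN classification by majority vote over Euclidean distances. It is not the repository's evaluation code: the function `knn_predict`, the toy 2-dimensional embeddings, and the class labels are all placeholders chosen for illustration.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Predict a label for `query` by majority vote among its k nearest
    training embeddings, using Euclidean distance."""
    # Pair each training embedding's distance to the query with its label,
    # then sort so the closest neighbours come first.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


# Toy embeddings forming two well-separated groups (placeholder data).
train_X = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2], [5.0, 5.1], [5.2, 4.9], [4.9, 5.0]]
train_y = ["RR Lyrae", "RR Lyrae", "RR Lyrae", "Cepheid", "Cepheid", "Cepheid"]

print(knn_predict(train_X, train_y, [0.1, 0.1]))  # RR Lyrae
print(knn_predict(train_X, train_y, [5.0, 5.0]))  # Cepheid
```

Because the embeddings are frozen, a simple classifier like this measures how well the embedding space itself separates classes, rather than how well a model can be fine-tuned.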

`bash_script/`

Job scripts for running evaluations with hyperparameter search and repeated (multi-run) experiments

Quick Start

1. Preprocess data: datasets/ → Raw light curves to standardized format
2. Generate embeddings: model/ → Extract features using TSFMs
3. Create combined embeddings: model/compute_avg_embeddings.py → Multi-band aggregation
4. Run evaluations: benchmark/ → Classification, clustering, visualization

License

All code is released under the MIT license.
