TSCBD (AAAI25)

This is the codebase for the paper: Content-Aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification.

Introduction

Due to the superior ability of global dependency, transformer and its variants have become the primary choice in Masked Time-series Modeling (MTM) towards time-series classification task. In this paper, we experimentally analyze that existing transformer-based MTM methods encounter with two under-explored issues when dealing with time series data: (1)they encode features by performing long-dependency ensemble averaging, which easily results in rank collapse and feature homogenization as the layer goes deeper; (2) they exhibit distinct priorities in fitting different frequency components contained in the time-series, inevitably leading to spectrum energy imbalance of encoded feature. To tackle these issues, we propose an auxiliary content-aware balanced decoder (CBD) to optimize the encoding quality in the spectrum space within masked modeling scheme. Specifically, the CBD iterates on a series of fundamental blocks, and thanks to two tailored units, each block could progressively refine the masked representation via adjusting the interaction pattern based on local content variations of time-series and learning to recalibrate the energy distribution across different frequency components. Moreover, a dual-constraint loss is devised to enhance the mutual optimization of vanilla decoder and our CBD. Extensive experimental results on ten time series classification datasets show that our method nearly surpasses a bunch of baselines. Meanwhile, a series of explanatory results are showcased to sufficiently demystify the be haviors of our method.

Get Started

1、Prepare Data.

UCR dataset can be obtained from UCR Time Series Classification Archive.
UEA dataset can be obtained from Time Series Classification Website.
The preprocessed HAR dataset can be obtained from TS-TCC codebase.

The datasets need to be placed in the dataset folder.

2、Training.

python main.py --training_mode self_supervised --selected_dataset selected_dataset

Other training configs can be found in main.py.

3、Evaluation

Train Linear:

python main.py --training_mode train_linear --selected_dataset selected_dataset

Fine Tuning:

python main.py --training_mode finetune --selected_dataset selected_dataset

Citation

If you find this repo useful, please cite our paper.

@misc{han2024contentawarebalancedspectrumencoding,
      title={Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification}, 
      author={Yudong Han and Haocong Wang and Yupeng Hu and Yongshun Gong and Xuemeng Song and Weili Guan},
      year={2024},
      eprint={2412.13232},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2412.13232}, 
}

Acknowledement

We appreciate the following github repos a lot for their valuable code base or datasets:

TS-TCC: https://github.com/emadeldeen24/TS-TCC
SimMTM: https://github.com/thuml/SimMTM?tab=readme-ov-file
TimeMAE: https://github.com/Mingyue-Cheng/TimeMAE

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
baseline_run_log		baseline_run_log
config_files		config_files
data_provider		data_provider
experiments_logs		experiments_logs
figs		figs
layer		layer
models		models
rebuttal_img		rebuttal_img
train_step		train_step
utils		utils
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TSCBD (AAAI25)

Introduction

Get Started

Citation

Acknowledement

About

Releases

Packages

Languages

License

Simon98-AI/CBD

Folders and files

Latest commit

History

Repository files navigation

TSCBD (AAAI25)

Introduction

Get Started

Citation

Acknowledement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages