
Pre-trained feature extraction #24

Closed

C-Achard wants to merge 38 commits into weigertlab:main from C-Achard:cy/embed-extract

Conversation

C-Achard (Contributor) commented Feb 24, 2025

Allows using pre-trained backbones for feature generation.

Backbones:

  • SAM
  • DINOv2
  • Hiera
  • SAM2
  • micro-SAM
  • TAP

Embedding selection modes:

  • Nearest
  • Mean of patches (current implementation is a bit slow)
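
The two selection modes above can be sketched as follows. This is a minimal illustration with hypothetical names (`select_embeddings`, the `patch` parameter), not the PR's actual implementation: given a backbone feature map, "nearest" picks the single feature cell closest to each object centroid, while "mean of patches" averages a small window around it.

```python
import numpy as np

def select_embeddings(feat_map, coords, mode="nearest", patch=3):
    """Pick a per-object embedding from an (H, W, C) backbone feature map.

    feat_map : (H, W, C) features from a pretrained backbone (e.g. SAM, DINOv2)
    coords   : (N, 2) object centroids in feature-map pixel coordinates
    mode     : "nearest" takes the single closest cell; "mean" averages a
               patch x patch window around each centroid (slower but smoother)
    """
    H, W, C = feat_map.shape
    out = np.empty((len(coords), C), dtype=feat_map.dtype)
    for i, (y, x) in enumerate(np.round(coords).astype(int)):
        # Clamp centroids that fall outside the feature map.
        y, x = np.clip(y, 0, H - 1), np.clip(x, 0, W - 1)
        if mode == "nearest":
            out[i] = feat_map[y, x]
        else:  # mean of patches
            r = patch // 2
            win = feat_map[max(0, y - r): y + r + 1, max(0, x - r): x + r + 1]
            out[i] = win.reshape(-1, C).mean(axis=0)
    return out
```

The per-object Python loop is one reason a naive mean-of-patches pass is slow; a vectorized gather over all centroids would be the natural optimization.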

  • Data augmentation - Done

  • Add latest commits from main
  • Clean-up of unnecessary loose files and debug utilities

C-Achard and others added 8 commits March 21, 2025 17:46
* (WIP) Refactor feat_dim in CTCData + add pretrained_feats mode to train.py

* (WIP) Training with pretrained features

* Update data.py

* Fix training on several folders

* Fix SAM2 input range

* Add micro SAM

* Skip SAM2 under-the-hood preprocessing

* Fix dataset pickling issue with pretrained_config

* Minor fixes and tweaks for SAM2 features

* Small train fixes

* Revert reassign ops to inplace, enable per param clipping

* Update train.py

* (WIP) Update API for pred to use pretrained_feats

* Update pretrained_features.py

* Add input proj dropout

* Remove per param grad clipping, add input lin dropout

* Update train.py

* Add weight decay param

* Add regionprops features to WRPretrainedFeats

* Fix incorrect dims of WRFeat features

* Revert "Fix incorrect dims of WRFeat features"

This reverts commit a597bf4.

* Fix feat dim mismatch when using additional feats in WRPretrained Feats

* Add NaN guards earlier in training

* Update train.py

* Update train.py

* Continue debugging empty feats

* (WIP) Debug NaN loss

* Disable autocast for proj layer due to NaNs

* EXPERIMENTAL Self-attention in input

* Revert "EXPERIMENTAL Self-attention in input"

This reverts commit 05b8262.

* EXPERIMENTAL Additional linear for input features

* EXPERIMENTAL Fix forward and add input layernorm

* EXPERIMENTAL Normalize features at input

* Add param for additional layer

* Fix WRFeat dimension

* Several checks/fixes

* Remove dim check

* Remove input normalization, add batch norm

* Fix norm dim

* Disable batch norm affine

* Add max patches mode

* Update pretrained_features.py

* Add from_config and additional features from inference

* Update train.py

* Update model.py

* Add random features

* EXPERIMENTAL PCA preprocessor

* Fix max frames in PCA

* Add wrfeat demo

* Revert "Add wrfeat demo"

This reverts commit 84754ba.

* Update wrfeat.py

* Merge pull request weigertlab#27 from anwai98/patch-1

Remove verbosity argument in LRScheduler

* API for augmented SAM2 features (#3)

* WIP - API for augmented SAM2 features

* Disable PCA for now

* Fix for aug pretrained features

* WIP CTCDataAugPretrainedFeats

* Functional augmented CTCDataPretrained

* Update train.py

* Fix saving pre-trained models to pkl

* Add sampling from disk + fix val being augmented

* Fix h5 loading errors

* Try to fix wrong CTCData class being used after caching

* Fix incorrect dataset_kwargs use

* Fix dataset kwargs issue

* Update augs

* Update pretrained_features.py

* Test no aug

* Update pretrained_features.py

* Add online coords augmentation

* Fix key error

* Update pretrained_features.py

* Fix key dtype mismatch

* Fix empty frame handling

* Fix cropping missing A

* Update data.py

* Fix aug data pipeline

* Update data.py

* Fix A cropping

* Fix cropping return dtype

* Update data.py

* EXP Disable pos

* Update model.py

* Update model.py

* Stronger augmentations

* Update pretrained_features.py

* Add use coords toggle in model

* Augmentation fixes

* Add use_coords config+arg

* Replace WRAugContainer with WRFeat subclass

* Update lint Github action

* Remove numerize dep

* Fix training for CPU debugging

* Update models metadata and Readme

* Update model URLs

* Remove numpy 2 pin

* Fix model loading API

* Humanize dep

* WIP Regionprops features w/ offline aug pretrained feats

* use_coords=False uses time information

* Fix additional feats arg passing

* Fix additional feats exp

* Many args and training improvements/fixes

* Fix get_features

* New regionprops

* Fixes for AugPretrainedFeats loading

* TAP Features (#4)

* WIP TAPFeatures

* WIP TAP Features

* Fixes for TAP feats

* Functional TAP feats training

* Add CellposeSAM

* Fix save path recursion bug

* Update pretrained_features.py

* SAM2 high-res features

* try object-level additional encoding

* EXP Feature rotation based on aug coords

* Fixes for feature rotation transform

* Update pretrained_features.py

* Feats dimensions and rotation fixes

* Add new aug level

* Debug model w/ encoded labels

* Additional LayerNorm for features

* Add RandomScale augment

* Fix new LayerNorm dim

* Update deps + remove cropping debug message

* Updated pretrained features API (#6)

* WIP Update model API for easier handling of pretrained feats

* WIP Update pretrained_feats in model

* WIP Fix dims

* Final fixes for new API

* Disable skip crop + fix issue in wrong pretrained feats shape loading

* Fix mean patches

* Debug aug feats

* Fix wrong timepoint shift in aug pretrained feats

* Remove expand_dims args

* Update data.py

* Update train.py

* Disable intensity values aug for now

* Update normalization for pretrained augs

To re-introduce intensity values augmentations

* Update pretrained_features.py

* Fix feature mismatch btw val and train for mean patches

* Update pretrained_features.py

* Disable early stop + cfg + pt_reduced dim in cfg + fix disable xy coords + disable intens. augs

* Update pretrained augs caching and augs API

* Fix wrong arg name in model cfg

* More dims for feat rotation + debug mean patches

Also fix caching issues when running w/ and w/o regionprops between runs

* Add CoTracker

* Change default augs

* Fix CoTracker input size change for augs

* Fix prediction for latest pretrained feats API

* EXP test norm feat

* Prediction fixes

* Update pretrained_features.py

* Add explicit masks ref in agg_patches_exact

* Fix misaligned timepoint use in mean_patches

* Update feature normalization

* GaussianBlur + pretrained feats augmentations tweaks

* Norm after mean

* Revert previous; norm before mean

* Update from main (#5)

* Extend installation instructions

* Add docstrings;cursorrules

* Add docstrings;cursorrules

* Add docstrings;cursorrules

* merged

* Fixes + __init__ for pretrained features

* Add missing border_dist_fast

---------

Co-authored-by: Benjamin Gallusser <[email protected]>
Co-authored-by: Martin Weigert <[email protected]>

* Improve problematic h5 group deletion

* Clean up feature extractor

* Remove deprecated PCA args + best config

* Inference fixes

* Add median aggregation

* Use small epsilon in empty features agg to avoid NaNs downstream

* Add support for dask arrays

* Add several configs

* Add ckpt file path option

* h5 swmr + cfgs

* Add missing median mode

* Fix train args pt mode

* WIP No h5 offline augmentation

* Configs

* Load aug pt feats from RAM by default

* Add image shape record for aug feats

* Fix mistake in image shape use in rotate feats

* Fix GaussianBlur record

* Add parallel aug computation

* Disable norm + small tweaks

* Fix h5 image shape loading

* Improve feature extraction for parallel augs + fix Cotracker

* Fix typo in precompute_image_embeddings

* Handle missing objects

* Setting for seed in Random features

* Fix None seed error

* Handle occasional missing labels due to augs

* Update deepcell cfg

* Zarr caching for augmentations (#7)

* Change data caching to .zarr and improve parallelization

* Make image percentile norm consistent + update missing label msg

* Fix CellposeSAM norm step

* Fix feature mismatch

* Fix load from disk

* Disable parallel aug for CoTracker and TAP for now

* microSAM fixes + zarr install

* Update pretrained_features.py

---------

Co-authored-by: Benjamin Gallusser <[email protected]>
Co-authored-by: Martin Weigert <[email protected]>

@C-Achard C-Achard marked this pull request as ready for review August 6, 2025 09:20
@C-Achard C-Achard changed the base branch from update-train-mw to main August 6, 2025 09:20
@C-Achard C-Achard marked this pull request as draft August 6, 2025 09:21
C-Achard (Contributor, Author) commented:

See #39

@C-Achard C-Achard closed this Sep 23, 2025
