Hello, team #215

barneylogo · 2024-08-16T12:24:27Z

No description provided.

barneylogo · 2025-01-24T20:43:49Z

Hello, team
I'd like to get help for running nanotron
So actually, I am running script for my datasets.
I am using this config

...
 dataset_overwrite_cache: false
      dataset_processing_num_proc_per_process: 1
      hf_dataset_config_name: null
      hf_dataset_or_datasets:  barneylogo/refined_data
      hf_dataset_splits: train
      text_column_name: text
...

But in here, I want to add revision, too. like load_dataset of dataset

load_dataset("barneylogo/refined_data", split=['train], revision="aaffc6609084f420a551b50fcabaa73b84344470")

xrsrke · 2025-01-31T09:39:48Z

#275 (comment)

barneylogo closed this as completed Aug 16, 2024

barneylogo reopened this Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hello, team #215

Hello, team #215

barneylogo commented Aug 16, 2024

barneylogo commented Jan 24, 2025 •

edited

Loading

xrsrke commented Jan 31, 2025

Hello, team #215

Hello, team #215

Comments

barneylogo commented Aug 16, 2024

barneylogo commented Jan 24, 2025 • edited Loading

xrsrke commented Jan 31, 2025

barneylogo commented Jan 24, 2025 •

edited

Loading