Commit 1a22a94

Conchylicultor authored and copybara-github committed
Update TFDS to v4.1.0
* It is now possible to manually download the data for all datasets (if the automated download fails for any reason). See [doc](https://www.tensorflow.org/datasets/overview#load_a_dataset).
* Simplification of the dataset creation API.
  * We've made it easier to create datasets outside the TFDS repository (see our updated [dataset creation guide](https://www.tensorflow.org/datasets/add_dataset)).
  * `_split_generators` should now return `{'split_name': self._generate_examples(), ...}` (but current datasets are backward compatible).
  * All datasets inherit from `tfds.core.GeneratorBasedBuilder`. Converting a dataset to Beam now only requires changing `_generate_examples` (see [example and doc](https://www.tensorflow.org/datasets/beam_datasets#instructions)).
  * `tfds.core.SplitGenerator` and `tfds.core.BeamBasedBuilder` are deprecated and will be removed in a future version.
* Better `pathlib.Path`, `os.PathLike` compatibility:
  * `dl_manager.manual_dir` now returns a pathlib-like object. Example:

    ```python
    text = (dl_manager.manual_dir / 'downloaded-text.txt').read_text()
    ```
  * Note: Other `dl_manager.download`, `.extract`, ... will return pathlib-like objects in future versions.
  * `FeatureConnector`, ... and most functions should accept `PathLike` objects. Let us know if some functions you need are missing.
  * Add a `tfds.core.as_path` to create `pathlib.Path`-like objects compatible with GCS (e.g. `tfds.core.as_path('gs://my-bucket/labels.csv').read_text()`).
* Other bug fixes and improvements. E.g.
  * Add `verify_ssl=` option to `tfds.download.DownloadConfig` to disable SSL certificate verification during download.
  * `BuilderConfig` are now compatible with Beam datasets #2348
  * `--record_checksums` now assumes the new dataset-as-folder model.
  * `tfds.features.Image` can accept encoded `bytes` images directly (useful when used with `img_name, img_bytes = dl_manager.iter_archive('images.zip')`).
  * Doc API now shows deprecated methods; abstract methods to override are now documented.
  * You can generate `imagenet2012` with only a single split (e.g. only the validation data). Other splits will be skipped if not present.
* And of course, new datasets...

Thank you to all our contributors for improving TFDS!

PiperOrigin-RevId: 340614460
1 parent 34fe11e commit 1a22a94
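
The commit message describes two API changes that are easier to see in code: `_split_generators` returning a dict of split names to example generators, and `manual_dir` behaving like a `pathlib` path. The following is a minimal sketch of that pattern using only the standard library. `ToyBuilder`, `build_demo`, and the file names are hypothetical, a plain `pathlib.Path` stands in for `dl_manager.manual_dir`, and the directory is passed in directly for brevity rather than through a download manager; it is not the TFDS implementation.

```python
import pathlib
import tempfile


class ToyBuilder:
  """Hypothetical stand-in for a `tfds.core.GeneratorBasedBuilder` subclass."""

  def _split_generators(self, manual_dir):
    # New style described in the commit message: return a dict that maps
    # each split name to the generator producing its examples.
    return {
        'train': self._generate_examples(manual_dir / 'train.txt'),
        'test': self._generate_examples(manual_dir / 'test.txt'),
    }

  def _generate_examples(self, path):
    # `path` is pathlib-like, so `read_text()` works directly on it.
    # Yield (key, example) pairs, one per line of the text file.
    for i, line in enumerate(path.read_text().splitlines()):
      yield i, {'text': line}


def build_demo():
  """Writes two tiny files and materializes both splits from them."""
  with tempfile.TemporaryDirectory() as tmp:
    manual_dir = pathlib.Path(tmp)  # Assumption: stand-in for dl_manager.manual_dir.
    (manual_dir / 'train.txt').write_text('a\nb\n')
    (manual_dir / 'test.txt').write_text('c\n')
    splits = ToyBuilder()._split_generators(manual_dir)
    # Generators are lazy, so consume them while the files still exist.
    return {name: list(examples) for name, examples in splits.items()}
```

Calling `build_demo()` returns one list of `(key, example)` pairs per split, which mirrors how a split name selects a generator in the dict-based style.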

3 files changed: +40 -19 lines changed

tensorflow_datasets/stable_versions.txt

Lines changed: 36 additions & 15 deletions
@@ -174,18 +174,27 @@ c4/en.noclean/2.2.0
 c4/en.noclean/2.2.1
 c4/en.noclean/2.3.0
 c4/en.noclean/2.3.1
-c4/en.realnewslike/2.2.0
-c4/en.realnewslike/2.2.1
-c4/en.realnewslike/2.3.0
-c4/en.realnewslike/2.3.1
-c4/en.webtextlike/2.2.0
-c4/en.webtextlike/2.2.1
-c4/en.webtextlike/2.3.0
-c4/en.webtextlike/2.3.1
+c4/en.noclean/3.0.1
 c4/en/2.2.0
 c4/en/2.2.1
 c4/en/2.3.0
 c4/en/2.3.1
+c4/en/3.0.1
+c4/multilingual/2.2.0
+c4/multilingual/2.2.1
+c4/multilingual/2.3.0
+c4/multilingual/2.3.1
+c4/multilingual/3.0.1
+c4/realnewslike/2.2.0
+c4/realnewslike/2.2.1
+c4/realnewslike/2.3.0
+c4/realnewslike/2.3.1
+c4/realnewslike/3.0.1
+c4/webtextlike/2.2.0
+c4/webtextlike/2.2.1
+c4/webtextlike/2.3.0
+c4/webtextlike/2.3.1
+c4/webtextlike/3.0.1
 caltech101/3.0.0
 caltech_birds2010/0.1.1
 caltech_birds2011/0.1.1
@@ -386,6 +395,7 @@ cifar10_corrupted/zoom_blur_3/1.0.0
 cifar10_corrupted/zoom_blur_4/1.0.0
 cifar10_corrupted/zoom_blur_5/1.0.0
 citrus_leaves/0.1.1
+cityscapes/semantic_segmentation/1.0.0
 civil_comments/CivilComments/1.0.0
 civil_comments/CivilCommentsIdentities/1.0.0
 clevr/3.0.0
@@ -434,9 +444,9 @@ cos_e/0.0.1
 cosmos_qa/1.0.0
 covid19sum/1.0.0
 crema_d/1.0.0
-curated_breast_imaging_ddsm/original-calc/2.0.1
-curated_breast_imaging_ddsm/original-mass/2.0.1
-curated_breast_imaging_ddsm/patches/2.0.1
+curated_breast_imaging_ddsm/original-calc/3.0.0
+curated_breast_imaging_ddsm/original-mass/3.0.0
+curated_breast_imaging_ddsm/patches/3.0.0
 cycle_gan/apple2orange/2.0.0
 cycle_gan/cezanne2photo/2.0.0
 cycle_gan/cityscapes/2.0.0
@@ -448,8 +458,7 @@ cycle_gan/monet2photo/2.0.0
 cycle_gan/summer2winter_yosemite/2.0.0
 cycle_gan/ukiyoe2photo/2.0.0
 cycle_gan/vangogh2photo/2.0.0
-deep_weeds/1.0.0
-deep_weeds/2.0.0
+deep_weeds/3.0.0
 definite_pronoun_resolution/1.1.0
 dementiabank/1.0.0
 diabetic_retinopathy_detection/1M/3.0.0
@@ -473,6 +482,7 @@ dsprites/2.0.0
 dsprites/2.1.0
 dtd/3.0.1
 duke_ultrasound/1.0.0
+e2e_cleaned/0.1.0
 emnist/balanced/3.0.0
 emnist/byclass/3.0.0
 emnist/bymerge/3.0.0
@@ -648,8 +658,8 @@ imagenet_resized/32x32/0.1.0
 imagenet_resized/64x64/0.1.0
 imagenet_resized/8x8/0.1.0
 imagenet_v2/matched-frequency/1.0.0
-imagenet_v2/threshold-0.7/0.1.1
-imagenet_v2/topimages/0.1.1
+imagenet_v2/threshold-0.7/1.0.0
+imagenet_v2/topimages/1.0.0
 imagenette/160px-v2/0.1.0
 imagenette/160px/0.1.0
 imagenette/320px-v2/0.1.0
@@ -668,6 +678,7 @@ iris/2.0.0
 kitti/3.1.0
 kitti/3.2.0
 kmnist/3.0.1
+lambada/1.0.0
 lfw/0.1.0
 librispeech/plain_text/1.1.0
 librispeech/subwords32k/1.1.0
@@ -751,6 +762,13 @@ math_dataset/polynomials__simplify_power/1.0.0
 math_dataset/probability__swr_p_level_set/1.0.0
 math_dataset/probability__swr_p_sequence/1.0.0
 mctaco/1.0.0
+mlqa/ar/1.0.0
+mlqa/de/1.0.0
+mlqa/en/1.0.0
+mlqa/es/1.0.0
+mlqa/hi/1.0.0
+mlqa/vi/1.0.0
+mlqa/zh/1.0.0
 mnist/3.0.1
 mnist_corrupted/brightness/1.0.0
 mnist_corrupted/canny_edges/1.0.0
@@ -861,6 +879,7 @@ paws_x_wiki/ko/1.0.0
 paws_x_wiki/zh/1.0.0
 pet_finder/1.0.0
 pg19/0.1.1
+piqa/1.0.0
 places365_small/2.0.0
 plant_leaves/0.1.0
 plant_village/1.0.2
@@ -948,6 +967,7 @@ starcraft_video/move_unit_to_border_64/1.0.0
 starcraft_video/road_trip_with_medivac_128/1.0.0
 starcraft_video/road_trip_with_medivac_64/1.0.0
 stl10/1.0.0
+story_cloze/1.0.0
 sun397/standard-part1-120k/4.0.0
 sun397/standard-part10-120k/4.0.0
 sun397/standard-part2-120k/4.0.0
@@ -1068,6 +1088,7 @@ wiki40b/uk/1.3.0
 wiki40b/vi/1.3.0
 wiki40b/zh-cn/1.3.0
 wiki40b/zh-tw/1.3.0
+wiki_bio/0.1.0
 wikihow/all/1.2.0
 wikihow/sep/1.2.0
 wikipedia/20190301.aa/1.0.0
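
The entries in `stable_versions.txt` above appear to follow the shape `dataset/version` or `dataset/config/version`. This is an inference from the listed lines, not a documented format. Under that assumption, a hypothetical helper (`parse_entry` is not part of TFDS) could split an entry into its parts as sketched here:

```python
def parse_entry(entry):
  """Splits a stable-version entry into (name, config, version).

  Assumes the entry looks like 'name/version' or 'name/config/version',
  as inferred from the lines in this diff. `config` is None when absent.
  """
  parts = entry.strip().split('/')
  if len(parts) == 2:
    name, version = parts
    return name, None, version
  if len(parts) == 3:
    name, config, version = parts
    return name, config, version
  raise ValueError(f'Unexpected entry format: {entry!r}')
```

For example, `parse_entry('c4/en.noclean/3.0.1')` yields the dataset name, the config, and the version as three separate strings, while `parse_entry('deep_weeds/3.0.0')` yields `None` for the config.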

tensorflow_datasets/version.py

Lines changed: 2 additions & 2 deletions
@@ -24,8 +24,8 @@
 
 # We follow Semantic Versioning (https://semver.org/)
 _MAJOR_VERSION = '4'
-_MINOR_VERSION = '0'
-_PATCH_VERSION = '1'
+_MINOR_VERSION = '1'
+_PATCH_VERSION = '0'
 
 __version__ = '.'.join([
     _MAJOR_VERSION,

tensorflow_datasets/version_stable.py

Lines changed: 2 additions & 2 deletions
@@ -20,8 +20,8 @@
 
 # We follow Semantic Versioning (https://semver.org/)
 _MAJOR_VERSION = '4'
-_MINOR_VERSION = '0'
-_PATCH_VERSION = '1'
+_MINOR_VERSION = '1'
+_PATCH_VERSION = '0'
 
 __version__ = '.'.join([
     _MAJOR_VERSION,
