Commit 3bbb200

authored Dec 27, 2022
Update readme.md
1 parent 01eef9b commit 3bbb200

1 file changed, +18 -1 lines changed
@@ -1 +1,18 @@
-BLA BLA
+**Semantic Data Versioning Benchmark (SDVB)**
+| Topic (Name) | \# of Original Tuples | \# of Original Attributes | \# of Versions | \# of Version-pairs |
+|---------------------------------------------|----------------|----------------|----------|---------------|
+| Movies and TV shows (IMDB) | 1,000 | 6 | 72 | 29 |
+| NBA Players (NBA) | 11,700 | 9 | 68 | 27 |
+| Wines Reviews (WINE) | 129,971 | 6 | 72 | 29 |
+| Iris Flowers (IRIS) | 150 | 5 | 58 | 22 |
+| Titanic Passengers (TITANIC) | 891 | 6 | 72 | 29 |
+
+- Each file contains a `problem_sets.csv` documenting the version pairs, their respective hold backs, and some documentation of the type of changes that were applied.
+
+Sources for initial versions:
+
+- IMDB: https://www.kaggle.com/datasets/harshitshankhdhar/imdb-dataset-of-top-1000-movies-and-tv-shows
+- NBA: https://www.kaggle.com/justinas/nba-players-data
+- WINE: https://www.kaggle.com/christopheiv/winemagdata130k
+- IRIS: https://www.kaggle.com/uciml/iris
+- TITANIC: https://www.kaggle.com/competitions/titanic
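
The readme says each dataset ships a `problem_sets.csv`, but it does not list the columns. As a minimal sketch for inspecting those files, the snippet below reports the header and row count of every `problem_sets.csv` it finds. The directory layout (`<root>/<DATASET>/problem_sets.csv`) and the function names `summarize_problem_sets` and `summarize_benchmark` are assumptions made for illustration; they are not taken from the repository.

```python
import csv
from pathlib import Path


def summarize_problem_sets(csv_path):
    """Return (column_names, row_count) for one problem_sets.csv file."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # Count data rows; the header row is consumed by DictReader itself.
        row_count = sum(1 for _ in reader)
        # fieldnames is None for an empty file, so fall back to an empty list.
        columns = list(reader.fieldnames or [])
    return columns, row_count


def summarize_benchmark(root):
    """Map each dataset folder name to its problem_sets.csv summary.

    Assumes one folder per dataset directly under `root` (hypothetical layout).
    """
    summaries = {}
    for csv_path in sorted(Path(root).glob("*/problem_sets.csv")):
        summaries[csv_path.parent.name] = summarize_problem_sets(csv_path)
    return summaries
```

Calling `summarize_benchmark(".")` from a checkout would print nothing by itself; it returns a dictionary keyed by folder name. If each row turns out to describe one version pair, the row counts could be compared against the "\# of Version-pairs" column in the table, though the readme does not confirm that row layout.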
