Skip to content

Commit

Permalink
docs: replace "data set" with "dataset" everywhere
Browse files Browse the repository at this point in the history
  • Loading branch information
deepyaman committed Jun 21, 2024
1 parent a4919fb commit 5337593
Show file tree
Hide file tree
Showing 4 changed files with 8 additions and 8 deletions.
4 changes: 2 additions & 2 deletions docs/tutorial/pytorch.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ pip install 'ibis-framework[duckdb,examples]' ibis-ml skorch torch

## The New York City flight data

Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This data set contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:
Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This dataset contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:

```{python}
#| output: false
Expand Down Expand Up @@ -107,7 +107,7 @@ flight_data = (
flight_data
```

We can see that about 16% of the flights in this data set arrived more than 30 minutes late.
We can see that about 16% of the flights in this dataset arrived more than 30 minutes late.

```{python}
flight_data.arr_delay.value_counts().rename(n="arr_delay_count").mutate(
Expand Down
4 changes: 2 additions & 2 deletions docs/tutorial/scikit-learn.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ pip install 'ibis-framework[duckdb,examples]' ibis-ml scikit-learn

## The New York City flight data

Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This data set contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:
Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This dataset contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:

```{python}
#| output: false
Expand Down Expand Up @@ -106,7 +106,7 @@ flight_data = (
flight_data
```

We can see that about 16% of the flights in this data set arrived more than 30 minutes late.
We can see that about 16% of the flights in this dataset arrived more than 30 minutes late.

```{python}
flight_data.arr_delay.value_counts().rename(n="arr_delay_count").mutate(
Expand Down
4 changes: 2 additions & 2 deletions docs/tutorial/xgboost.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ pip install 'ibis-framework[duckdb,examples]' ibis-ml 'xgboost[scikit-learn]'

## The New York City flight data

Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This data set contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:
Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This dataset contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:

```{python}
#| output: false
Expand Down Expand Up @@ -106,7 +106,7 @@ flight_data = (
flight_data
```

We can see that about 16% of the flights in this data set arrived more than 30 minutes late.
We can see that about 16% of the flights in this dataset arrived more than 30 minutes late.

```{python}
flight_data.arr_delay.value_counts().rename(n="arr_delay_count").mutate(
Expand Down
4 changes: 2 additions & 2 deletions examples/Preprocess your data with recipes.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@
"source": [
"## The New York City flight data\n",
"\n",
"Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This data set contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:"
"Let's use the [nycflights13 data](https://github.com/hadley/nycflights13) to predict whether a plane arrives more than 30 minutes late. This dataset contains information on 325,819 flights departing near New York City in 2013. Let's start by loading the data and making a few changes to the variables:"
]
},
{
Expand Down Expand Up @@ -317,7 +317,7 @@
"id": "722b2213-3b84-4f03-9006-59bf72591613",
"metadata": {},
"source": [
"We can see that about 16% of the flights in this data set arrived more than 30 minutes late."
"We can see that about 16% of the flights in this dataset arrived more than 30 minutes late."
]
},
{
Expand Down

0 comments on commit 5337593

Please sign in to comment.