You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/python/ml-pca.md
+9-9
Original file line number
Diff line number
Diff line change
@@ -6,9 +6,9 @@ jupyter:
6
6
extension: .md
7
7
format_name: markdown
8
8
format_version: '1.3'
9
-
jupytext_version: 1.14.1
9
+
jupytext_version: 1.16.1
10
10
kernelspec:
11
-
display_name: Python 3
11
+
display_name: Python 3 (ipykernel)
12
12
language: python
13
13
name: python3
14
14
language_info:
@@ -20,7 +20,7 @@ jupyter:
20
20
name: python
21
21
nbconvert_exporter: python
22
22
pygments_lexer: ipython3
23
-
version: 3.8.8
23
+
version: 3.10.11
24
24
plotly:
25
25
description: Visualize Principle Component Analysis (PCA) of your high-dimensional
26
26
data in Python with Plotly.
@@ -105,17 +105,17 @@ fig.show()
105
105
106
106
When you will have too many features to visualize, you might be interested in only visualizing the most relevant components. Those components often capture a majority of the [explained variance](https://en.wikipedia.org/wiki/Explained_variation), which is a good way to tell if those components are sufficient for modelling this dataset.
107
107
108
-
In the example below, our dataset contains 10 features, but we only select the first 4 components, since they explain over 99% of the total variance.
108
+
In the example below, our dataset contains 8 features, but we only select the first 2 components.
109
109
110
110
```python
111
111
import pandas as pd
112
112
import plotly.express as px
113
113
from sklearn.decomposition importPCA
114
-
from sklearn.datasets importload_boston
114
+
from sklearn.datasets importfetch_california_housing
0 commit comments