Description of feature
I looked a bit into sparse encoding. One-hot encoding being the most important:
- scikit-learn's one-hot encoding supports a
sparse_output parameter that should return a CSR matrix.
- We're getting
original_values as numpy arrays when calling the function. May or may not be fine.
- Currently we default the
sparse_output parameter to False without checking the type of matrix.
- The
_update_encoded_data does not take sparse matrices into account
Description of feature
I looked a bit into sparse encoding. One-hot encoding being the most important:
sparse_outputparameter that should return a CSR matrix.original_valuesas numpy arrays when calling the function. May or may not be fine.sparse_outputparameter toFalsewithout checking the type of matrix._update_encoded_datadoes not take sparse matrices into account