problems using featurewiz in a pipeline #129

Open
sdaza opened this issue Mar 28, 2025 · 1 comment

sdaza commented Mar 28, 2025

Hi, I'm trying to use featurewiz in a pipeline, but it doesn't seem to be working.

from featurewiz import FeatureWiz
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import Pipeline

# Feature selection with FeatureWiz, followed by a random forest classifier.
pipeline = Pipeline([('fw', FeatureWiz()), ('model', RandomForestClassifier())])

# %%
# X_train: two numeric columns; y_train: binary target (0/1).
pipeline.fit(X_train, y_train)

I got this error:

ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 2 dimensions. The detected shape was (2, 209334) + inhomogeneous part.

The training data has just two numeric columns, and the target is binary (0/1).

Thanks!
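
For readers landing here with the same traceback: the wording of this `ValueError` is NumPy's, and it is raised when a sequence whose elements have different shapes is converted to a single array. The detected shape `(2, 209334)` would be consistent with a two-element pair, such as a feature matrix and a target vector with 209334 rows each, being passed where one array is expected. Whether that is what happens inside this pipeline is an assumption, not something confirmed in this thread. The sketch below only illustrates the NumPy behaviour, using small made-up arrays and a hypothetical helper `try_as_array`:

```python
import numpy as np

n_rows = 5  # small stand-in for the 209334 rows mentioned in the error

X = np.zeros((n_rows, 2))        # two numeric feature columns (made-up data)
y = np.zeros(n_rows, dtype=int)  # binary target (made-up data)


def try_as_array(obj):
    """Return the converted array's shape, or the ValueError text if conversion fails."""
    try:
        return np.asarray(obj).shape
    except ValueError as exc:
        return f"ValueError: {exc}"


# A single 2-D array converts cleanly.
print(try_as_array(X))

# An (X, y) pair mixes a 2-D and a 1-D element, so NumPy cannot build
# one regular array from it and raises ValueError.
print(try_as_array((X, y)))
```

If this is the cause, calling the `fw` step's `fit_transform` on its own, outside the pipeline, and checking the type of what it returns should show a tuple rather than a single array or DataFrame.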

AutoViML (Owner) commented

Hi @sdaza
Thanks for bringing this problem to my attention 👍

Can you upload a zip file with just 20 rows from your dataset so I can test the above and find out where this issue is happening?

Also, can you try the featurewiz-polars library below for your transformer, since it is better suited to large datasets?
https://github.com/AutoViML/featurewiz_polars
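
One possible way to produce the 20-row sample requested above, sketched with the Python standard library only. The function name `write_sample_zip`, the file names, and the column names `x1`, `x2`, and `target` are all hypothetical; the thread does not say how the data is stored, so this assumes it can be iterated row by row:

```python
import csv
import io
import os
import tempfile
import zipfile
from itertools import islice


def write_sample_zip(header, rows, zip_path, n_rows=20, member_name="sample.csv"):
    """Write the header plus the first n_rows rows as a CSV inside a zip archive."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer)
    writer.writerow(header)
    # islice stops after n_rows, so a large dataset is never fully read.
    writer.writerows(islice(rows, n_rows))
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        archive.writestr(member_name, buffer.getvalue())
    return zip_path


# Made-up rows standing in for the real training data (hypothetical columns).
example_rows = ([i * 0.5, i * 2.0, i % 2] for i in range(1000))
out_path = os.path.join(tempfile.mkdtemp(), "featurewiz_sample.zip")
write_sample_zip(["x1", "x2", "target"], example_rows, out_path)
print(out_path)
```

If the data is already in a pandas DataFrame, `df.head(20).to_csv(...)` followed by zipping the resulting file would serve the same purpose.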
