How are "feature_importances_" ordered in Scikit-learn's RandomForestRegressor

python scikit-learn random-forest

20,168

Solution 1

As mentioned in the comments, it looks like the order or feature importances is the order of the "x" input variable (which I've converted from Pandas to a Python native data structure). I use this code to generate a list of types that look like this: (feature_name, feature_importance).

zip(x.columns, clf.feature_importances_)

Solution 2

You may save the result in a pandas data frame as follows:

pandas.DataFrame({'col_name': clf.feature_importances_}, index=x.columns).sort_values(by='col_name', ascending=False)

By sorting it in a descending manner we get a hint to significant features.

20,168

Author by

Krishan Gupta

Updated on July 27, 2021

Comments

Krishan Gupta almost 3 years
If I run a model (called clf in this case), I get output that looks like this. How can I tie this to the feature inputs that were used to train the classifier?
```
>>> clf.feature_importances_

array([ 0.01621506,  0.18275428,  0.09963659,... ])
```

Recents

Why Is PNG file with Drop Shadow in Flutter Web App Grainy?

How to troubleshoot crashes detected by Google Play Store for Flutter app

Cupertino DateTime picker interfering with scroll behaviour

Why does awk -F work for most letters, but not for the letter "t"?

Flutter change focus color and icon color but not works

How to print and connect to printer using flutter desktop via usb?

Critical issues have been reported with the following SDK versions: com.google.android.gms:play-services-safetynet:17.0.0

Flutter Dart - get localized country name from country code

navigatorState is null when using pushNamed Navigation onGenerateRoutes of GetMaterialPage

Android Sdk manager not found- Flutter doctor error

Flutter Laravel Push Notification without using any third party like(firebase,onesignal..etc)

How to change the color of ElevatedButton when entering text in TextField

Related

'DecisionTreeClassifier' object has no attribute 'export_graphviz'

Scikit-learn - ValueError: Input contains NaN, infinity or a value too large for dtype('float32') with Random Forest

AttributeError: 'RandomForestClassifier' object has no attribute 'fit_transform'

RandomForestRegressor and feature_importances_ error

Random Forest hyperparameter tuning scikit-learn using GridSearchCV

scikit-learn: How to calculate root-mean-square error (RMSE) in percentage?

ValueError: Unable to coerce to Series, length must be 1: given n

How to compute precision,recall and f1 score of an imbalanced dataset for K fold cross validation?

What is the difference between cross_val_score with scoring='roc_auc' and roc_auc_score?

How to extract feature importances from an Sklearn pipeline