Feature Selection Transformer
Feature selection is one of the transformation types available in the Transformation tool.
Before using the tool
Start with an existing workflow, and clean and prep your dataset first. Once your dataset contains only the data relevant to your business use case, start building a pipeline using the Machine Learning tools.
Add the tool
- Click the Transformation tool in the Machine Learning tool palette. Drag it to the workflow canvas, and connect it to your workflow.
  Note: A start pipeline tool is required for the transformation to function. Your workflow should contain a start pipeline tool, such as the Start Pipeline tool or the Assisted Modeling tool, before the Transformation tool.
- In Transformer, select the transformation type you want to configure.
- Configure the tool.
Configure the tool
Configure the parameters. Understand the parameters before changing them. As a best practice, avoid making assumptions, and use a test dataset to assess the performance of your model, whether or not your objective is prediction.
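As an illustration of the test-dataset advice above, here is a minimal Python sketch of holding out part of a dataset for evaluation. It is not part of the tool and does not reflect how the tool works internally; the function name `train_test_split`, the 20% test fraction, and the fixed seed are all assumptions chosen for the example.

```python
import random


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of rows and hold out a fraction as a test set.

    Hypothetical helper for illustration only.
    """
    if not 0 < test_fraction < 1:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)                      # leave the caller's data untouched
    random.Random(seed).shuffle(shuffled)      # seeded so the split is repeatable
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Ten placeholder records standing in for dataset rows.
rows = list(range(10))
train, test = train_test_split(rows, test_fraction=0.2)
```

The model is fit on `train` only, and `test` is used solely to assess performance, so the assessment is made on records the model has not seen.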
Select the features you want to use as predictors. This transformation provides an opportunity to double-check the features in your dataset, make any changes you want, and take action to improve model performance. See Machine Learning Tools Glossary.
Evaluate all the possible features in your dataset and decide which are most important to your model, that is, which will most improve model performance. The most important features are those that result in the most accurate model. There are several factors to consider.
1. Select the columns you want to use as features.
2. Clear the check box for any features you don't want to use as predictor variables.
Run the workflow to apply the configuration.
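Conceptually, the check boxes above amount to keeping a subset of columns as predictors. The following Python sketch shows that idea on a tiny in-memory dataset. It is only an analogy, not the tool's implementation: the helper `select_features`, the column names, and the choice to clear the identifier and target columns are hypothetical.

```python
def select_features(rows, selected):
    """Return rows reduced to the selected columns (hypothetical helper)."""
    if rows:
        missing = [c for c in selected if c not in rows[0]]
        if missing:
            raise KeyError(f"columns not in dataset: {missing}")
    return [{c: row[c] for c in selected} for row in rows]


# Made-up example data; column names are assumptions for illustration.
rows = [
    {"customer_id": 1, "age": 34, "income": 52000, "churned": 0},
    {"customer_id": 2, "age": 51, "income": 87000, "churned": 1},
]

all_columns = list(rows[0])
# Equivalent of clearing check boxes: here, an identifier and the target.
unchecked = {"customer_id", "churned"}
selected = [c for c in all_columns if c not in unchecked]

features = select_features(rows, selected)
```

Here `features` keeps only `age` and `income` for each row, which mirrors leaving those two check boxes selected.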