Transformation Reference
This section contains reference information on the transformations available in Dataprep by Trifacta.
Tip
Use the values in the Title column as search strings in the Search panel to begin specifying these transformations.
Name | Title | Description |
---|---|---|
scaleminmax | Scale to min max | Scale a column to a specific min max range. See Prepare Data for Machine Processing. |
onehotencode | One hot encode | Create a column for each unique value indicating its presence or absence. See Prepare Data for Machine Processing. |
scalestandardize | Scale to mean | Scale a column to zero mean and unit variance. See Prepare Data for Machine Processing. |
bincolumn | Bin column | Bin values into ranges of equal or custom size. See Prepare Data for Machine Processing. |
changetype | Change column type | Changes the data type of a column [settype]. See Change Column Data Type. |
comment | Comment | Adds a comment to your recipe [comment]. See Add Comments to Your Recipe. |
conditions | Conditional column | Returns values based on conditions such as if-then-else or case statements. See Apply Conditional Transformations. |
convertpattern | Convert patterns | Finds one or more patterns or text literals and replaces them with specified pattern values. See Standardize Using Patterns. |
countmatches | Count matches | Counts the number of matches [countpattern]. See Compute Counts. |
countmatchesbetween | Count matches between delimiters | Counts the number of matches [countpattern]. See Compute Counts. |
deduplicate | Remove duplicate rows | Removes duplicate rows where values in every column are the same. See Deduplicate Data. |
derive | New formula | Creates a new column with the result of a formula. |
drop | Delete columns | Delete one or more columns. See Remove Data. |
extractbetweendelimiters | Extract between delimiters | Extracts text found between two patterns. See Extract Values. |
extractcustom | Extract text or pattern | Extracts text found between two patterns. Variant: Custom text or pattern. See Extract Values. |
extractfirstcharacters | Extract first | Extracts text according to its position. Variant: Extract the first n characters. See Extract Values. |
extractkv | Convert key/value to Object | Extracts key-value pairs into an Object [extractkv]. See Extract Values. |
extractlastcharacters | Extract last | Extracts key-value pairs into an Object [extractkv]. Variant: Extract the last n characters. See Extract Values. |
extractlist | Extract matches to Array | Extracts a list into an Array [extractlist]. See Extract Values. |
extractrangeofcharacters | Extract between positions | Extracts text according to its position. Variant: Extract the last n characters. See Extract Values. |
extractmismatched | Extract mismatched | Extracts a list into an Array [extractlist]. Variant: The data type to match against. See Extract Values. |
extractnumbers | Extract numbers | Extracts a list into an Array [extractlist]. Variant: Extract numbers from a text. See Extract Values. |
extractquerystrings | Extract query strings | Extracts a list into an Array [extractlist]. Variant: Extract fields from an URL query string. See Extract Values. |
filtercontains | Filter contains | Filter rows that satisfy a condition. Variant: Filter rows that contain a specified value or pattern. See Filter Data. |
filtercustom | Filter custom formula | Filter rows that satisfy a condition. Variant: Filter rows that satisfy an arbitrary formula. See Filter Data. |
filterendswith | Filter ends with | Filter rows that satisfy a condition. Variant: Filter rows that ends with a specified value or pattern. See Filter Data. |
filterexactly | Filter exact | Filter rows that satisfy a condition. Variant: Filter rows that match exactly a specified value. See Filter Data. |
filternot | Filter not equals | Filters rows that do not satisfy a condition. See Filter Data. |
filterfromtop | Filter from top | Filter rows by their position. Variant: Filter rows from the top. See Filter Data. |
filtergreaterthan | Filter greater than | Filter rows that satisfy a condition. Variant: Filter rows with values greater than (or equal to) a specified value. See Filter Data. |
filterinterval | Filter at interval | Filter rows by their position. Variant: . Variant: The size of the interval to filter rows at. See Filter Data. |
filterlessthan | Filter less than | Filter rows that satisfy a condition. Variant: Filter rows with values less than (or equal to) a specified value. See Filter Data. |
filtermissing | Filter missing | Filter rows that satisfy a condition. Variant: Filter rows with missing values. See Remove Data. |
filtermismatched | Filter mismatched | Filter rows that satisfy a condition. Variant: Filter rows with mismatched values. See Filter Data. |
filteroneof | Filter in | Filter rows that satisfy a condition. Variant: Filter rows that match any of the specified values. See Filter Data. |
filterrange | Filter range | Filter rows by their position. Variant: Filter rows within a range. See Filter Data. |
filterstartswith | Filter starts with | Filter rows that satisfy a condition. Variant: Filter rows that starts with a specified value or pattern. See Filter Data. |
flatten | Expand Array to rows | Converts each element in an Array into a new row. See Working with Arrays. |
groupby | Group by | Group data and perform aggregated calculations on it. See Create Aggregations. |
join | Join datasets | Adds additional columns from other data sources [join]. See Join Window. |
locktype | Lock type | Lock column to current type. |
lowercase | Lowercase text | Format text in columns. Variant: Convert text in column to lowercase. See Modify String Values. |
leftpad | Pad with leading | Format text in columns. Variant: Add the necessary number of characters to each value to make them of the same length. See Modify String Values. |
merge | Merge columns | Concatenates the values from two or more columns into a new column [merge]. See Add Two Columns. |
move | Move columns | Moves one or more columns before or after another column [move]. See Move Columns. |
nest | Nest columns | Converts columns into an Object or Array [nest]. See Working with Arrays. |
pivot | Pivot | Creates a new column for each unique value in a column [pivot]. See Pivot Data. |
prefix | Prefix text | Format text in columns. Variant: Specify a prefix to be added at the beginning of each selected column name. See Modify String Values. |
propercase | Propercase text | Format text in columns. Variant: Convert text in column to ProperCase. See Modify String Values. |
removesymbols | Remove symbols in text | Format text in columns. Variant: Remove all non-alphanumerical characters from the text. See Remove Data. |
removewhitespace | Remove whitespace in text | Format text in columns. Variant: Remove all whitespace found in the text. See Remove Data. |
removeaccents | Remove accents in text | Remove accent marks from text. See Modify String Values. |
rename | Rename columns | Renames one or more columns [rename]. See Rename Columns. |
renamepattern | Rename with pattern | Renames one or more columns [rename]. See Rename Columns. |
renameprefix | Rename with prefix | Renames one or more columns [rename]. See Rename Columns. |
renameheader | Rename with row(s | Renames one or more columns [rename]. See Rename Columns. |
renamesuffix | Rename with suffix | Renames one or more columns [rename]. See Rename Columns. |
renameupper | Rename to UPPERCASE | Renames one or more columns [rename]. See Rename Columns. |
renamelower | Rename to lowercase | Renames one or more columns [rename]. See Rename Columns. |
renamekeepleft | Rename from beginning | Renames one or more columns [rename]. See Rename Columns. |
renamekeepright | Rename from end | Renames one or more columns [rename]. See Rename Columns. |
renamesanitize | Rename by removing special characters | Renames one or more columns [rename]. See Rename Columns. |
replacecell | Replace cells | Renames one or more columns [rename]. See Rename Columns. |
replacepattern | Replace text or pattern | Replace text matching a pattern. See Replace Cell Values. |
replacebetweenpatterns | Replace between delimiters | Replace text between delimiters. Variant: Replace text between delimiters. See Replace Cell Values. |
replacebetweenpositions | Replace between positions | Replace text between delimiters. Variant: Replaces text based on position. See Replace Cell Values. |
replacemismatched | Replace mismatched | Replace mismatched values. See Replace Cell Values. |
replacemissing | Replace missing | Replace missing values. See Replace Cell Values. |
select | Select | Create a new table of columns Selectd from your current dataset. See Select. |
set | Edit with formula | Sets the values of one or more columns to the result of a formula [set]. |
sort | Sort rows | Sorts the rows based on the values in one or more columns. |
splitondelimiter | Split on text or pattern | Split by delimiter. Variant: Text or pattern. See Split Column. |
splitbetweendelimiters | Split between delimiters | Split by delimiter. Variant: Between two delimiters. See Split Column. |
splitmultipledelimiters | Split with multiple delimiters | Split by delimiter. Variant: By multiple delimiters. See Split Column. |
splitpositions | Split at positions | Split by character position. Variant: By positions. See Split Column. |
splitevery | Split at interval | Split by character position. Variant: At regular interval. See Split Column. |
splitbetweenpositions | Split between positions | Split by character position. Variant: Between two positions. See Split Column. |
splitrows | Split into rows | Splits raw data into rows [splitrows]. See Split Column. |
suffix | Suffix text | Format text in columns. Variant: Specify a suffix to be added to the end of each selected column name. See Modify String Values. |
trimwhitespace | Trim whitespace | Format text in columns. Variant: Remove all whitespaces found at the beginning and end of the text. See Modify String Values. |
trimquotes | Trim quotes | Format text in columns. Variant: Remove quotes found at the beginning and end of the text. See Modify String Values. |
udf | Invoke external function | Creates a new column with the result of an external function. Note This feature may not be available in all product editions. For more information on available features, see Compare Editions. Note This transformation requires additional configuration. |
uppercase | Uppercase text | Format text in columns. Variant: Convert text in column to UPPERCASE. See Modify String Values. |
dateformat | Date format | Change format for Datetime columns. See Format Dates. |
union | Union datasets | Adds additional rows from other data source [union]. See Union Page. |
standardize | Standardize column | Single-column standardization for standardizing column values. See Standardize Page. |
columnbyexample | Create column from examples | Create a new column by providing example values. See Create Column by Example. |
unlocktype | Unlock type | Unlock column type. |
unnest | Unnest elements | Extracts elements from an Object or Array into columns. See Working with Arrays. |
unpivot | Unpivot | Turns columns into rows. Produces a key column with unnested values. See Pivot Data. |
valuestocols | Convert values to columns | Creates a new column for each unique value in a column [valuestocols]. See Pivot Data. |
window | Window | Performs row-based calculations across multiple ordered rows [window]. See Window Functions. |
sourcerownumber | sourcerownumber | Generate a new column containing the row number for each row from the source, if available. See Source Metadata References. |
filepath | filepath | Generate a new column containing the path to the source file, if available. See Source Metadata References. |