Most likely, different cells have different data formats associated with them. When bringing in XLS or XLSX input, be sure of the following:
From the
Input Data Tool options number 2 File Format, choose Microsoft Excel (*.xlsx). See Alteryx driver for Excel (.xlsx).
If using the other Microsoft Excel Legacy XLSX option,
this occurrence is due to a well-documented limitation of the Microsoft
Jet Engine which is what Alteryx uses to read Microsoft files. By
default, Microsoft reads through the first 8 rows of data to determine
the field type of a column. It then passes this information to Alteryx
and that is what we use to read the file.
The user has
the choice to work around this issue or fix it in their registry.
The workaround includes the following:
- From the
Input Data Tool configuration, First row
contains data. This will force all fields as V_String.
- Configure the Dynamic Rename Tool to "Take Field Names from First Row of Data."
- Use the Auto Field Tool to properly assign the best field type for each data column.
The fix for this involves changing a setting in your registry. These instructions are being provided as well as document from Microsoft, as Alteryx cannot be sure that changing this setting will not cause problems with other applications. For more information on this issue: http://support.microsoft.com/kb/189897.
To change the Registry setting:
- Go
to Start > Run and type "regedit"
- In
the registry go to HKey Local Machine > Software >
Microsoft > Jet > 4.0 >Engines > Excel
- Double
Click: TypeGuessRows
- Change
the value to 0 (zero)
For performance reasons,
setting the TypeGuessRows value to zero (0) is not recommended
if your Excel table is very large. When this value is set
to zero, Microsoft Excel will scan all records in your table
to determine the type of data in each column.