CleanData.converting_ascii
Converts characters in the input data to ASCII, leaving any characters in the exclusion list unchanged.
CleanData.converting_ascii([ascii_exclusion_list])
Parameters
- ascii_exclusion_list: (list, optional)
  - List of characters that should not be replaced
  - default: self.options_convert_ascii_exclusion_list
Returns None.
Notes
- The updated dataframe can be found as CleanData.clean_df.
- A copy of the cleaned data can be found in the folder CleanData.train_data_path, with the suffix CleanData.suffix_convert_ascii.
Relevant Definitions Settings
- SUFFIX_CONVERT_ASCII: suffix to append to the end of the output filename of the input data. E.g. “ASCII”.
- OPTIONS_CONVERT_ASCII_EXCLUSION_LIST: list of characters to exclude from conversion. E.g. “['€','$','Ò']”.
Examples
See Example cleanData for detailed setup and outputs.
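
To show what an exclusion list does, here is a minimal standalone sketch. It is not the CleanData implementation: the helper name to_ascii and the NFKD-based conversion are assumptions made for this example, and the library may map characters differently.

```python
import unicodedata


def to_ascii(text, exclusion_list=None):
    """Hypothetical helper: approximate non-ASCII characters with ASCII,
    leaving any character in exclusion_list unchanged."""
    excluded = set(exclusion_list or [])
    out = []
    for ch in text:
        if ch in excluded or ch.isascii():
            # Excluded and plain ASCII characters pass through untouched.
            out.append(ch)
            continue
        # NFKD splits an accented letter into base letter + combining marks;
        # encoding with errors="ignore" then drops everything non-ASCII.
        decomposed = unicodedata.normalize("NFKD", ch)
        out.append(decomposed.encode("ascii", "ignore").decode("ascii"))
    return "".join(out)


# With the exclusion list from the settings example, "Ò" and "€" are kept.
print(to_ascii("Caf\u00e9 \u00d2nix \u20ac5", ["\u20ac", "$", "\u00d2"]))
# Without an exclusion list, "Ò" becomes "O" and "€" is dropped.
print(to_ascii("Caf\u00e9 \u00d2nix \u20ac5"))
```

In this sketch a character with no ASCII decomposition (such as “€”) is removed rather than transliterated, which is one reason to list such characters in the exclusion list.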