
CleanData.converting_ascii

Converts non-ASCII characters in the input data to ASCII, leaving any characters in the exclusion list unchanged.

CleanData.converting_ascii([ascii_exclusion_list])

Parameters

  • ascii_exclusion_list: (list, optional)
    • List of characters to exclude from conversion (these are left unchanged)
    • default: self.options_convert_ascii_exclusion_list
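
As an illustration of what an exclusion list does during ASCII conversion, the following is a minimal standalone sketch. It is not the package's implementation: the function name to_ascii and the conversion strategy (Unicode NFKD decomposition, then dropping whatever cannot be encoded as ASCII) are assumptions made for this example only.

```python
import unicodedata


def to_ascii(text, exclusion_list=None):
    """Hypothetical helper: convert text to ASCII, keeping excluded characters as-is."""
    excluded = set(exclusion_list or [])
    converted = []
    for ch in text:
        # ASCII characters and excluded characters pass through untouched.
        if ch.isascii() or ch in excluded:
            converted.append(ch)
            continue
        # Decompose the character (e.g. "é" becomes "e" plus a combining accent),
        # then drop every part that has no ASCII encoding.
        decomposed = unicodedata.normalize("NFKD", ch)
        converted.append(decomposed.encode("ascii", "ignore").decode("ascii"))
    return "".join(converted)


print(to_ascii("Café Ò €5", ["€", "$", "Ò"]))  # Cafe Ò €5
print(to_ascii("Café Ò €5"))                   # Cafe O 5
```

In this sketch a character with no ASCII decomposition, such as the euro sign, is dropped entirely unless it appears in the exclusion list. The package may handle such characters differently.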

Returns None.

Notes

  • The updated dataframe is available as CleanData.clean_df.
  • A copy of the cleaned data is saved in the folder CleanData.train_data_path, with the suffix CleanData.suffix_convert_ascii appended to the filename.
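
The exact naming scheme of the saved copy is not specified here. As a rough sketch only, the following shows one way a suffix could be appended to a filename. The helper name suffixed_path, the underscore separator, and the example folder and filename are all assumptions for illustration, not the package's behaviour.

```python
from pathlib import Path


def suffixed_path(folder, filename, suffix, sep="_"):
    """Hypothetical helper: place `suffix` between the file stem and its extension."""
    original = Path(filename)
    # "patients.csv" with suffix "ASCII" becomes "patients_ASCII.csv" (separator is assumed).
    return Path(folder) / f"{original.stem}{sep}{suffix}{original.suffix}"


print(suffixed_path("train_data", "patients.csv", "ASCII").name)  # patients_ASCII.csv
```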

Relevant Definitions Settings

  • SUFFIX_CONVERT_ASCII: suffix to append to the end of the output filename of the input data. E.g. “ASCII”.
  • OPTIONS_CONVERT_ASCII_EXCLUSION_LIST: list of characters to exclude from conversion. E.g. “['€','$','Ò']”.

Examples

See Example cleanData for detailed setup and outputs.


Copyright © 2023 BiomedDAR. Distributed by an MIT license.