awswrangler.catalog.sanitize_dataframe_columns_names¶
- awswrangler.catalog.sanitize_dataframe_columns_names(df: pandas.core.frame.DataFrame) pandas.core.frame.DataFrame ¶
Normalize all columns names to be compatible with Amazon Athena.
https://docs.aws.amazon.com/athena/latest/ug/tables-databases-columns-names.html
Possible transformations: - Strip accents - Remove non alphanumeric characters - Convert CamelCase to snake_case
Note
After transformation, some column names might not be unique anymore. Example: the columns [“A”, “a”] will be sanitized to [“a”, “a”]
- Parameters
df (pandas.DataFrame) – Original Pandas DataFrame.
- Returns
Original Pandas DataFrame with columns names normalized.
- Return type
pandas.DataFrame
Examples
>>> import awswrangler as wr >>> df_normalized = wr.catalog.sanitize_dataframe_columns_names(df=pd.DataFrame({'A': [1, 2]}))