macpie.pandas.mark_duplicates_by_cols#
- macpie.pandas.mark_duplicates_by_cols(df: DataFrame, cols: List[str])#
Create a column in
df
calledget_option("column.system.duplicates")
which is a boolean Series denoting duplicate rows as identified bycols
.- Parameters:
- dfDataFrame
- colslist-like
Only consider these columns for identifiying duplicates