macpie.pandas.mark_duplicates_by_cols#

macpie.pandas.mark_duplicates_by_cols(df: DataFrame, cols: List[str])#

Create a column in df called get_option("column.system.duplicates") which is a boolean Series denoting duplicate rows as identified by cols.

Parameters:
dfDataFrame
colslist-like

Only consider these columns for identifiying duplicates