Different baseline scores for the same y #73

@LiuYYSS

Description

Hi, if my understanding is correct, the calculation of the baseline score does not involve X. For the same Y, the baseline score should therefore be identical no matter which X is used. However, I recently ran into a strange result when using pps.matrix(): as shown in the attached image, three different baseline scores appear for Y=5. I have uploaded a pickle dump of my pandas DataFrame. Could you test it and tell me whether you can reproduce this?

image

dataframe.zip

import ppscore as pps
pps.matrix(dataset_df)
ppscore=1.30
pandas=1.5.3
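
To make the discrepancy easier to check without reading the whole matrix by eye, here is a minimal sketch that lists every target with more than one distinct baseline score. The helper name `inconsistent_baselines` and the sample rows are hypothetical and made up for illustration. It assumes each record carries the `x`, `y`, and `baseline_score` fields, as in the rows of the pps.matrix() output.

```python
from collections import defaultdict


def inconsistent_baselines(records, ndigits=6):
    """Map each target y to its baseline scores if it has more than one distinct value.

    `records` is an iterable of dicts with "y" and "baseline_score" keys
    (assumed to mirror the rows of the pps.matrix() output).
    """
    baselines = defaultdict(set)
    for row in records:
        # Round so that tiny floating-point differences are not reported.
        baselines[row["y"]].add(round(row["baseline_score"], ndigits))
    return {y: sorted(scores) for y, scores in baselines.items() if len(scores) > 1}


# Made-up rows for illustration only; not taken from the uploaded dataframe.
example_rows = [
    {"x": "a", "y": "5", "baseline_score": 0.40},
    {"x": "b", "y": "5", "baseline_score": 0.42},
    {"x": "c", "y": "5", "baseline_score": 0.40},
    {"x": "a", "y": "b", "baseline_score": 0.10},
    {"x": "c", "y": "b", "baseline_score": 0.10},
]

print(inconsistent_baselines(example_rows))  # {'5': [0.4, 0.42]}
```

With the real data, the records could be obtained with `pps.matrix(dataset_df).to_dict("records")`, assuming the matrix is returned as a pandas DataFrame.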
