Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug Report: extreme values results mixed up while comparing profiling reports #1592

Open
3 tasks done
hzhu6666 opened this issue May 9, 2024 · 0 comments
Open
3 tasks done
Labels
bug 🐛 Something isn't working

Comments

@hzhu6666
Copy link

hzhu6666 commented May 9, 2024

Current Behaviour

2024-05-09_14-52-42
2024-05-09_14-53-11

Expected Behaviour

left side should always represent df1 using blue color; right side should be df2 using red. Frequency calculation needs to be calculated based on the correct total value in the second picture.

Data Description

generated in code below

Code that reproduces the bug

import ydata_profiling as yp
import pandas as pd

df1 = pd.DataFrame([1]*50+[0]*50,columns=['point'])
df2 = pd.DataFrame([1]*9+[0]*1,columns=['point'])

report1 = yp.ProfileReport(df1, minimal=True)
report2 = yp.ProfileReport(df2, minimal=True)
comparison_report = yp.compare([report1,report2],compute=True)
cname = 'test.html'
comparison_report.to_file(cname)

pandas-profiling version

v4.7.0

Dependencies

pandas==2.2.1

OS

No response

Checklist

  • There is not yet another bug report for this issue in the issue tracker
  • The problem is reproducible from this bug report. This guide can help to craft a minimal bug report.
  • The issue has not been resolved by the entries listed under Common Issues.
@fabclmnt fabclmnt added bug 🐛 Something isn't working and removed needs-triage labels May 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug 🐛 Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants