
Precision-recall description improvement #18719

Open

daniel-yj-yang opened this issue Nov 1, 2020 · 3 comments · May be fixed by #28967

Comments

@daniel-yj-yang

Describe the issue linked to the documentation

https://scikit-learn.org/stable/auto_examples/model_selection/plot_precision_recall.html

It says, "high precision relates to a low false positive rate" and some a few places it links these two together, e.g., "false positives, decreasing precision."

Suggest a potential alternative/fix

"Precision = 1 - false discovery rate" and "Specificity = 1 - false positive rate"

Thus, the term "false discovery rate" should be emphasized, and "false positive rate" should be deemphasized when talking about high precision.

@NicolasHug
Member

I'm not sure what's so wrong with the current version, but feel free to open a PR so we can do a more tangible review @daniel-yj-yang

@kushwahvikram15

@NicolasHug I also agree with you. I didn't find any issue with the current version.

@jnothman
Member

I think you're technically right, @daniel-yj-yang, for those who have been trained to use terms like "false discovery rate". This is true of much of the medical community, but unfortunately not of much of the machine learning community. The problem here is that a technical term is inadvertently being used: an increase in false positives will indeed decrease precision if the number of true positives remains constant, and indeed the counts of "false positives" and "false negatives" are all that differ between the formulas for P & R.

The reason for the difference between FPR and FDR is that the denominator of FDR depends on the estimator, whereas the denominator of FPR depends only on the ground truth. It is an important difference, but one that might not be easily drawn out in the context of this example. In any case, an attempt to improve the wording so that it avoids misuse of jargon would be helpful.
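
A small numeric sketch of that distinction (all counts below are invented): with true positives held fixed, every extra false positive lowers precision (raises FDR), and FDR's denominator moves with the estimator's predictions while FPR's denominator is fixed by the ground truth.

```python
# Invented counts: true positives and actual negatives are held fixed.
tp = 50        # true positives (constant across the comparison)
n_neg = 1000   # actual negatives in the ground truth (constant)

for fp in (5, 25, 100):
    precision = tp / (tp + fp)
    fdr = fp / (tp + fp)   # denominator tp + fp is set by the estimator
    fpr = fp / n_neg       # denominator is set by the ground truth alone
    print(f"fp={fp:3d}  precision={precision:.3f}  FDR={fdr:.3f}  FPR={fpr:.3f}")
```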

@lucyleeow linked a pull request May 7, 2024 that will close this issue