plot_calibration¶
yohou.plotting.evaluation.plot_calibration(y_pred, y_truth, coverage_rates=None, *, columns=None, target=None, n_bins=10, groups=None, facet_by='member', facet_n_cols=2, color_palette=None, show_legend=True, title=None, x_label=None, y_label=None, width=None, height=None, line_width=2.0, line_opacity=1.0, reference_color='#1e293b', reference_width=3.0, reference_dash='dash')
¶
Plot calibration for interval or class-probability forecasts.
Automatically detects the prediction type from column names:
- Interval predictions: columns named
"{target}_upper_{rate}"/"{target}_lower_{rate}"are compared against nominal coverage_rates (empirical vs nominal coverage). - Class-probability predictions: columns containing
"_proba_"are binned and compared against empirical frequencies (reliability diagram).
A well-calibrated model has points close to the diagonal.
Parameters¶
| Name | Type | Description | Default |
|---|---|---|---|
y_pred
|
DataFrame
|
Predicted values. Either prediction intervals with
|
required |
y_truth
|
DataFrame
|
Ground truth values. |
required |
coverage_rates
|
list of float or None
|
Nominal coverage rates for interval calibration (e.g.,
|
None
|
columns
|
str | list[str] | None
|
Target column name(s) for interval calibration. When groups is set this acts as a member postfix filter. Ignored for class-probability predictions (use target instead). |
None
|
target
|
str or None
|
Target column name for class-probability calibration. Required when multiple targets are present in the probability columns. Ignored for interval calibration. |
None
|
n_bins
|
int
|
Number of bins for class-probability calibration. Ignored for interval calibration. |
10
|
groups
|
list[str] | None
|
Panel group prefixes for faceted subplots. When provided, each resolved panel column gets its own subplot. |
None
|
facet_by
|
Literal['group', 'member'] | None
|
Faceting axis for panel data. |
"member"
|
facet_n_cols
|
int
|
Number of columns in the facet grid when groups is used. |
2
|
color_palette
|
list[str] | None
|
Custom color palette as hex codes. If None, uses yohou palette. |
None
|
show_legend
|
bool
|
Whether to show the legend. |
True
|
title
|
str | None
|
Plot title. Defaults to |
None
|
x_label
|
str | None
|
X-axis label. |
None
|
y_label
|
str | None
|
Y-axis label. |
None
|
width
|
int | None
|
Plot width in pixels. |
None
|
height
|
int | None
|
Plot height in pixels. |
None
|
line_width
|
float
|
Width of the calibration line. |
2.0
|
line_opacity
|
float
|
Opacity of the calibration line. |
1.0
|
reference_color
|
str
|
Colour of the perfect-calibration reference line. |
"#1e293b"
|
reference_width
|
float
|
Width of the reference line. |
3.0
|
reference_dash
|
str
|
Dash style of the reference line. |
"dash"
|
Returns¶
| Type | Description |
|---|---|
Figure
|
Plotly figure object. |
Raises¶
| Type | Description |
|---|---|
ValueError
|
If interval columns are missing, coverage_rates is not provided for interval predictions, or class-probability columns are ambiguous. |
Examples¶
Interval calibration:
>>> # Create sample data
>>> n = 100
>>> y_truth = pl.DataFrame({"y": np.random.randn(n)})
>>> y_pred_int = pl.DataFrame({
... "y_upper_0.9": np.random.randn(n) + 1.65,
... "y_lower_0.9": np.random.randn(n) - 1.65,
... "y_upper_0.95": np.random.randn(n) + 1.96,
... "y_lower_0.95": np.random.randn(n) - 1.96,
... })
>>> # Plot calibration
>>> fig = plot_calibration(y_pred_int, y_truth, coverage_rates=[0.9, 0.95])
>>> len(fig.data)
2
Class-probability calibration (reliability diagram):
>>> from datetime import datetime
>>> y_pred_proba = pl.DataFrame({
... "time": [datetime(2020, 1, i) for i in range(1, 11)],
... "w_proba_sunny": [0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.9, 0.85],
... "w_proba_rainy": [0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.1, 0.15],
... })
>>> y_truth_cat = pl.DataFrame({
... "time": [datetime(2020, 1, i) for i in range(1, 11)],
... "w": ["sunny", "sunny", "sunny", "rainy", "rainy", "rainy", "rainy", "rainy", "sunny", "sunny"],
... })
>>> fig = plot_calibration(y_pred_proba, y_truth_cat)
>>> isinstance(fig, go.Figure)
True
See Also¶
plot_forecast : Plot forecast with optional prediction intervals.
plot_residuals : Residual diagnostics with panel facets.
Source Code¶
Show/Hide source
719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 | |
Tutorials¶
The following example notebooks use this component:
-
How to Score Class-Probability Forecasts
Evaluation-Search
Evaluate categorical forecasts with LogLoss, BrierScore, and Accuracy. Covers per-timestep scoring, aggregation modes, and reliability diagrams.
-
How to Evaluate Interval Forecasts
Evaluation-Search
Evaluate prediction intervals with EmpiricalCoverage, IntervalScore, MeanIntervalWidth, PinballLoss, and CalibrationError across coverage levels.
-
How to Forecast Class Probabilities
Forecasting-Models
Use ClassProbaReductionForecaster to produce calibrated probability forecasts and evaluate them with Brier score, log loss, and accuracy.
-
How to Use Distance-Based Similarity for Intervals
Forecasting-Models
Adaptive prediction intervals via similarity-weighted conformal prediction using DistanceSimilarity with configurable distance metrics and bandwidths.
-
Quickstart
Quickstart
Comprehensive end-to-end tour of yohou beyond the Getting Started tutorials, covering data loading, baseline forecasting, preprocessing pipelines, decomposition, cross-validation search, and interval prediction.
-
How to Visualize Forecast Evaluation Results
Visualization
Use plot_calibration, plot_score_per_step, and plot_forecast to diagnose forecast accuracy and interval calibration visually.
-
Forecast Visualization
Visualization
Visualise point forecasts from single and multiple models, decomposition pipeline components, and time weight decay functions with interactive Plotly.
-
How to Visualize Forecasts
Visualization
Plot point forecasts, compare multiple models, render prediction interval bands, inspect residual diagnostics, and check interval calibration.