Dry-window models

Phase 3p · `v2026-06-14_150640_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080817_phase3p` `v2026-06-11_054510_phase3p`	8	0.001 —	0.000 —	0.000 —	—	—

Phase 3b · `v2026-06-14_145202` Δ -0.003 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.086	has_dry_window_gfs (0.207)	-58.8%
+48h	0.109	has_dry_window_gfs (0.193)	-43.3%
+72h	0.096	has_dry_window_gfs (0.178)	-46.0%

Verify history (16 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_140739`	26	0.060 —	0.047 —	0.074 —	—	—
Mon 2026-06-15	`v2026-06-07_140739`	11	0.072 —	0.014 —	0.141 —	—	—
Fri 2026-06-12	`v2026-05-31_204435`	23	0.305 —	0.219 —	0.168 —	—	—
Thu 2026-06-11	`v2026-05-31_204435`	18	0.373 —	0.280 —	0.213 —	—	—
Mon 2026-06-08	`v2026-05-31_204435`	6	0.676 —	0.306 —	0.101 —	—	—
Thu 2026-06-04	`v2026-05-24_131915`	29	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-24_131915`	14	0.000 —	0.000 —	0.000 —	—	—
Thu 2026-05-28	`v2026-05-14_052009`	32	0.062 —	0.123 —	0.054 —	—	—
Mon 2026-05-25	`v2026-05-14_052009`	14	0.141 —	0.345 —	0.206 —	—	—
Thu 2026-05-21	`v2026-04-29_120905`	50	0.005 —	0.015 —	0.043 —	—	—
Mon 2026-05-18	`v2026-04-29_120905`	57	0.006 —	0.002 —	0.097 —	—	—
Thu 2026-05-14	`v2026-04-29_120905`	64	0.043 —	0.032 —	0.118 —	—	—
Mon 2026-05-11	`v2026-04-29_120905`	54	0.049 —	0.039 —	0.127 —	—	—
Thu 2026-05-07	`v2026-04-29_120905`	25	0.030 —	0.078 —	0.155 —	—	—
Tue 2026-05-05	`v2026-04-27_192657` `v2026-04-28_161801` `v2026-04-29_120905`	5	0.003 —	0.000 —	0.000 —	—	—
Sun 2026-05-03	`v2026-04-23_101107` `v2026-04-27_192657`	12	0.000 —	0.000 —	0.000 —	—	—

Phase 3p · `v2026-06-14_150640_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080817_phase3p` `v2026-06-11_054510_phase3p`	8	0.004 —	0.000 —	0.000 —	—	—
Mon 2026-06-15	`v2026-06-07_142135_phase3p`	11	0.042 —	0.036 —	0.012 —	—	—
Fri 2026-06-12	`v2026-05-31_205722_phase3p`	23	0.131 —	0.158 —	0.221 —	—	—
Thu 2026-06-11	`v2026-05-31_205722_phase3p`	18	0.159 —	0.200 —	0.275 —	—	—
Mon 2026-06-08	`v2026-05-31_205722_phase3p`	6	0.237 —	0.129 —	0.207 —	—	—
Thu 2026-06-04	`v2026-05-26_110451_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110451_phase3p`	3	0.000 —	—	—	—	—

Phase 3b · `v2026-06-14_145318` Δ -0.003 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.093	has_dry_window_mf (0.185)	-49.9%
+48h	0.106	has_dry_window_ecmwf (0.222)	-52.4%
+72h	0.119	has_dry_window_ecmwf (0.267)	-55.5%

Verify history (16 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_140851`	26	0.053 —	0.051 —	0.062 —	—	—
Mon 2026-06-15	`v2026-06-07_140851`	11	0.055 —	0.173 —	0.114 —	—	—
Fri 2026-06-12	`v2026-05-31_204541`	23	0.190 —	0.121 —	0.088 —	—	—
Thu 2026-06-11	`v2026-05-31_204541`	18	0.227 —	0.153 —	0.108 —	—	—
Mon 2026-06-08	`v2026-05-31_204541`	6	0.543 —	0.104 —	0.073 —	—	—
Thu 2026-06-04	`v2026-05-24_132009`	29	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-24_132009`	14	0.000 —	0.000 —	0.000 —	—	—
Thu 2026-05-28	`v2026-05-14_052407`	32	0.175 —	0.102 —	0.087 —	—	—
Mon 2026-05-25	`v2026-05-14_052407`	14	0.401 —	0.286 —	0.332 —	—	—
Thu 2026-05-21	`v2026-04-29_121003`	50	0.076 —	0.054 —	0.061 —	—	—
Mon 2026-05-18	`v2026-04-29_121003`	57	0.002 —	0.021 —	0.101 —	—	—
Thu 2026-05-14	`v2026-04-29_121003`	64	0.058 —	0.149 —	0.142 —	—	—
Mon 2026-05-11	`v2026-04-29_121003`	54	0.068 —	0.185 —	0.161 —	—	—
Thu 2026-05-07	`v2026-04-29_121003`	25	0.055 —	0.212 —	0.202 —	—	—
Tue 2026-05-05	`v2026-04-27_192749` `v2026-04-28_161822` `v2026-04-29_121003`	5	0.000 —	0.000 —	0.000 —	—	—
Sun 2026-05-03	`v2026-04-23_101150` `v2026-04-27_192749`	12	0.000 —	0.000 —	0.000 —	—	—

Phase 3p · `v2026-06-14_150640_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080817_phase3p` `v2026-06-11_054510_phase3p`	8	0.012 —	0.000 —	0.002 —	—	—
Mon 2026-06-15	`v2026-06-07_142135_phase3p`	11	0.098 —	0.085 —	0.033 —	—	—
Fri 2026-06-12	`v2026-05-31_205722_phase3p`	23	0.076 —	0.104 —	0.154 —	—	—
Thu 2026-06-11	`v2026-05-31_205722_phase3p`	18	0.087 —	0.125 —	0.184 —	—	—
Mon 2026-06-08	`v2026-05-31_205722_phase3p`	6	0.133 —	0.060 —	0.101 —	—	—
Thu 2026-06-04	`v2026-05-26_110451_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110451_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150640_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080817_phase3p` `v2026-06-11_054510_phase3p`	8	0.034 —	0.000 —	0.006 —	—	—

Phase 3b · `v2026-06-14_145432` Δ +0.013 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.109	has_dry_window_mf (0.185)	-40.9%
+48h	0.111	has_dry_window_jma (0.193)	-42.6%
+72h	0.160	has_dry_window_jma (0.207)	-22.8%

Verify history (16 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141001`	26	0.110 —	0.075 —	0.025 —	—	—
Mon 2026-06-15	`v2026-06-07_141001`	11	0.252 —	0.234 —	0.081 —	—	—
Fri 2026-06-12	`v2026-05-31_204645`	23	0.094 —	0.060 —	0.143 —	—	—
Thu 2026-06-11	`v2026-05-31_204645`	18	0.074 —	0.062 —	0.118 —	—	—
Mon 2026-06-08	`v2026-05-31_204645`	6	0.000 —	0.001 —	0.008 —	—	—
Thu 2026-06-04	`v2026-05-24_132101`	29	0.012 —	0.021 —	0.007 —	—	—
Mon 2026-06-01	`v2026-05-24_132101`	14	0.002 —	0.016 —	0.045 —	—	—
Thu 2026-05-28	`v2026-05-14_052612`	32	0.241 —	0.134 —	0.160 —	—	—
Mon 2026-05-25	`v2026-05-14_052612`	14	0.552 —	0.376 —	0.613 —	—	—
Thu 2026-05-21	`v2026-05-03_120517`	50	0.128 —	0.049 —	0.203 —	—	—
Mon 2026-05-18	`v2026-05-03_120517`	51	0.158 —	0.162 —	0.192 —	—	—
Thu 2026-05-14	`v2026-05-03_120517`	31	0.110 —	0.288 —	0.311 —	—	—
Mon 2026-05-11	`v2026-05-03_120517`	17	0.149 —	0.397 —	0.732 —	—	—
Thu 2026-05-07	`v2026-04-29_121058`	25	0.079 —	0.031 —	0.125 —	—	—
Tue 2026-05-05	`v2026-04-27_192839` `v2026-04-28_161844` `v2026-04-29_121058`	5	0.013 —	0.002 —	0.002 —	—	—
Sun 2026-05-03	`v2026-04-23_101214` `v2026-04-27_182428` `v2026-04-27_192839`	11	0.001 —	0.000 —	0.001 —	—	—

Phase 3p · `v2026-06-14_150640_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080817_phase3p` `v2026-06-11_054510_phase3p`	8	0.059 —	0.001 —	0.016 —	—	—
Mon 2026-06-15	`v2026-06-07_142135_phase3p`	11	0.270 —	0.303 —	0.368 —	—	—
Fri 2026-06-12	`v2026-05-31_205722_phase3p`	23	0.062 —	0.057 —	0.116 —	—	—
Thu 2026-06-11	`v2026-05-31_205722_phase3p`	18	0.053 —	0.044 —	0.113 —	—	—
Mon 2026-06-08	`v2026-05-31_205722_phase3p`	6	0.032 —	0.011 —	0.012 —	—	—
Thu 2026-06-04	`v2026-05-26_110451_phase3p`	18	0.001 —	0.001 —	0.002 —	—	—
Mon 2026-06-01	`v2026-05-26_110451_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150642_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080819_phase3p` `v2026-06-11_054512_phase3p`	8	0.000 —	0.000 —	0.000 —	—	—

Phase 3b · `v2026-06-14_145544` Δ -0.005 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.049	has_dry_window_icon (0.062)	-20.6%
+48h	0.044	has_dry_window_mf (0.085)	-48.0%
+72h	0.045	has_dry_window_gfs (0.092)	-51.2%

Verify history (13 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141111`	26	0.004 —	0.005 —	0.003 —	—	—
Mon 2026-06-15	`v2026-06-07_141111`	11	0.000 —	0.006 —	0.001 —	—	—
Fri 2026-06-12	`v2026-05-31_204749`	28	0.027 —	0.016 —	0.023 —	—	—
Thu 2026-06-11	`v2026-05-31_204749`	24	0.031 —	0.020 —	0.030 —	—	—
Mon 2026-06-08	`v2026-05-31_204749`	11	0.017 —	0.005 —	0.088 —	—	—
Thu 2026-06-04	`v2026-05-24_132152`	29	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-24_132152`	14	0.000 —	0.000 —	0.000 —	—	—
Thu 2026-05-28	`v2026-05-14_052815`	32	0.002 —	0.008 —	0.002 —	—	—
Mon 2026-05-25	`v2026-05-14_052815`	14	0.003 —	0.022 —	0.006 —	—	—
Thu 2026-05-21	`v2026-05-03_233816`	50	0.000 —	0.000 —	0.003 —	—	—
Mon 2026-05-18	`v2026-05-03_233816`	51	0.000 —	0.001 —	0.004 —	—	—
Thu 2026-05-14	`v2026-05-03_233816`	24	0.001 —	0.001 —	0.010 —	—	—
Mon 2026-05-11	`v2026-05-03_233816`	10	0.001 —	0.004 —	—	—	—

Phase 3p · `v2026-06-14_150642_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080819_phase3p` `v2026-06-11_054512_phase3p`	8	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-15	`v2026-06-07_142137_phase3p`	11	0.002 —	0.002 —	0.001 —	—	—
Fri 2026-06-12	`v2026-05-31_205723_phase3p`	28	0.053 —	0.049 —	0.018 —	—	—
Thu 2026-06-11	`v2026-05-31_205723_phase3p`	24	0.061 —	0.062 —	0.023 —	—	—
Mon 2026-06-08	`v2026-05-31_205723_phase3p`	11	0.036 —	0.096 —	0.092 —	—	—
Thu 2026-06-04	`v2026-05-26_110452_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110452_phase3p`	3	0.000 —	—	—	—	—

Phase 3b · `v2026-06-14_145655` Δ -0.005 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.067	has_dry_window_mf (0.200)	-66.4%
+48h	0.063	has_dry_window_mf (0.169)	-63.0%
+72h	0.072	has_dry_window_gfs (0.200)	-64.2%

Verify history (13 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141220`	26	0.026 —	0.018 —	0.022 —	—	—
Mon 2026-06-15	`v2026-06-07_141220`	11	0.018 —	0.021 —	0.028 —	—	—
Fri 2026-06-12	`v2026-05-31_204851`	28	0.114 —	0.056 —	0.085 —	—	—
Thu 2026-06-11	`v2026-05-31_204851`	24	0.133 —	0.071 —	0.108 —	—	—
Mon 2026-06-08	`v2026-05-31_204851`	11	0.096 —	0.066 —	0.342 —	—	—
Thu 2026-06-04	`v2026-05-24_132243`	29	0.001 —	0.001 —	0.002 —	—	—
Mon 2026-06-01	`v2026-05-24_132243`	14	0.001 —	0.001 —	0.004 —	—	—
Thu 2026-05-28	`v2026-05-14_053030`	32	0.001 —	0.009 —	0.003 —	—	—
Mon 2026-05-25	`v2026-05-14_053030`	14	0.003 —	0.024 —	0.009 —	—	—
Thu 2026-05-21	`v2026-05-03_233924`	50	0.004 —	0.003 —	0.013 —	—	—
Mon 2026-05-18	`v2026-05-03_233924`	51	0.002 —	0.002 —	0.014 —	—	—
Thu 2026-05-14	`v2026-05-03_233924`	24	0.003 —	0.004 —	0.031 —	—	—
Mon 2026-05-11	`v2026-05-03_233924`	10	0.005 —	0.010 —	—	—	—

Phase 3p · `v2026-06-14_150642_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080819_phase3p` `v2026-06-11_054512_phase3p`	8	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-15	`v2026-06-07_142137_phase3p`	11	0.010 —	0.007 —	0.003 —	—	—
Fri 2026-06-12	`v2026-05-31_205723_phase3p`	28	0.127 —	0.098 —	0.048 —	—	—
Thu 2026-06-11	`v2026-05-31_205723_phase3p`	24	0.148 —	0.122 —	0.060 —	—	—
Mon 2026-06-08	`v2026-05-31_205723_phase3p`	11	0.125 —	0.184 —	0.199 —	—	—
Thu 2026-06-04	`v2026-05-26_110452_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110452_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150642_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080819_phase3p` `v2026-06-11_054512_phase3p`	8	0.001 —	0.000 —	0.002 —	—	—

Phase 3b · `v2026-06-14_145807` Δ -0.009 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.112	has_dry_window_gfs (0.231)	-51.5%
+48h	0.116	has_dry_window_gfs (0.262)	-55.8%
+72h	0.130	has_dry_window_icon (0.231)	-43.5%

Verify history (13 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141329`	26	0.114 —	0.055 —	0.085 —	—	—
Mon 2026-06-15	`v2026-06-07_141329`	11	0.065 —	0.080 —	0.064 —	—	—
Fri 2026-06-12	`v2026-05-31_204955`	28	0.106 —	0.161 —	0.194 —	—	—
Thu 2026-06-11	`v2026-05-31_204955`	24	0.123 —	0.203 —	0.246 —	—	—
Mon 2026-06-08	`v2026-05-31_204955`	11	0.091 —	0.162 —	0.010 —	—	—
Thu 2026-06-04	`v2026-05-24_132334`	29	0.017 —	0.004 —	0.001 —	—	—
Mon 2026-06-01	`v2026-05-24_132334`	14	0.033 —	0.008 —	0.001 —	—	—
Thu 2026-05-28	`v2026-05-14_053806`	32	0.026 —	0.088 —	0.017 —	—	—
Mon 2026-05-25	`v2026-05-14_053806`	14	0.058 —	0.245 —	0.064 —	—	—
Thu 2026-05-21	`v2026-05-03_234027`	50	0.047 —	0.146 —	0.168 —	—	—
Mon 2026-05-18	`v2026-05-03_234027`	51	0.012 —	0.018 —	0.017 —	—	—
Thu 2026-05-14	`v2026-05-03_234027`	24	0.024 —	0.038 —	0.029 —	—	—
Mon 2026-05-11	`v2026-05-03_234027`	10	0.050 —	0.103 —	—	—	—

Phase 3p · `v2026-06-14_150642_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080819_phase3p` `v2026-06-11_054512_phase3p`	8	0.003 —	0.001 —	0.006 —	—	—
Mon 2026-06-15	`v2026-06-07_142137_phase3p`	11	0.066 —	0.053 —	0.028 —	—	—
Fri 2026-06-12	`v2026-05-31_205723_phase3p`	28	0.185 —	0.150 —	0.192 —	—	—
Thu 2026-06-11	`v2026-05-31_205723_phase3p`	24	0.213 —	0.182 —	0.236 —	—	—
Mon 2026-06-08	`v2026-05-31_205723_phase3p`	11	0.236 —	0.115 —	0.067 —	—	—
Thu 2026-06-04	`v2026-05-26_110452_phase3p`	18	0.000 —	0.001 —	0.001 —	—	—
Mon 2026-06-01	`v2026-05-26_110452_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150643_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080820_phase3p` `v2026-06-11_054514_phase3p`	8	0.001 —	0.000 —	0.000 —	—	—

Phase 3b · `v2026-06-14_145920` Δ -0.006 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.114	has_dry_window_icon (0.215)	-46.9%
+48h	0.137	has_dry_window_gfs (0.222)	-38.4%
+72h	0.129	has_dry_window_mf (0.230)	-43.9%

Verify history (15 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141438`	26	0.016 —	0.115 —	0.063 —	—	—
Mon 2026-06-15	`v2026-06-07_141438`	11	0.028 —	0.108 —	0.055 —	—	—
Fri 2026-06-12	`v2026-05-31_205058`	23	0.130 —	0.223 —	0.177 —	—	—
Thu 2026-06-11	`v2026-05-31_205058`	18	0.158 —	0.283 —	0.221 —	—	—
Mon 2026-06-08	`v2026-05-31_205058`	6	0.385 —	0.305 —	0.201 —	—	—
Thu 2026-06-04	`v2026-05-24_132426`	29	0.000 —	0.002 —	0.003 —	—	—
Mon 2026-06-01	`v2026-05-24_132426`	14	0.000 —	0.002 —	0.004 —	—	—
Thu 2026-05-28	`v2026-05-14_054446`	32	0.400 —	0.233 —	0.188 —	—	—
Mon 2026-05-25	`v2026-05-14_054446`	14	0.284 —	0.273 —	0.124 —	—	—
Thu 2026-05-21	`v2026-04-29_121459`	50	0.081 —	0.156 —	0.153 —	—	—
Mon 2026-05-18	`v2026-04-29_121459`	57	0.002 —	0.006 —	0.048 —	—	—
Thu 2026-05-14	`v2026-04-29_121459`	64	0.018 —	0.032 —	0.048 —	—	—
Mon 2026-05-11	`v2026-04-29_121459`	54	0.021 —	0.038 —	0.057 —	—	—
Thu 2026-05-07	`v2026-04-29_121459`	25	0.005 —	0.070 —	0.061 —	—	—
Tue 2026-05-05	`v2026-04-29_121459`	5	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150643_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080820_phase3p` `v2026-06-11_054514_phase3p`	8	0.004 —	0.000 —	0.000 —	—	—
Mon 2026-06-15	`v2026-06-07_142138_phase3p`	11	0.049 —	0.037 —	0.011 —	—	—
Fri 2026-06-12	`v2026-05-31_205725_phase3p`	23	0.124 —	0.152 —	0.206 —	—	—
Thu 2026-06-11	`v2026-05-31_205725_phase3p`	18	0.151 —	0.192 —	0.256 —	—	—
Mon 2026-06-08	`v2026-05-31_205725_phase3p`	6	0.226 —	0.120 —	0.216 —	—	—
Thu 2026-06-04	`v2026-05-26_110453_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110453_phase3p`	3	0.000 —	—	—	—	—

Phase 3b · `v2026-06-14_150033` Δ -0.003 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.134	has_dry_window_jma (0.156)	-14.1%
+48h	0.162	has_dry_window_ecmwf (0.230)	-29.3%
+72h	0.146	has_dry_window_aifs (0.267)	-45.1%

Verify history (15 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141547`	26	0.046 —	0.090 —	0.030 —	—	—
Mon 2026-06-15	`v2026-06-07_141547`	11	0.073 —	0.312 —	0.034 —	—	—
Fri 2026-06-12	`v2026-05-31_205201`	23	0.085 —	0.080 —	0.115 —	—	—
Thu 2026-06-11	`v2026-05-31_205201`	18	0.101 —	0.100 —	0.140 —	—	—
Mon 2026-06-08	`v2026-05-31_205201`	6	0.171 —	0.080 —	0.052 —	—	—
Thu 2026-06-04	`v2026-05-24_132517`	29	0.001 —	0.001 —	0.001 —	—	—
Mon 2026-06-01	`v2026-05-24_132517`	14	0.001 —	0.001 —	0.001 —	—	—
Thu 2026-05-28	`v2026-05-14_055125`	32	0.432 —	0.246 —	0.241 —	—	—
Mon 2026-05-25	`v2026-05-14_055125`	14	0.369 —	0.311 —	0.291 —	—	—
Thu 2026-05-21	`v2026-04-29_121554`	50	0.234 —	0.175 —	0.174 —	—	—
Mon 2026-05-18	`v2026-04-29_121554`	57	0.254 —	0.178 —	0.259 —	—	—
Thu 2026-05-14	`v2026-04-29_121554`	64	0.152 —	0.150 —	0.225 —	—	—
Mon 2026-05-11	`v2026-04-29_121554`	54	0.176 —	0.186 —	0.287 —	—	—
Thu 2026-05-07	`v2026-04-29_121554`	25	0.090 —	0.097 —	0.044 —	—	—
Tue 2026-05-05	`v2026-04-29_121554`	5	0.001 —	—	—	—	—

Phase 3p · `v2026-06-14_150643_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080820_phase3p` `v2026-06-11_054514_phase3p`	8	0.013 —	0.000 —	0.002 —	—	—
Mon 2026-06-15	`v2026-06-07_142138_phase3p`	11	0.116 —	0.090 —	0.033 —	—	—
Fri 2026-06-12	`v2026-05-31_205725_phase3p`	23	0.069 —	0.098 —	0.143 —	—	—
Thu 2026-06-11	`v2026-05-31_205725_phase3p`	18	0.082 —	0.117 —	0.169 —	—	—
Mon 2026-06-08	`v2026-05-31_205725_phase3p`	6	0.123 —	0.054 —	0.107 —	—	—
Thu 2026-06-04	`v2026-05-26_110453_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110453_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150643_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080820_phase3p` `v2026-06-11_054514_phase3p`	8	0.521 —	0.954 —	0.838 —	—	—

Phase 3b · `v2026-06-14_150146` Δ +0.010 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.164	has_dry_window_mf (0.267)	-38.6%
+48h	0.165	has_dry_window_gem (0.296)	-44.3%
+72h	0.199	has_dry_window_ecmwf (0.244)	-18.6%

Verify history (15 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141656`	26	0.244 —	0.279 —	0.278 —	—	—
Mon 2026-06-15	`v2026-06-07_141656`	11	0.227 —	0.123 —	0.107 —	—	—
Fri 2026-06-12	`v2026-05-31_205304`	23	0.073 —	0.134 —	0.125 —	—	—
Thu 2026-06-11	`v2026-05-31_205304`	18	0.049 —	0.070 —	0.093 —	—	—
Mon 2026-06-08	`v2026-05-31_205304`	6	0.023 —	0.033 —	0.029 —	—	—
Thu 2026-06-04	`v2026-05-24_132609`	29	0.019 —	0.008 —	0.024 —	—	—
Mon 2026-06-01	`v2026-05-24_132609`	14	0.013 —	0.011 —	0.074 —	—	—
Thu 2026-05-28	`v2026-05-14_055801`	32	0.328 —	0.138 —	0.164 —	—	—
Mon 2026-05-25	`v2026-05-14_055801`	14	0.200 —	0.039 —	0.144 —	—	—
Thu 2026-05-21	`v2026-05-03_120600`	50	0.120 —	0.111 —	0.197 —	—	—
Mon 2026-05-18	`v2026-05-03_120600`	51	0.252 —	0.181 —	0.231 —	—	—
Thu 2026-05-14	`v2026-05-03_120600`	31	0.411 —	0.290 —	0.415 —	—	—
Mon 2026-05-11	`v2026-05-03_120600`	17	0.536 —	0.394 —	0.574 —	—	—
Thu 2026-05-07	`v2026-04-29_121648`	25	0.405 —	0.403 —	0.170 —	—	—
Tue 2026-05-05	`v2026-04-29_121648`	5	0.070 —	—	—	—	—

Phase 3p · `v2026-06-14_150643_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080820_phase3p` `v2026-06-11_054514_phase3p`	8	0.532 —	0.924 —	0.758 —	—	—
Mon 2026-06-15	`v2026-06-07_142138_phase3p`	11	0.217 —	0.241 —	0.379 —	—	—
Fri 2026-06-12	`v2026-05-31_205725_phase3p`	23	0.168 —	0.198 —	0.165 —	—	—
Thu 2026-06-11	`v2026-05-31_205725_phase3p`	18	0.085 —	0.131 —	0.099 —	—	—
Mon 2026-06-08	`v2026-05-31_205725_phase3p`	6	0.029 —	0.010 —	0.012 —	—	—
Thu 2026-06-04	`v2026-05-26_110453_phase3p`	18	0.001 —	0.001 —	0.002 —	—	—
Mon 2026-06-01	`v2026-05-26_110453_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150645_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080822_phase3p` `v2026-06-11_054515_phase3p`	8	0.000 —	0.000 —	0.000 —	—	—

Phase 3b · `v2026-06-14_150259` Δ +0.006 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.136	has_dry_window_jma (0.179)	-24.0%
+48h	0.154	has_dry_window_ecmwf (0.284)	-45.7%
+72h	0.120	has_dry_window_mf (0.179)	-33.0%

Verify history (8 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141804`	24	0.045 —	0.098 —	0.044 —	—	—
Mon 2026-06-15	`v2026-06-07_141804`	9	0.068 —	0.144 —	0.035 —	—	—
Fri 2026-06-12	`v2026-05-31_205408`	28	0.207 —	0.222 —	0.162 —	—	—
Thu 2026-06-11	`v2026-05-31_205408`	24	0.241 —	0.281 —	0.203 —	—	—
Mon 2026-06-08	`v2026-05-31_205408`	11	0.305 —	0.292 —	0.122 —	—	—
Thu 2026-06-04	`v2026-05-26_110225`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110225`	3	0.000 —	—	—	—	—
Sun 2026-05-03	`v2026-04-23_101238` `v2026-04-27_192928`	11	0.000 —	0.000 —	0.000 —	—	—

Phase 3p · `v2026-06-14_150645_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080822_phase3p` `v2026-06-11_054515_phase3p`	8	0.002 —	0.000 —	0.000 —	—	—
Mon 2026-06-15	`v2026-06-07_142140_phase3p`	9	0.029 —	0.024 —	0.009 —	—	—
Fri 2026-06-12	`v2026-05-31_205726_phase3p`	28	0.169 —	0.179 —	0.252 —	—	—
Thu 2026-06-11	`v2026-05-31_205726_phase3p`	24	0.196 —	0.224 —	0.317 —	—	—
Mon 2026-06-08	`v2026-05-31_205726_phase3p`	11	0.229 —	0.149 —	0.251 —	—	—
Thu 2026-06-04	`v2026-05-26_110454_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110454_phase3p`	3	0.000 —	—	—	—	—

Phase 3b · `v2026-06-14_150412` Δ -0.004 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.130	has_dry_window_jma (0.157)	-17.1%
+48h	0.144	has_dry_window_ecmwf (0.254)	-43.4%
+72h	0.136	has_dry_window_aifs (0.269)	-49.2%

Verify history (8 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_141914`	24	0.038 —	0.086 —	0.031 —	—	—
Mon 2026-06-15	`v2026-06-07_141914`	9	0.076 —	0.305 —	0.027 —	—	—
Fri 2026-06-12	`v2026-05-31_205512`	28	0.217 —	0.100 —	0.128 —	—	—
Thu 2026-06-11	`v2026-05-31_205512`	24	0.253 —	0.126 —	0.157 —	—	—
Mon 2026-06-08	`v2026-05-31_205512`	11	0.454 —	0.100 —	0.143 —	—	—
Thu 2026-06-04	`v2026-05-26_110313`	18	0.001 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110313`	3	0.000 —	—	—	—	—
Sun 2026-05-03	`v2026-04-23_101310` `v2026-04-27_193017`	11	0.000 —	0.000 —	0.001 —	—	—

Phase 3p · `v2026-06-14_150645_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080822_phase3p` `v2026-06-11_054515_phase3p`	8	0.008 —	0.000 —	0.002 —	—	—
Mon 2026-06-15	`v2026-06-07_142140_phase3p`	9	0.077 —	0.060 —	0.029 —	—	—
Fri 2026-06-12	`v2026-05-31_205726_phase3p`	28	0.124 —	0.133 —	0.174 —	—	—
Thu 2026-06-11	`v2026-05-31_205726_phase3p`	24	0.142 —	0.160 —	0.212 —	—	—
Mon 2026-06-08	`v2026-05-31_205726_phase3p`	11	0.206 —	0.123 —	0.127 —	—	—
Thu 2026-06-04	`v2026-05-26_110454_phase3p`	18	0.000 —	0.000 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110454_phase3p`	3	0.000 —	—	—	—	—

Phase 3p · `v2026-06-14_150645_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (1 run)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080822_phase3p` `v2026-06-11_054515_phase3p`	8	0.025 —	0.000 —	0.008 —	—	—

Phase 3b · `v2026-06-14_150524` Δ -0.006 vs prev train

53-feature LightGBM per-(station, window). Trained 2026-06-14. Metric: Test Brier.

Lead	Blend	Best single	Δ vs best
+24h	0.114	has_dry_window_mf (0.201)	-43.6%
+48h	0.135	has_dry_window_mf (0.261)	-48.3%
+72h	0.132	has_dry_window_jma (0.194)	-31.9%

Verify history (8 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-07_142023`	24	0.056 —	0.055 —	0.021 —	—	—
Mon 2026-06-15	`v2026-06-07_142023`	9	0.137 —	0.192 —	0.109 —	—	—
Fri 2026-06-12	`v2026-05-31_205615`	28	0.078 —	0.056 —	0.098 —	—	—
Thu 2026-06-11	`v2026-05-31_205615`	24	0.078 —	0.065 —	0.107 —	—	—
Mon 2026-06-08	`v2026-05-31_205615`	11	0.069 —	0.074 —	0.031 —	—	—
Thu 2026-06-04	`v2026-05-26_110401`	18	0.033 —	0.007 —	0.000 —	—	—
Mon 2026-06-01	`v2026-05-26_110401`	3	0.084 —	—	—	—	—
Sun 2026-05-03	`v2026-04-23_101336` `v2026-04-27_193106`	11	0.007 —	0.001 —	0.002 —	—	—

Phase 3p · `v2026-06-14_150645_phase3p` Δ +0.000 vs prev train

Gaussian copula MC over Phase 3o's hourly P(wet) marginals. Single empirical Σ per station, fit on train-split observed daytime wet/dry binary sequences. Captures within-day wet/dry autocorrelation. Trained 2026-06-14. Metric: Test Brier.

Lead	Best single	Δ vs best
+24h	— (0.000 val)	—
+48h	— (0.000 val)	—
+72h	— (0.000 val)	—

Verify history (7 runs)

Mon + Thu rolling Brier. Per-lead cells turn red when the rolling metric breaches the lead-specific drift threshold; check the verify report for the per-cell breakdown. Version column names the trained model — a fresh champion takes ~5-9d to show.

Run (UTC)	Version	N	+24h	+48h	+72h	+96h	+120h
Thu 2026-06-18	`v2026-06-10_080822_phase3p` `v2026-06-11_054515_phase3p`	8	0.044 —	0.001 —	0.020 —	—	—
Mon 2026-06-15	`v2026-06-07_142140_phase3p`	9	0.271 —	0.316 —	0.391 —	—	—
Fri 2026-06-12	`v2026-05-31_205726_phase3p`	28	0.060 —	0.070 —	0.117 —	—	—
Thu 2026-06-11	`v2026-05-31_205726_phase3p`	24	0.057 —	0.057 —	0.118 —	—	—
Mon 2026-06-08	`v2026-05-31_205726_phase3p`	11	0.056 —	0.035 —	0.015 —	—	—
Thu 2026-06-04	`v2026-05-26_110454_phase3p`	18	0.001 —	0.002 —	0.002 —	—	—
Mon 2026-06-01	`v2026-05-26_110454_phase3p`	3	0.000 —	—	—	—	—

Dry window — Bellever Dartmoor — 2-hour

Verify history (1 run)

Dry window — Bellever Dartmoor — 3-hour

Verify history (16 runs)

Verify history (7 runs)

Dry window — Bellever Dartmoor — 4-hour

Verify history (16 runs)

Verify history (7 runs)

Dry window — Bellever Dartmoor — 5-hour

Verify history (1 run)

Dry window — Bellever Dartmoor — 6-hour

Verify history (16 runs)

Verify history (7 runs)

Dry window — Bovey Tracey — 2-hour

Verify history (1 run)

Dry window — Bovey Tracey — 3-hour

Verify history (13 runs)

Verify history (7 runs)

Dry window — Bovey Tracey — 4-hour

Verify history (13 runs)

Verify history (7 runs)

Dry window — Bovey Tracey — 5-hour

Verify history (1 run)

Dry window — Bovey Tracey — 6-hour

Verify history (13 runs)

Verify history (7 runs)

Dry window — Dartmoor Nr Hexworthy — 2-hour

Verify history (1 run)

Dry window — Dartmoor Nr Hexworthy — 3-hour

Verify history (15 runs)

Verify history (7 runs)

Dry window — Dartmoor Nr Hexworthy — 4-hour

Verify history (15 runs)

Verify history (7 runs)

Dry window — Dartmoor Nr Hexworthy — 5-hour

Verify history (1 run)

Dry window — Dartmoor Nr Hexworthy — 6-hour

Verify history (15 runs)

Verify history (7 runs)

Dry window — Princetown — 2-hour

Verify history (1 run)

Dry window — Princetown — 3-hour

Verify history (8 runs)

Verify history (7 runs)

Dry window — Princetown — 4-hour

Verify history (8 runs)

Verify history (7 runs)

Dry window — Princetown — 5-hour

Verify history (1 run)

Dry window — Princetown — 6-hour

Verify history (8 runs)

Verify history (7 runs)