evgueni-p/fbmc-chronos2

Evgueni Poloukarov, Claude committed 26 days ago

Commit 069627f

Parent: ff9fbcf

feat: implement past-only covariate masking for volatility capture

Past-Only Masking Implementation (Chronos-2 v1.4.0):
- Include ALL 3,043 features in future_df with automatic masking
- Known-future: weather, generation, load (615 features)
- Past-only masked: CNEC outages, volatility, flows (~2,428 features)
- Leverages Chronos-2's mask-based attention mechanism

Batch Size Optimization:
- Increased from 32 to 128 for better temporal attention
- Enables longer-range dependencies within single batch
- Faster inference (fewer forward passes)
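
The "fewer forward passes" point is plain ceiling division. The sketch below is only arithmetic under a stated assumption: that each covariate and the target count as one item toward `batch_size`. How Chronos-2 actually forms batches is not shown in this commit, and `forward_passes` and `n_items` are hypothetical names.

```python
import math


def forward_passes(n_items: int, batch_size: int) -> int:
    """Number of batches needed to process n_items, batch_size at a time."""
    return math.ceil(n_items / batch_size)


# Assumption: 3,043 covariates + 1 target per border, each counted as one item.
n_items = 3_043 + 1

passes_at_32 = forward_passes(n_items, 32)
passes_at_128 = forward_passes(n_items, 128)
print(f"batch_size=32  -> {passes_at_32} passes")
print(f"batch_size=128 -> {passes_at_128} passes")
```

Under that assumption the larger batch needs a quarter of the passes, at the cost of more GPU memory per pass.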

Architecture Rationale:
- Past-only features have NaN future values → Chronos-2 sets mask=0
- Model learns cross-feature correlations from historical context
- Attention mechanism uses dimensional structure even when masked
- Enables learning CNEC/volatility patterns without future knowledge
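
The "NaN future values → mask=0" rule above can be illustrated with a few lines of standard-library Python. This is a sketch of the idea as the commit describes it, not Chronos-2's internal code; `observed_mask` and the sample series are hypothetical.

```python
import math


def observed_mask(values):
    """Return 1 where a value is observed and 0 where it is NaN (unknown)."""
    return [0 if math.isnan(v) else 1 for v in values]


# Hypothetical 4-hour future window for two covariates.
future_temperature = [11.5, 11.9, 12.4, 12.0]  # known-future: forecasts exist
future_cnec_outage = [math.nan] * 4            # past-only: unknown after run_date

temperature_mask = observed_mask(future_temperature)
cnec_outage_mask = observed_mask(future_cnec_outage)
print(temperature_mask)
print(cnec_outage_mask)
```

A known-future covariate yields an all-ones mask over the horizon, while a past-only covariate yields all zeros, so only its historical values can inform the forecast.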

Expected Impact:
- Better hourly volatility capture (especially hours 15-21)
- Cross-feature learning from masked constraint indicators
- Low expected risk of accuracy degradation (assumed worst case: masked features are ignored)

Files Modified:
- src/forecasting/dynamic_forecast.py: Include all feature categories
- src/forecasting/chronos_inference.py: batch_size=128, update docs
- app.py: Update feature count documentation (2,553 → 3,043)

Co-Authored-By: Claude <[email protected]>

Files changed (3)

app.py +4 -2
src/forecasting/chronos_inference.py +16 -13
src/forecasting/dynamic_forecast.py +25 -7

app.py CHANGED

@@ -140,10 +140,12 @@ with gr.Blocks(title="FBMC Chronos-2 Forecasting") as demo:
     generalizes directly to FBMC cross-border flows using historical patterns and future covariates.
     **Features**:
-    - 2,553 engineered features (weather, CNEC constraints, load forecasts, LTA)
     - 24-month historical context (Oct 2023 - Oct 2025)
     - Time-aware extraction (prevents data leakage)
-    - Probabilistic forecasts (10th/50th/90th percentiles)
     **Performance**:
     - Smoke test: ~30 seconds (1 border × 168 hours)

     generalizes directly to FBMC cross-border flows using historical patterns and future covariates.
     **Features**:
+    - 3,043 engineered features using past-only covariate masking
+    - Known-future: weather, generation, load forecasts (615 features)
+    - Past-only masked: CNEC outages, volatility, flows (~2,428 features)
     - 24-month historical context (Oct 2023 - Oct 2025)
     - Time-aware extraction (prevents data leakage)
+    - Probabilistic forecasts (9 quantiles: 1st/5th/10th/25th/50th/75th/90th/95th/99th)
     **Performance**:
     - Smoke test: ~30 seconds (1 border × 168 hours)
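
The percentile list in the app.py description duplicates the `quantile_levels` argument passed at inference time, so the two can drift apart. One way to keep them in sync is to derive the label from the list. This is a minimal sketch; `QUANTILE_LEVELS`, `ordinal`, and `percentile_labels` are hypothetical names that do not appear in the repository.

```python
QUANTILE_LEVELS = [0.01, 0.05, 0.10, 0.25, 0.50, 0.75, 0.90, 0.95, 0.99]


def ordinal(n: int) -> str:
    """Format an integer as an English ordinal, e.g. 1 -> '1st', 25 -> '25th'."""
    if 10 <= n % 100 <= 20:
        suffix = "th"
    else:
        suffix = {1: "st", 2: "nd", 3: "rd"}.get(n % 10, "th")
    return f"{n}{suffix}"


def percentile_labels(levels) -> str:
    """Join quantile levels as percentile ordinals separated by '/'."""
    return "/".join(ordinal(round(q * 100)) for q in levels)


label = f"{len(QUANTILE_LEVELS)} quantiles: {percentile_labels(QUANTILE_LEVELS)}"
print(label)
```

The same list could then be passed as `quantile_levels` and interpolated into the description text.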

src/forecasting/chronos_inference.py CHANGED

@@ -1,9 +1,9 @@
 #!/usr/bin/env python3
 """
-Chronos-2 Inference Pipeline with Covariate Support
 Standalone inference script for HuggingFace Space deployment.
-Uses predict_df() API to enable multivariate forecasting with weather, generation, CNEC outages.
-FORCE REBUILD: v1.3.0 - Context reduced to 128h for memory
 """
 import os
@@ -29,8 +29,10 @@ from .feature_availability import FeatureAvailability
 class ChronosInferencePipeline:
     """
-    Production inference pipeline for Chronos-2 zero-shot forecasting WITH COVARIATES.
-    Uses predict_df() API to leverage all 615 collected features (weather, generation, outages, etc.)
     Designed for deployment as API endpoint on HuggingFace Spaces.
     """
@@ -170,10 +172,11 @@ class ChronosInferencePipeline:
         total_start = time.time()
-        # PER-BORDER INFERENCE WITH COVARIATES
-        # Using predict_df() API to leverage all 615 features (weather, generation, CNEC outages, etc.)
-        print(f"\n[COVARIATE FORECAST] Running inference for {len(forecast_borders)} borders with 615 features...")
-        print(f"  Features: weather per zone, generation per zone, CNEC outages, LTA, load forecasts")
         for i, border in enumerate(forecast_borders, 1):
             # Clear GPU cache BEFORE each border to prevent memory accumulation
@@ -194,7 +197,7 @@ class ChronosInferencePipeline:
                 )
                 print(f"    Context shape: {context_data.shape}, Future shape: {future_data.shape}", flush=True)
-                print(f"    Using {len(future_data.columns)-2} future covariates for multivariate forecast", flush=True)
                 # Run covariate-informed inference using DataFrame API
                 # Note: predict_df() returns quantiles directly
@@ -203,12 +206,12 @@ class ChronosInferencePipeline:
                 with torch.inference_mode():
                     forecasts_df = pipeline.predict_df(
                         context_data,  # Historical data with ALL features
-                        future_df=future_data,  # Future covariates (615 features)
                         prediction_length=prediction_hours,
                         id_column='border',
                         timestamp_column='timestamp',
                         target='target',
-                        batch_size=32,  # Reduce from default 256 to save GPU memory
                         quantile_levels=[0.01, 0.05, 0.10, 0.25, 0.50, 0.75, 0.90, 0.95, 0.99]  # 9 quantiles for volatility
                     )
@@ -267,7 +270,7 @@ class ChronosInferencePipeline:
                     'num_features': len(future_data.columns) - 2  # Exclude border and timestamp
                 }
-                print(f"    [OK] Complete in {inference_time:.1f}s (WITH {len(future_data.columns)-2} covariates)", flush=True)
             except Exception as e:
                 import traceback

 #!/usr/bin/env python3
 """
+Chronos-2 Inference Pipeline with Past-Only Covariate Masking
 Standalone inference script for HuggingFace Space deployment.
+Uses predict_df() API with ALL 3,043 features leveraging Chronos-2's mask-based attention.
+FORCE REBUILD: v1.4.0 - Past-only covariates + batch_size=128 for volatility capture
 """
 import os
 class ChronosInferencePipeline:
     """
+    Production inference pipeline for Chronos-2 zero-shot forecasting WITH PAST-ONLY MASKING.
+    Uses predict_df() API with ALL 3,043 features (known-future + past-only covariates).
+    Past-only covariates (CNEC, volatility, historical flows) are masked in future → model
+    learns cross-feature correlations from historical context via attention mechanism.
     Designed for deployment as API endpoint on HuggingFace Spaces.
     """
         total_start = time.time()
+        # PER-BORDER INFERENCE WITH PAST-ONLY COVARIATE MASKING
+        # Using predict_df() API with ALL 3,043 features (known-future + past-only masked)
+        print(f"\n[PAST-ONLY MASKING] Running inference for {len(forecast_borders)} borders with 3,043 features...")
+        print(f"  Known-future: weather, generation, load forecasts (615 features)")
+        print(f"  Past-only masked: CNEC outages, volatility, historical flows (~2,428 features)")
         for i, border in enumerate(forecast_borders, 1):
             # Clear GPU cache BEFORE each border to prevent memory accumulation
                 )
                 print(f"    Context shape: {context_data.shape}, Future shape: {future_data.shape}", flush=True)
+                print(f"    Using {len(future_data.columns)-2} features (known-future + past-only masked)", flush=True)
                 # Run covariate-informed inference using DataFrame API
                 # Note: predict_df() returns quantiles directly
                 with torch.inference_mode():
                     forecasts_df = pipeline.predict_df(
                         context_data,  # Historical data with ALL features
+                        future_df=future_data,  # All 3,043 features (past-only masked)
                         prediction_length=prediction_hours,
                         id_column='border',
                         timestamp_column='timestamp',
                         target='target',
+                        batch_size=128,  # Increased from 32 for better temporal attention + faster inference
                         quantile_levels=[0.01, 0.05, 0.10, 0.25, 0.50, 0.75, 0.90, 0.95, 0.99]  # 9 quantiles for volatility
                     )
                     'num_features': len(future_data.columns) - 2  # Exclude border and timestamp
                 }
+                print(f"    [OK] Complete in {inference_time:.1f}s ({len(future_data.columns)-2} features with past-only masking)", flush=True)
             except Exception as e:
                 import traceback

src/forecasting/dynamic_forecast.py CHANGED

@@ -9,7 +9,16 @@ Key Concepts:
 - run_date: When the forecast is made (e.g., "2025-09-30 23:00")
 - forecast_horizon: Always 14 days (D+1 to D+14, fixed at 336 hours)
 - context_window: Historical data before run_date (typically 512 hours)
-- future_covariates: Features available for forecasting (603 full + 12 partial)
 """
 from typing import Dict, Tuple, Optional
@@ -156,9 +165,16 @@ class DynamicForecast:
         """
         Extract future covariate data for D+1 to D+14.
-        Future covariates include:
-        - Full-horizon D+14: 603 features (always available)
-        - Partial D+1: 12 features (load forecasts, will be masked D+2-D+14)
         Args:
             run_date: Forecast run timestamp
@@ -179,10 +195,12 @@ class DynamicForecast:
             (pl.col('timestamp') <= forecast_end)
         )
-        # Select only future covariate features (603 full + 12 partial)
         future_features = (
-            self.categories['full_horizon_d14'] +
-            self.categories['partial_d1']
         )
         # Build future DataFrame

 - run_date: When the forecast is made (e.g., "2025-09-30 23:00")
 - forecast_horizon: Always 14 days (D+1 to D+14, fixed at 336 hours)
 - context_window: Historical data before run_date (typically 512 hours)
+- future_covariates: ALL 3,043 features (leveraging Chronos-2 past-only masking)
+  * 603 full-horizon (known future)
+  * 12 partial D+1 (masked D+2-D+14)
+  * ~2,428 historical-only (masked as past-only covariates)
+Chronos-2 Past-Only Covariate Masking:
+- Historical-only features have NaN future values → Chronos-2 sets mask=0
+- Model learns cross-feature correlations from historical context
+- Attention mechanism uses dimensional structure even when values masked
+- Enables learning of CNEC/volatility patterns without future knowledge
 """
 from typing import Dict, Tuple, Optional
         """
         Extract future covariate data for D+1 to D+14.
+        Future covariates include ALL 3,043 features using Chronos-2's past-only masking:
+        - Full-horizon D+14: 603 features (known future values)
+        - Partial D+1: 12 features (load forecasts, masked D+2-D+14)
+        - Historical-only: ~2,428 features (MASKED as past-only covariates)
+        Past-only covariates leverage Chronos-2's mask-based attention:
+        - Future values are NaN (unknown)
+        - Chronos-2 sets mask=0 for these dimensions
+        - Model learns cross-feature correlations from historical context
+        - Attention mechanism uses structure even when future values masked
         Args:
             run_date: Forecast run timestamp
             (pl.col('timestamp') <= forecast_end)
         )
+        # Include ALL features (3,043 total) to leverage past-only covariate masking
+        # Historical-only features will be NaN in future → Chronos-2 masks them automatically
         future_features = (
+            self.categories['full_horizon_d14'] +    # 603 known-future
+            self.categories['partial_d1'] +          # 12 partial
+            self.categories['historical_only']       # ~2,428 past-only (MASKED!)
         )
         # Build future DataFrame
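
Because the documented totals (603 + 12 + ~2,428 = 3,043) are hard-coded in several docstrings and print statements, a small sanity check on the concatenated feature list can catch drift or duplicated names. The sketch below uses placeholder feature names, treats the approximate historical-only count as exactly 2,428, and introduces a hypothetical helper `build_future_features`. The category keys are the ones used in the diff.

```python
from collections import Counter

# Placeholder category contents; real names come from the feature catalogue.
categories = {
    "full_horizon_d14": [f"full_{i}" for i in range(603)],
    "partial_d1": [f"partial_{i}" for i in range(12)],
    "historical_only": [f"hist_{i}" for i in range(2428)],  # assumed exact count
}


def build_future_features(categories):
    """Concatenate the three categories and reject duplicated feature names."""
    features = (
        categories["full_horizon_d14"]
        + categories["partial_d1"]
        + categories["historical_only"]
    )
    duplicates = [name for name, n in Counter(features).items() if n > 1]
    if duplicates:
        raise ValueError(f"Features listed in more than one category: {duplicates[:5]}")
    return features


future_features = build_future_features(categories)
known_future = len(categories["full_horizon_d14"]) + len(categories["partial_d1"])
print(f"{len(future_features)} total, {known_future} known-future")
```

Logging the computed counts instead of literals would keep the console output accurate if a category changes size.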