# HuggingFace Space Setup Guide - FBMC Chronos 2
**IMPORTANT**: This guide covers Day 3, Hours 1-4 of the implementation plan. Complete all steps before proceeding to inference pipeline development.
---
## Prerequisites
- HuggingFace account: https://huggingface.co/join
- HuggingFace write token: https://huggingface.co/settings/tokens
- Git installed locally
- Project files ready at: `C:\Users\evgue\projects\fbmc_chronos2`
---
## STEP 1: Create HuggingFace Dataset Repository (10 min)
### 1.1 Create Dataset on HuggingFace Web UI
1. Go to: https://huggingface.co/new-dataset
2. Fill in:
- **Owner**: YOUR_USERNAME
- **Dataset name**: `fbmc-features-24month`
- **License**: MIT
- **Visibility**: **Private** (contains project data)
3. Click "Create dataset"
### 1.2 Upload Data to Dataset
#### Option A: Using the upload script (Recommended)
```bash
# 1. Add your HF token to .env file
echo "HF_TOKEN=hf_..." >> .env
# 2. Edit the script to replace YOUR_USERNAME with your actual HF username
# Edit: scripts/upload_to_hf_datasets.py
# Replace all instances of "YOUR_USERNAME" with your HuggingFace username
# 3. Install required packages
.venv\Scripts\uv.exe pip install datasets huggingface-hub
# 4. Run the upload script
.venv\Scripts\python.exe scripts\upload_to_hf_datasets.py
```
The script will upload:
- `features_unified_24month.parquet` (~25 MB)
- `metadata.csv` (2,553 features)
- `target_borders.txt` (38 target borders)
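If you want to check what the script does (or adapt it), the core upload logic can be sketched with `huggingface_hub` as follows. This is a minimal illustration, not the actual contents of `scripts/upload_to_hf_datasets.py`; the local paths match the files listed in Option B below.

```python
from huggingface_hub import HfApi

# Token from your .env file; repo_id must match the dataset created in step 1.1
api = HfApi(token="hf_...")
repo_id = "YOUR_USERNAME/fbmc-features-24month"

# Local path -> name in the dataset repo
files = {
    "data/processed/features_unified_24month.parquet": "features_unified_24month.parquet",
    "data/processed/features_unified_metadata.csv": "metadata.csv",
    "data/processed/target_borders_list.txt": "target_borders.txt",
}
for local_path, remote_name in files.items():
    api.upload_file(
        path_or_fileobj=local_path,
        path_in_repo=remote_name,
        repo_id=repo_id,
        repo_type="dataset",
    )
```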
#### Option B: Manual upload via web UI
1. Go to: https://huggingface.co/datasets/YOUR_USERNAME/fbmc-features-24month
2. Click "Files" tab → "Add file" → "Upload files"
3. Upload:
- `data/processed/features_unified_24month.parquet`
- `data/processed/features_unified_metadata.csv` (rename to `metadata.csv`)
- `data/processed/target_borders_list.txt` (rename to `target_borders.txt`)
### 1.3 Verify Dataset Uploaded
Visit: `https://huggingface.co/datasets/YOUR_USERNAME/fbmc-features-24month`
You should see:
- `features_unified_24month.parquet` (~25 MB)
- `metadata.csv` (~200 KB)
- `target_borders.txt` (~1 KB)
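You can also check programmatically from a local Python session (a token is required because the dataset is private):

```python
from huggingface_hub import HfApi

# List the files in the dataset repository
api = HfApi(token="hf_...")
print(api.list_repo_files("YOUR_USERNAME/fbmc-features-24month", repo_type="dataset"))
```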
---
## STEP 2: Create HuggingFace Space (15 min)
### 2.1 Create Space on HuggingFace Web UI
1. Go to: https://huggingface.co/new-space
2. Fill in:
- **Owner**: YOUR_USERNAME
- **Space name**: `fbmc-chronos2-forecast`
- **License**: MIT
- **Select SDK**: **JupyterLab**
- **Space hardware**: Click "Advanced" → Select **A10G GPU (24GB)** ($30/month)
- **Visibility**: **Private** (contains API keys)
3. Click "Create Space"
**IMPORTANT**: The Space will start building immediately. The first build takes ~10-15 minutes.
### 2.2 Configure Space Secrets
While the Space is building:
1. Go to Space → Settings → Variables and Secrets
2. Add these secrets (click "New secret"):
| Name | Value | Description |
|------|-------|-------------|
| `HF_TOKEN` | `hf_...` | Your HuggingFace write token |
| `ENTSOE_API_KEY` | `your_key` | ENTSO-E Transparency API key |
3. Click "Save"
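Inside the running Space, these secrets are exposed as environment variables, so code can read them without hard-coding keys:

```python
import os

# Space secrets surface as environment variables at runtime
hf_token = os.environ.get("HF_TOKEN")
entsoe_key = os.environ.get("ENTSOE_API_KEY")
print(f"HF_TOKEN set: {hf_token is not None}")
print(f"ENTSOE_API_KEY set: {entsoe_key is not None}")
```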
### 2.3 Wait for Initial Build
- Monitor build logs: Space → Logs tab
- Wait for message: "Your Space is up and running"
- This can take 10-15 minutes for first build
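As an alternative to watching the Logs tab, you can poll the build state from a local Python session (assumes `huggingface_hub` is installed locally):

```python
from huggingface_hub import HfApi

api = HfApi(token="hf_...")
runtime = api.get_space_runtime("YOUR_USERNAME/fbmc-chronos2-forecast")
print(runtime.stage)  # e.g. "BUILDING" during the build, "RUNNING" when ready
```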
---
## STEP 3: Clone Space Locally (5 min)
### 3.1 Clone the Space Repository
```bash
# Navigate to projects directory
cd C:\Users\evgue\projects
# Clone the Space (replace YOUR_USERNAME)
git clone https://huggingface.co/spaces/YOUR_USERNAME/fbmc-chronos2-forecast
# Navigate into Space directory
cd fbmc-chronos2-forecast
```
### 3.2 Copy Project Files to Space
```bash
# Copy source code
cp -r ../fbmc_chronos2/src ./
# Copy requirements (rename to requirements.txt)
cp ../fbmc_chronos2/hf_space_requirements.txt ./requirements.txt
# Copy .env.example (for documentation)
cp ../fbmc_chronos2/.env.example ./
# Create directories
mkdir -p data/evaluation
mkdir -p notebooks
mkdir -p tests
```
### 3.3 Create Space README.md
Create `README.md` in the Space directory with:
```markdown
---
title: FBMC Chronos 2 Forecast
emoji: ⚡
colorFrom: blue
colorTo: green
sdk: jupyterlab
sdk_version: "4.0.0"
app_file: app.py
pinned: false
license: mit
hardware: a10g-small
---
# FBMC Flow Forecasting - Zero-Shot Inference
Amazon Chronos 2 for cross-border capacity forecasting.
## Features
- 2,553 features (615 future covariates)
- 38 bidirectional border targets (19 physical borders)
- 8,192-hour context window
- Dynamic date-driven inference
- A10G GPU acceleration
## Quick Start
### Launch JupyterLab
1. Open this Space
2. Wait for build to complete (~10-15 min first time)
3. Click "Open in JupyterLab"
### Run Inference
See `notebooks/01_test_inference.ipynb` for examples.
## Data Source
- **Dataset**: [YOUR_USERNAME/fbmc-features-24month](https://huggingface.co/datasets/YOUR_USERNAME/fbmc-features-24month)
- **Size**: 25 MB (17,544 hours × 2,553 features)
- **Period**: Oct 2023 - Sept 2025
## Model
- **Chronos 2 Large** (710M parameters)
- **Pretrained**: amazon/chronos-t5-large
- **Zero-shot**: No fine-tuning in MVP
## Cost
- A10G GPU: $30/month
- Storage: <1 GB (free tier)
```
### 3.4 Push Initial Files to Space
```bash
# Stage files
git add README.md requirements.txt .env.example src/
# Commit
git commit -m "feat: initial Space setup with A10G GPU and source code"
# Push to HuggingFace
git push
```
**IMPORTANT**: After pushing, the Space will rebuild (~10-15 min). Monitor the build in the Logs tab.
---
## STEP 4: Test Space Environment (10 min)
### 4.1 Wait for Build to Complete
- Go to Space → Logs tab
- Wait for: "Your Space is up and running"
- If build fails, check requirements.txt for dependency conflicts
### 4.2 Open JupyterLab
1. Go to your Space: https://huggingface.co/spaces/YOUR_USERNAME/fbmc-chronos2-forecast
2. Click "Open in JupyterLab" (top right)
3. JupyterLab will open in new tab
### 4.3 Create Test Notebook
In JupyterLab, create `notebooks/00_test_setup.ipynb`:
**Cell 1: Test GPU**
```python
import torch

print(f"GPU available: {torch.cuda.is_available()}")
if torch.cuda.is_available():
    print(f"GPU device: {torch.cuda.get_device_name(0)}")
    print(f"GPU memory: {torch.cuda.get_device_properties(0).total_memory / 1e9:.2f} GB")
```
Expected output:
```
GPU available: True
GPU device: NVIDIA A10G
GPU memory: 22.73 GB
```
**Cell 2: Load Dataset**
```python
from datasets import load_dataset
import polars as pl
# Load unified features from HF Dataset
dataset = load_dataset("YOUR_USERNAME/fbmc-features-24month", split="train")
df = pl.from_pandas(dataset.to_pandas())
print(f"Shape: {df.shape[0]:,} rows × {df.shape[1]:,} columns")
print(f"Columns: {df.columns[:10]}")
print(f"Date range: {df['timestamp'].min()} to {df['timestamp'].max()}")
```
Expected output:
```
Shape: 17,544 rows × 2,553 columns
Columns: ['timestamp', 'cnec_t1_binding_10T-DE-FR-000068', ...]
Date range: 2023-10-01 00:00:00 to 2025-09-30 23:00:00
```
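As an alternative to going through `datasets`, a recent Polars version can read the parquet file directly from the Hub via the `hf://` scheme (it picks up the `HF_TOKEN` environment variable for private repos):

```python
import polars as pl

# Read the parquet file straight from the dataset repository
df = pl.read_parquet(
    "hf://datasets/YOUR_USERNAME/fbmc-features-24month/features_unified_24month.parquet"
)
```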
**Cell 3: Load Metadata**
```python
import pandas as pd
# Load metadata
metadata = pd.read_csv(
    "hf://datasets/YOUR_USERNAME/fbmc-features-24month/metadata.csv"
)
# Check future covariates
future_covs = metadata[metadata['is_future_covariate'] == 'true']['feature_name'].tolist()
print(f"Future covariates: {len(future_covs)}")
print(f"Historical features: {len(metadata) - len(future_covs)}")
print(f"\nCategories: {metadata['category'].unique()}")
```
Expected output:
```
Future covariates: 615
Historical features: 1,938
Categories: ['CNEC_Tier1', 'CNEC_Tier2', 'Weather', 'LTA', 'Temporal', ...]
```
**Cell 4: Test Chronos 2 Loading**
```python
import torch
from chronos import ChronosPipeline

# Load Chronos 2 Large (this will download ~3 GB on first run)
print("Loading Chronos 2 Large...")
pipeline = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-large",
    device_map="cuda",
    torch_dtype=torch.bfloat16,
)
print("[OK] Chronos 2 loaded successfully")
print(f"Model device: {next(pipeline.model.parameters()).device}")
```
Expected output:
```
Loading Chronos 2 Large...
[OK] Chronos 2 loaded successfully
Model device: cuda:0
```
**IMPORTANT**: The first time you load Chronos 2, it will download ~3 GB. This takes 5-10 minutes. Subsequent runs will use cached model.
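**Cell 5 (optional): Smoke-Test Forecast**

As a final sanity check, you can run a minimal zero-shot forecast on a single column. This cell is not part of the original plan; the column choice is arbitrary and only verifies that the data and model work together:

```python
import torch

# Pick any numeric column as an illustrative target (column 0 is the timestamp)
target_col = df.columns[1]
context = torch.tensor(df[target_col].to_numpy(), dtype=torch.float32)[-512:]

# Returns a tensor of shape (num_series, num_samples, prediction_length)
forecast = pipeline.predict([context], prediction_length=24)
print(f"Forecast shape: {forecast.shape}")
```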
### 4.4 Run All Cells
- Execute all cells in order
- Verify all outputs match expected results
- If any cell fails, check error messages and troubleshoot
---
## STEP 5: Commit Test Notebook to Space
```bash
# In JupyterLab terminal or locally
git add notebooks/00_test_setup.ipynb
git commit -m "test: verify GPU, data loading, and Chronos 2 model"
git push
```
---
## Troubleshooting
### Build Fails
**Error**: `Collecting chronos-forecasting>=2.0.0: Could not find a version...`
- **Fix**: Check that the requested `chronos-forecasting` version exists on PyPI
- Try: `chronos-forecasting==2.0.0` (pin exact version)
**Error**: `torch 2.0.0 conflicts with transformers...`
- **Fix**: Pin compatible versions in requirements.txt
- Try: `torch==2.1.0` and `transformers==4.36.0`
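For reference, a fully pinned `requirements.txt` could look like the following; the versions are illustrative and should be checked against PyPI, not copied blindly:

```text
torch==2.1.0
transformers==4.36.0
chronos-forecasting==2.0.0
datasets
huggingface-hub
polars
pandas
```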
### GPU Not Detected
**Issue**: `GPU available: False`
- **Check**: Space Settings → Hardware → Should show "A10G"
- **Fix**: Restart Space (Settings → Restart Space)
### Dataset Not Found
**Error**: `Repository Not Found for url: https://huggingface.co/datasets/...`
- **Check**: Dataset name matches in code
- **Fix**: Replace `YOUR_USERNAME` with actual HuggingFace username
- **Verify**: Dataset is public or HF_TOKEN is set in Space secrets
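For a private dataset, you can also authenticate explicitly before calling `load_dataset`, assuming `HF_TOKEN` is set as a Space secret:

```python
import os
from huggingface_hub import login

# Register the token so subsequent Hub calls (including load_dataset) can use it
login(token=os.environ["HF_TOKEN"])
```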
### Out of Memory
**Error**: `CUDA out of memory`
- **Cause**: The A10G's 24 GB of VRAM may not be enough for an 8,192-hour context plus a large batch
- **Fix**: Reduce context window to 512 hours temporarily
- **Fix**: Process borders in smaller batches (10 at a time)
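A minimal sketch of batched inference, assuming per-border histories are held as 1-D tensors (the function and variable names here are illustrative):

```python
import torch

def forecast_in_batches(pipeline, series_by_border, prediction_length=168, batch_size=10):
    """Forecast each border's series in chunks to bound GPU memory use.

    series_by_border: dict mapping border name -> 1-D torch.Tensor of history.
    """
    results = {}
    names = list(series_by_border)
    for i in range(0, len(names), batch_size):
        chunk = names[i : i + batch_size]
        context = [series_by_border[name] for name in chunk]
        # Shape: (len(chunk), num_samples, prediction_length)
        forecasts = pipeline.predict(context, prediction_length)
        for name, forecast in zip(chunk, forecasts):
            results[name] = forecast
        torch.cuda.empty_cache()  # release cached blocks between chunks
    return results
```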
---
## Next Steps (Day 3, Hours 5-8)
Once the test notebook runs successfully:
1. **Hour 5-6**: Create `src/inference/data_fetcher.py` (AsOfDateFetcher class)
2. **Hour 7-8**: Create `src/inference/chronos_pipeline.py` (ChronosForecaster class)
3. **Smoke test**: Run inference on 1 border × 7 days
See main implementation plan for details.
---
## Success Criteria
At the end of STEP 5, you should have:
- [x] HF Dataset repository created and populated (3 files)
- [x] HF Space created with A10G GPU ($30/month)
- [x] Space secrets configured (HF_TOKEN, ENTSOE_API_KEY)
- [x] Source code pushed to Space
- [x] Space builds successfully (~10-15 min)
- [x] JupyterLab accessible
- [x] GPU detected (NVIDIA A10G, 22.73 GB)
- [x] Dataset loads (17,544 × 2,553)
- [x] Metadata loads (2,553 features, 615 future covariates)
- [x] Chronos 2 loads successfully (~3 GB download first time)
- [x] Test notebook committed to Space
**Estimated time**: ~40 minutes active work + ~25 minutes waiting for builds
---
**Questions?** Check HuggingFace Spaces documentation: https://huggingface.co/docs/hub/spaces