Developed a production-grade real-time weather forecasting system that streams hourly meteorological data from OpenMeteo API for Hobart Station, Tasmania. The pipeline implements a complete MLOps workflow using PySpark for distributed ETL processing, Medallion Architecture for data quality governance, and multiple ML forecasting models including LSTM neural networks, ARIMA time series, and AutoML for comprehensive weather prediction capabilities.
The system processes over 24 weather parameters including temperature, humidity, wind patterns, atmospheric pressure, and precipitation data through a three-tier data architecture (Bronze-Silver-Gold), enabling robust feature engineering and model training for accurate short-term and medium-term weather forecasting with automated model selection and ensemble predictions.