<TODO materials from our IEEE article>
TODO
Opening the Black Box of wind forecasting
Grid Raven downscales wind to specific locations with the help of deep learning. In this post we’ll open the ‘black box’ to explore how the machine predicts the wind.
First, what does the model do?
The wind downscaling model has various ‘branches’ of input, including LIDAR data (high detailed geographical elevation maps, accurate to 1m), the Numerical Weather Prediction (NWP) for a grid of locations (the output of physics simulators, run many times a day on supercomputers) and some spatial and temporal metadata about a prediction. From this, the model is expected to predict, to the exact metre, the wind in a specific location - not a simple task by any means. You’d expect the model to analyse every metre of terrain with its trees, houses, roads; anything at all that could cause the wind to change direction even the slightest bit to achieve this, all along power lines crossing entire countries. But what is it really learning from this data? Can its results be trusted, and what can we do to help nudge the model in the right direction? To answer these questions, we can use various methods for interpreting the workings of these ‘black box’ models.
Which inputs are the most important for downscaling wind?
One method for exploring the workings of the model is perturbation analysis, where we ‘get rid’ of inputs, and see how much this affects the model’s prediction of windspeed in both the East and North directions across 20 different samples.
As seen in the above results, the input surface pressure and windspeed from the NWP are both key to the model’s prediction, consistently impacting the prediction when missing. Furthermore, the model seems to think that surface models and the slope of the terrain are the next most significant factors for windspeed, also good signs.
Terrain slope and surface pressure affect wind speed
To try and gain greater insights into how each feature is impacting the result, we can look at the perturbation results for specific samples. We can take the below example, taken from a Finnish weather station on top of a hill during a windy day.
For this example, the model was able to predict the wind direction with an accuracy of just 3 degrees! Let’s consider the following decision plot for the model output predicting windspeed to the East:
Like usual, the model is using surface pressure and windspeed as the main weather features for the prediction, as well as both the surface model and terrain slope. But here, a strange ‘zig zag’ shape appears.
The machine sequentially accounts with landscape features
If we start from the bottom of the diagram and work our way up, we can see that removing the first 7 features had no or negligible effect on the model prediction. The next 4 had little effect (each changing the prediction by just 0.05 m/s at max), and can be ignored. But from ‘Windspeed going north’ and further up, the impacts they had on the model decision become increasingly significant. If we consider just the inputs that tell the model about the topography, ‘surface model’ and ‘slope of the terrain’, we can see that they have a positive impact on the model prediction, moving the final prediction towards -0.25 m/s. The weather inputs on the other hand, seem to have a negative effect on the result, pushing the final prediction value towards -0.55, where it finishes its decision at.
This discourse between different types of input to the model is a good indication that the model is considering the features in a desirable way. It shows that the model is taking into account the interactions between weather and topography, even if one gets in the way of another, leading to more reliable and accurate predictions. However, this example also hints at some issues with the model, including the little consideration of the terrain model in an example where you’d expect the height and shape of the hill to be a significant factor, as well as the lack of consideration of both latitude and longitude, which are generally key indicators of the type of weather to be expected in a geography.
While this is just a single example, repeated indications like the ones described above can hint at larger issues or confirm positive improvements in the model’s performance. The theories taken away from these can help shape future model development targets and strategies. For instance, comparisons between the considerations of different models across a wide range of samples was a key factor for the focus on improving the model processing LIDAR data, leading to the continued development of masked auto encoders as an alternative pre-training method.
Wind prediction accuracy improved by 39%
A light wind doubles grid capacity
Out of all the weather parameters wind can have the biggest effect on the ability of high-voltage power lines to transport energy. An increase of 2 m/s at right angles nearly doubles the capacity of a power line to transmit energy (source).
Accurately predicting the wind is therefore critical in order to unlock the maximum potential of transmission and distribution assets. Unfortunately, existing weather forecasts are not accurate enough, especially in complex terrain, to allow utilities to take advantage of this data.
Machine learning for wind prediction
Grid Raven is tackling the wind prediction challenge with the help of machine learning. Our model takes as input the widely used numerical weather predictions (see for example https://weather.us/model-charts) and downscales them with the help of detailed landscape data.
We set out to benchmark the ability of our model to improve wind forecasts. To do this we obtained data from 37 official weather stations by Estonia’s Estonian Environment Agency. We trained our model on data from 2020 and 2021 then validated it against data from 2022. We then benchmarked our results against Met.no’s forecast for the same locations.
Improving accuracy by 39%
The mean absolute error for wind speed in Grid Raven’s forecast was 0.79 m/s, while the error in Met.no was 1.30 m/s. This is an improvement in accuracy of 39%.
To illustrate the case we chose data from the weather station in Ristna. It is known to meteorologists in Estonia that Ristna is a challenging weather station for wind forecasts because it is sheltered to northerly winds.
These above graph shows hourly wind speeds over three days in January 2022 at Ristna. The numerical weather prediction predicted strong northerly winds for 17th of January 2022. However, the actual measured wind speeds remained at around 6 m/s, which was correctly predicted by our model.
This is a significant improvement in accuracy of predicting wind. Accurate wind prediction is central to safely increasing the amount of power transmitted over the high-voltage network.
The next step is to demonstrate the accuracy of the model for predicting wind in locations that it has not encountered during training.