Smooth Forecast Help
SmoothForecast.com provides free time series forecasting capability on the web. That is you can enter a sequence of numbers representing anything from daily rainfall amounts for the last 5 years to monthly food expenditures for the last 36 months and forecast the upcoming values based on that history. You can specify the number of forecasts that are produced. You can also specify whether the history seems to have a trend or seasonal cycle (i.e. the history goes up and down in regular intervals). Also if the history has values that seem too large or too small (called “spikes”), you can have them filtered out by using the spike smoothing feature. And finally, to get a sense of how good the forecast is, you can specify what is called a “hold back”. This means you can “hold back” a specified number of values from the end of the history and produce a forecast based upon the history prior to the held back values. The forecast will then be compared to the held back history and metrics produced that can be used to assess the forecast accuracy. The forecast will also be listed side-by-side with the held back history.
All the values are graphed so you can compare the history with the held back forecast or with the actual forecast. By visually comparing held back forecasts with the actual forecasts, you can get a sense of how good the forecast is based on how well it performed in the past. If spike filtering is enabled that line plot will be layered on top of the history so you can see what values were filtered and how much.
As an example consider the following 24 months of a food budget. The first value is the oldest and the last the most recent. Here are the values.
Here is a snapshot of SmoothForecast.com using this historical data. Note that spike smoothing and a damped trend has been selected. The desire was to forecast the food expenditure a year ahead (i.e. 12 months) and see how the forecast would have looked if it was done 12 months ago (i.e. Number to Hold Back is 12). Looking at the graph one can see the forecast of the hold back line does match the forecast line fairly well giving us confidence this is a reasonable forecast and the general trend for the food budget is to increase in cost for the next 12 months. Note the MdAPE (Median Absolute Percentage Error) value for the hold back is about 10.4%. This means of the 12 hold back values the median error was a little less than 10.5%. Plus the forecast was better than just copying forward the previous value (i.e. MASE (Mean Absolute Scaled Error) is less than 1).
As another example consider the following nearly 15 years of a monthly electric bill. The first value is the oldest and the last the most recent. Here are the values.
Below is a snapshot of SmoothForecast.com using this
historical data. This series has a cycle but we are not certain what it is. So
the Automatic Seasonal Cycle radio button is selected allowing the system to
fill in the Seasonal Cycle text box when the Process Series button is selected.
Note the system determines the cycle to be 12 months (i.e. yearly cycle). 12
forecasts were requested and 12 held back for comparison purposes. Note the
Smooth Spikes checkbox is not selected in this example since it actually makes
the forecast worse. The way to see this is to unselect Smooth Spikes and compare with the Holdback
Results before and after selecting Smooth Spikes. The Series Trend is set to
Linear since there is a gentle slope upwards in the data. The way to see which
Series Trend works the best is to set the Number to Hold Back to a value large
enough to be a good sample of data and try the different Series Trend
selections looking at the Holdback Results. For seasonal data a good value is
some multiple of the Cycle. In this example a multiple of 1 was used since 12
values is a good enough recent sample. Note the MdAPE (Median Absolute
Percentage Error) is a little more than 11%. Also the model MASE (Mean Absolute Scaled Error)
is less than 1.
Form Field Descriptions
There are several form fields that need to be specified to use SmoothForecast.com to generate forecasts. Below is a description of each and whether the field is required or optional. Once these fields are completed, the Process Series button must be pressed to generate results. The accesskey 's' may be used as a keyboard shortcut. For example, if using the Chrome, Firefox, or Safari browser on MacOS, you can press control + option + s to press the Process Series button. On Windows, using Firefox, you can press alt + shift + s. If using Chrome, Edge, or Internet Explorer on Windows you can press alt + s.
· Series (required)
This field is where the historical data is entered. One numerical value is entered per line starting with the oldest value. Values can be pasted into the field and edited before processing or after processing for subsequent processing runs (i.e. when the Process Series button is pressed). The series is preserved in the field from one processing run to another. Also the data can be loaded from a file by selecting the Browse button on the Load Series File field. The series loaded from a file will be loaded into the Series field when the Process Series button is pressed.
· Load Series File (optional)
This is field is used to load a numerical series from a file selected using the Browse button. Any file loaded must be a text file and have one numerical value per line starting with the oldest value. The values will be loaded from the file and placed into the Series field when the Process Series button is selected.
· Smooth Spikes (optional)
This checkbox, if selected, causes values considered outside the series moving average to be clipped back to the moving average value. The default is to have the checkbox selected.
· Seasonal Cycle (optional)
These radio buttons, if one is selected other than No Seasonal Cycle, cause the forecast to be based on a regularly occurring cycle in the series (e.g. 12 months in a year, 7 days in a week, etc) depending on what the series values represent (e.g. a month value, a day of the week value, etc). If the Automatic Seasonal Cycle radio button is selected, the Cycle value will be filled in automatically when the Process Series button is selected. If there is no detectable seasonal cycle, the Cycle value will be blank. If the Specify Seasonal Cycle radio button is selected, the Cycle value must also be specified indicating the size of the cycle (e.g. 12 for months in a year, 7 for days in a week, etc). The default is to have Automatic Seasonal Cycle selected.
· Cycle (required if Specify Seasonal Series is selected)
This value specifies the size of the regularly occurring cycle in the series. See Seasonal Series field for more information.
· Series Trend (optional)
This drop down menu indicates whether there is a trend in the data either Linear or Damped. A Linear trend places the forecast which may include seasonal cycles on a straight line either up or down. A Damped trend places the forecast which may include seasonal cycles on a curve that gently slopes up or down and then levels off. The default is to have no trend in the forecast.
· Number of Forecasts (required)
The number of forecasts to generate must be specified and be at least 1. Generally it is a good idea to specify enough forecasts so they can be visually compared with the series using the generated graph. The resulting forecasts are listed starting with the most recent value in the Forecasts text box. Also the resulting graph shows the forecast on the end of the series. If the series had Smooth Spikes selected, the history graph line is overlaid with a line showing the resulting series when the history has spikes clipped and that would be the data used to generate the forecast instead of the actual history.
· Number to Hold Back (optional)
The number to hold back tells the system to generate forecasts using less recent data so the forecast can be compared with the more recent actual values that occurred. For example, if the Number to Hold Back is 12, the last 12 values in the series are set aside and 12 forecasts are generated using the series up to but not including those values set aside. The forecasts are then compared with the values set aside. The actual values are listed alongside the corresponding forecast in the Holdback Results text box. Also included are several metrics that can be used to gauge how well the forecasts matched the actual values. Each metric is described below:
§ RMSE (Root Mean Squared Error)
This metric value gives the square root of the average value of the squared difference between the forecast and the actual value. Having said that the metric is a good indicator of how far away the forecast is from the actual value in absolute terms. That is, the value is in the units of the series (e.g. dollars/cents, mileage, etc). Note the RMSE can be skewed by spikes in the series.
§ MASE (Mean Absolute Scaled Error)
This is the average of the absolute value of the difference between the forecast and the actual value divided by the scale determined by using a random walk on the history prior to the holdback period. This metric is a relative indicator of how well the forecast of the holdback period compares with the simplest model being used to forecast the history. The idea is to get a sense of how much better the forecast model behaves relative to the simplest possible forecast model. Ideally, the resulting ratio is less than 1.0 implying the forecast model is superior to copying forward the last value. On the other hand, forecasting into a large holdback period has a larger likelihood of being incorrect in further holdback values. So values larger than 1.0 can be expected in those cases and still have reasonable forecasts. But if the MASE is large (e.g. greater than say 2.0) for holdback periods less than 10% of the size of the history, the model forecast is probably not that good.
§ MAPE (Mean Absolute Percentage Error)
This is the average of the absolute value of the difference between the forecast and the actual value divided by the actual value and is expressed as a percentage. This metric is a relative indicator of how far away the forecast is from the actual value. Like the RMSE it is subject to being skewed by spikes.
§ MdAPE (Median Absolute Percentage Error)
This is the median (i.e. center value) of the absolute value of the difference between the forecast and the actual value divided by the actual value and is expressed as a percentage. This metric is a relative indicator of how far away the forecast is from the actual value. Unlike the MAPE value, this value is not affected by spikes. Consequently, it is the better metric to use.
§ SMAPE (Symmetric Mean Absolute Percentage Error)
This is the average of the absolute value of the difference between the forecast and the actual value divided by the actual value plus the forecast divided by 2 and is expressed as a percentage. This metric is a relative indicator of how far away the forecast is from the center of the actual and forecast values. The idea behind this metric is to have the resulting percentage be between 0% and 200%. However under and over forecasts are not given equal weight.
Generally speaking it is a good idea to gauge a hold back forecast using 2 or more of the metrics listed. Usually the MASE and the MdAPE are the best indicators of the hold back forecast performance. Finally, note the graph line corresponding to the hold back forecast is overlayed on the historical values that it is forecasting to allowing a visual comparison between the actual values and the forecast.
© 2010-2022 John Eldreth All rights reserved. Contact Us At email@example.com