Smooth Forecast Help
SmoothForecast.com provides free time series forecasting capability on the web. That is you can enter a sequence of numbers representing anything from monthly rainfall amounts for the last 10 years to monthly food expenditures for the last 36 months and forecast the upcoming values based on that history. You can specify the number of forecasts that are produced. You can also specify whether the history seems to have a trend or seasonal cycles (i.e. the history goes up and down in regular intervals). Also if the history has values that seem too large or too small (called "spikes"), you can have them filtered out by using the spike smoothing feature. And to get a sense of how accurate the forecast is, you can specify what is called a "hold back". This means you can "hold back" a specified number of values from the end of the history and have the system forecast based upon the history prior to the held back values. The system will then compare the forecast with the held back history and produce metrics that can be used to assess how accurate the forecast is. It will also list the forecast side-by-side with the held back history.
Additionally rather than determine the trend, seasonal cycle, or spike smoothing settings yourself, you can have the system determine them for you by selecting the metric to minimize over the hold back period (i.e. the auto-select feature). The list of supported metrics is provided below. If the auto-select feature is enabled, any user specified trend, seasonal cycle, or spike smoothing settings are discarded.
All the values are graphed so you can compare the history with the held back forecast or with the forecast. Also by visually comparing held back forecasts with the forecasts, you can get a sense of how good the forecast is based on how well it performed for past history. If spike filtering is enabled that graph will be layered on top of the history so you can see what values were chopped off and how much.
As an example consider the following 24 months of a food budget. The first value is the oldest and the last the most recent. Here are the values.
Here is a snapshot of SmoothForecast.com using this historical data. The auto-select feature has been enabled by selecting the MedAPE (Median Absolute Percentage Error) metric to minimize over the hold back period. The system determined that spike smoothing should be enabled, a seasonal cycle of 4 should be specified, and that a linear trend should be selected. The desire was to forecast the food expenditure a year ahead (i.e. 12 months) and see how the forecast would have looked if it was done 12 months ago (i.e. Number to Hold Back is 12). Looking at the graph one can see the forecast of the hold back line does match the forecast line fairly well giving us confidence this is a reasonable forecast and the general trend for the food budget is to increase in cost for the next 12 months. Note the MedAPE value for the hold back is about 7.8% (and the MASE is significantly less than 1 suggesting a good model fit).
As another example consider the following nearly 15 years of a monthly electric bill. The first value is the oldest and the last the most recent. Here are the values.
Below is a snapshot of SmoothForecast.com using this
historical data. Note the auto-select feature is disabled by selecting None as
the metric. This series has a cycle but we are not certain what it is. So
the Automatic Seasonal Cycle radio button is selected allowing the system to
fill in the Seasonal Cycle text box when the Process Series button is selected.
Note the system determines the cycle to be 12 months (i.e. yearly cycle). 12
forecasts were requested and 12 held back for comparison purposes. Note the
Smooth Spikes checkbox is not selected in this example since it actually makes
the forecast worse. The way to see this is to select Smooth Spikes and compare
with the Holdback Results before and after selecting Smooth Spikes. The Series
Trend is set to Linear since there is a gentle slope upwards in the data. The
way to see which Series Trend works the best is to set the Number to Hold Back
to a value large enough to be a good sample of data and try the different Series
Trend selections looking at the Holdback Results. For seasonal data a good
value is some multiple of the Cycle. In this example a multiple of 1 was used
since 12 values is a good enough recent sample. Note the MedAPE (Median Absolute
Percentage Error) is about 11.3% which is a good match with the held back data
(and the MASE is significantly less than 1 suggesting a good model fit).
Form Field Descriptions
There are several form fields that need to be specified to use SmoothForecast.com to generate forecasts. Below is a description of each and whether the field is required or optional. Once these fields are completed, the Process Series button must be pressed to generate results. The accesskey 's' may be used as a keyboard shortcut. For example, if using the Chrome, Firefox, or Safari browser on MacOS, you can press control + option + s to press the Process Series button. On Windows, using Firefox, you can press alt + shift + s. If using Chrome, Edge, or Internet Explorer on Windows you can press alt + s.
· Series (required)
This field is where the historical data is entered. One numerical value is entered per line starting with the oldest value. Values can be pasted into the field and edited before processing or after processing for subsequent processing runs (i.e. when the Process Series button is pressed). The series is preserved in the field from one processing run to another. Also the data can be loaded from a file by selecting the Browse button on the Load Series File field. The series loaded from a file will be loaded into the Series field when the Process Series button is pressed.
· Load Series File (optional)
This is field is used to load a numerical series from a file selected using the Browse button. Any file loaded must be a text file and have one numerical value per line starting with the oldest value. The values will be loaded from the file and placed into the Series field when the Process Series button is selected.
· Auto-Select Metric (optional)
By picking a hold back metric to minimize over the hold back period, this drop down menu enables the automatic selection of forecast settings. This also requires the Number to Hold Back field to have a value of at least 2. If None is selected, the feature is disabled and user specified forecast settings are used. None is selected by default.
· Smooth Spikes (optional)
This checkbox, if selected, causes values considered outside the series moving average to be clipped back to the moving average value. The default is to have the checkbox selected.
· Seasonal Cycle Type (optional)
These radio buttons, if one is selected other than No Seasonal Cycle and Chaos, cause the forecast to be based on a regularly occurring cycle in the series (e.g. 12 months in a year, 7 days in a week) depending on what the series values represent (e.g. a month value, a day of the week value, etc). If the Automatic Seasonal Cycle radio button is selected, the Cycle value will be filled in automatically when the Process Series button is selected. If there is no detectable seasonal cycle, the Cycle value will be set to 0. If the Specify Seasonal Cycle radio button is selected, the Cycle value must also be specified indicating the size of the cycle (e.g. 12 for months in a year, 7 for days in a week, etc). If the Chaos radio button is selected, a series of semi-random oscillations are generated that best fit the hold back values. This assists in those cases where having forecast oscillations improves the quality of the forecast over using a straight line. The Automatic Seasonal Cycle radio button is selected by default.
· Cycle (required if Specify Seasonal Cycle is selected)
This value specifies the size of the regularly occurring cycle in the series. See Seasonal Cycle Type field for more information.
· Series Trend (optional)
This drop down menu indicates whether there is a trend in the data either Linear or Damped. A Linear trend places the forecast which may include seasonal cycles on a straight line either up or down. A Damped trend places the forecast which may include seasonal cycles on a curve that gently slopes up or down and then levels off. The default is to have no trend in the forecast.
· Number of Forecasts (required)
The number of forecasts to generate must be specified and be at least 1. Generally it is a good idea to specify enough forecasts so they can be visually compared with the series using the generated graph. The resulting forecasts are listed starting with the most recent value in the Forecasts text box. Also the resulting graph shows the forecast on the end of the series. If the series had Smooth Spikes selected, the history graph line is overlaid with a line showing the resulting series when the history has spikes clipped and that would be the data used to generate the forecast instead of the actual history.
· Number to Hold Back (optional)
The number to hold back, which must be at least 2, tells the system to generate forecasts using less recent data so the forecast can be compared with the more recent actual values that occurred. For example, if the Number to Hold Back is 12, the last 12 values in the series are set aside and 12 forecasts are generated using the series up to but not including those values set aside. The forecasts are then compared with the values set aside. The actual values are listed alongside the corresponding forecast in the Holdback Results text box. Also included are several metrics that can be used to gauge how well the forecasts matched the actual values. Each metric is described below:
§ MAPE (Mean Absolute Percentage Error)
This is the average of the absolute value of the difference between the forecast and the actual value divided by the actual value and is expressed as a percentage. This metric is a relative indicator of how far away the forecast is from the actual value. Like the RMSE it is subject to being skewed by outliers. And outlier effects can be reduced by selecting the Smooth Spikes checkbox.
§ MASE (Mean Absolute Scaled Error)
This is the average of the absolute value of the difference between the forecast and the actual value divided by the scale determined by using a random walk on the history prior to the holdback period. Instead of the history, this implementation uses a random walk on the actual values. This metric is a relative indicator of how well the forecast of the holdback period compares with the simplest model being used to forecast the history. The idea is to get a sense of how much better the forecast model behaves relative to the simplest possible forecast model. Ideally, the resulting ratio is less than 1.0 implying the forecast model is superior to a random walk. On the other hand, forecasting into a large holdback period has a larger likelihood of being incorrect in further holdback values. So values larger than 1.0 can be expected in those cases and still have reasonable forecasts. But if the MASE is large (e.g. greater than say 2.0) for holdback periods less than 10% of the size of the history, the model forecast is probably not that good.
§ MedAPE (Median Absolute Percentage Error)
This is the median (i.e. center value) of the absolute value of the difference between the forecast and the actual value divided by the actual value and is expressed as a percentage. This metric is a relative indicator of how far away the forecast is from the actual value. Unlike the MAPE value, this value is not affected by outliers. Consequently, it is the better metric to use and gives a good sense of how well spike smoothing has (or has not) improved the series for forecasting purposes.
§ RMSE (Root Mean Squared Error)
This metric value gives the square root of the average value of the squared difference between the forecast and the actual value. Having said that the metric is a good indicator of how far away the forecast is from the actual value in absolute terms. That is, the value is in the units of the series (e.g. dollars/cents, mileage, etc). Note the RMSE can be skewed by outliers in the series. Outlier effects can be reduced by selecting the Smooth Spikes checkbox.
§ SMAPE (Symmetric Mean Absolute Percentage Error)
This is the average of the absolute value of the difference between the forecast and the actual value divided by the absolute value of the actual value plus the absolute value of the forecast divided by 2 and is expressed as a percentage. This metric is a relative indicator of how far away the forecast is from the center of the actual and forecast values. The idea behind this metric is to have the resulting percentage be between 0% and 200%. However under and over forecasts are not given equal weight.
§ TotAPE (Total Absolute Percentage Error)
This is the ratio of the absolute value of the difference between the total forecast and the actual value total divided by the absolute value of the actual value total expressed as a percentage. This metric is a relative indicator of how far away the total forecast is from the actual value total. The idea behind this metric is to have an overall indication of the quality of the forecast when there is less concern about the individual forecast values.
Generally speaking it is a good idea to gauge a hold back forecast using 2 or more of the metrics listed. Usually the MASE and the MedAPE are the best indicators of the hold back forecast performance. Finally, note the graph line corresponding to the hold back forecast is overlayed on the historical values that it is forecasting to allowing a visual comparison between the actual values and the forecast.
· Process Series (required)
Click on the Process Series button to cause forecast results to be produced. If the field focus is not in the Series Input field, the Return key can be pressed to cause forecast results to be produced.
© 2010-2022 John Eldreth All rights reserved. Contact Us At email@example.com