Skip to main content

Root Cause Analysis

Introduction

Root cause analysis (RCA) can provide valuable information regarding forecasting results. Users have the ability to initiate the calculation of RCA results for a completed forecasting job by accessing the RCA endpoint. This functionality allows users to understand the factors that contribute to the forecast's appearance and gain insights into the construction of the Model Zoo.

Output

In the example provided below, there is a table representing the output of the root cause analysis:

timestampterm_1term_2term_3term_Nyhat_1yhat_2yhat_3yhat_Npredictor_1predictor_2predictor_3predictor_N
2014-10-25T04:00:00.02546900943.0562419431984198732961443198412871396
2014-10-26T04:00:00.0245150005409.623421952104208931232943158429871496
2014-10-27T04:00:00.0210320065363.41232211219021682762214219842987996
2014-10-28T04:00:00.02301100543.55452189215421674153643198419871996
2014-10-29T04:00:00.0222543298332125672592259831121143148419871996
2014-10-30T04:00:00.0215543551235.613425322490248741234943198419871996

Interpreting root cause analysis results

First and foremost, it is important to note that each forecast value can be generated by a different model within a Model Zoo. When examining the forecast and comprehending its construction, it is necessary to restrict the view to only other forecasts generated by the exact same model. This is why the model's index is a required parameter in the RCA endpoint. Each model's terms are additive, allowing for a clear understanding of the individual impact of each term on the forecast. There are three distinct views available: the nominal term view, the relative term view, and the predictor view.

The nominal term view provides precise information regarding the contribution of each term to the estimated forecast value. The relative term view presents a slightly different decomposition of the forecast, facilitating a better geometrical understanding of how the model gradually takes shape by adding terms. The predictor view aggregates the impact of each predictor across all terms in the model, offering information on the contribution of each predictor to the forecast value.

Nominal term_i

The value of the i-th term of the model of a chosen model_index used to obtain the forecast. The term can be found in the Model Zoo by model_index and the order number i. It is essential to mention that the term1 of the model with the _model_index 1 is different than the term1 of the model with the _model_index 2 - they are two separate models and have different terms.

NOTE: For a given timestamp t, the sum of the terms equals the forecast value.

Relative yhat_i

This essentially equals the forecast which would be obtained if the model only consisted of the first i terms (different from the sum of the first i terms). The forecasting error thus decreases with increasing i, showing the gradual build-up of the model. The nominal view of terms does not satisfy this property. This property visualizes how important individual terms are for the final forecast and how they influence it. If something goes wrong, this allows users to easily identify which term is responsible.

NOTE: For a given timestamp t, the last yhat (cumulation of all yhat's) equals the forecast value.

Aggregated predictor_i

It reveals the involvement of i-th predictor in the forecast value for a given data point. This is a straightforward way to figure out the impact of a given predictor on the forecast value.

NOTE: For a given timestamp t, the sum of the predictors equals the forecast value.