Datama Detect

logo detect

Datama DETECT aims at evaluating over time the normality of a variation.

This module allows to automate outlier identification on a given ratio, and understand the drivers of that anomaly. This is particularly useful in time series for being able to quickly spot issues in your KPIs overtime.

Datama DETECT is a data analysis tool that allows you to not only to detect potential outliers in your dataset but also to explain them using the Datama Compare algorithm.

You can use Datama DETECT to monitor your dataset and set up automatic monitoring of your main KPIs. If your dataset is big enough, Datama DETECT will show you potential outliers as well as give you the opportunity to further investigate your data.


Interface of Datama DETECT

You can also adjust how sensitive you want the detection to be using the “Confidence interval” slider in the settings.

By default, the most relevant outlier is selected, but you can also choose any point in your dataset that you would like to get insight for. This can be done by selecting it, or by directly clicking on the graph.


Dataset needed for Datama DETECT

Datama Detect needs at least 30 days in a the dataset to be able to evaluate a normality. Otherwise the dataset needs to have Dimensions and Metrics as for all solutions in Datama. A dataset used in Datama Detect could be used in the other solution as well.


Anomaly

In Detect, the dimension should be selected for assessing normality within your data. This dimension can be a date, category, or any other relevant data point.

Note: The chosen dimension should have at least 30 elements to ensure statistical significance and reliability in the analysis.

Baseline Establishment: Our system analyzes the volatility of the selected dimension by establishing a baseline. This baseline consists of ‘X’ number of previous data points.

Weighted Standard Deviation and Average: Using this baseline, the system calculates the weighted standard deviation and average for the specified KPI. This process involves giving different weights to the data points, typically placing more importance on recent data.

Confidence Interval Calculation: The system then uses these calculations to establish a confidence interval. This interval is based on the rules of normal distribution. It helps in identifying values that deviate significantly from what’s expected or normal, flagging them as anomalies.