statistics – OnData.blog

Linear Regression: Killer App with 19-century maths

I often feel the gap between the mainstream Data Science rhetoric and the true business needs is widening. When I hear of Hyperautomation, Edge AI, AutoML, or GANs, I challenge myself to take a leap back, understand our needs better.

Democratization of statistics: Chi2 for non-experts

I am big fan of advanced methods deployed to solve practical problems by ordinary users. Here is our recent achievement. My colleague, an experienced service desk manager, observed that the volume of work in his team has grown. He would

Practical AIOps: 5 use cases

In Sopra Steria we manage the IT infrastructure and applications of big clients. We process millions of service tickets and infrastructure events. This massive stream of data comes from monitoring tools such as Zabbix, Nagios, Solarwinds, and higher level frameworks:

Data Puzzle

Here is a new data puzzle, coming from my recent analytics in Sopra Steria. I will describe the problem, but not the answer. If you like the challenge, please contribute your thoughts in the comments. The title of the data

3 Steps to Unmask Data in Camouflage

I am looking at distribution of a certain data set (left). It has two peaks (this is called ‘bimodal’) therefore I suspect that those are two overimposed populations. How do I split the data, to rediscover the original two populations

The truth behind a histogram dent

Here is quite intriguing research with the data of our Sopra Steria IT operations (ITSM, AIOps, and Infrastructure Management). I’ve been faced with an interesting situation in an IT Applications Management project for a large corporate client. In such a

Coronavirus mortality: less than we think

Note 1: If you are looking for some COVID-19 conspiracy theories, go elsewhere. Below is only some boring statistics.