OnData.blog


statistics

Linear Regression: Killer App with 19-century maths

I often feel that the gap between mainstream Data Science rhetoric and true business needs is widening. When I hear of Hyperautomation, Edge AI, AutoML, or GANs, I challenge myself to take a step back and understand our needs better.

Pawel Plaszczak | January 19, 2022 | January 24, 2022 | Articles | No Comments

Democratization of statistics: Chi2 for non-experts

I am a big fan of advanced methods deployed to solve practical problems by ordinary users. Here is our recent achievement. My colleague, an experienced service desk manager, observed that the volume of work in his team had grown. He would

Pawel Plaszczak | January 12, 2022 | January 13, 2022 | Articles | No Comments

Practical AIOps: 5 use cases

At Sopra Steria we manage the IT infrastructure and applications of big clients. We process millions of service tickets and infrastructure events. This massive stream of data comes from monitoring tools such as Zabbix, Nagios, and SolarWinds, and from higher-level frameworks:

Pawel Plaszczak | June 8, 2021 | June 14, 2021 | Articles | No Comments

Data Puzzle

Here is a new data puzzle from my recent analytics work at Sopra Steria. I will describe the problem, but not the answer. If you like the challenge, please contribute your thoughts in the comments. The title of the data

Pawel Plaszczak | March 24, 2021 | March 29, 2021 | Articles | 4 Comments

3 Steps to Unmask Data in Camouflage

I am looking at the distribution of a certain data set (left). It has two peaks (this is called ‘bimodal’), so I suspect that it consists of two superimposed populations. How do I split the data to rediscover the original two populations

Pawel Plaszczak | June 29, 2020 | November 20, 2021 | Articles | No Comments

The truth behind a histogram dent

Here is some quite intriguing research on data from our Sopra Steria IT operations (ITSM, AIOps, and Infrastructure Management). I faced an interesting situation in an IT Applications Management project for a large corporate client. In such a

Pawel Plaszczak | June 19, 2020 | June 29, 2020 | Articles | No Comments

Coronavirus mortality: less than we think

Note 1: If you are looking for COVID-19 conspiracy theories, go elsewhere. Below are only some boring statistics.

Pawel Plaszczak | March 29, 2020 | April 13, 2020 | Articles, General Public | 9 Comments
