Skip to content
OnData.blog

OnData.blog

Menu

  • Articles
  • About & Contact
  • Linkedin
  • Facebook
  • twitter
  • RSS

#datascience

Coronavirus mortality: less than we think

Note 1: If you are looking for some COVID-19 conspiracy theories, go elsewhere. Below is only some boring statistics.

Pawel Plaszczak March 29, 2020April 13, 2020 Articles, General Public 9 Comments Read more

The Data Lake at Sopra Steria

Following on my previous post, we have spent some time on building an internal Data Lake at Sopra Steria. The infrastructure is functional now and admitting its first users. Much has been said on building successful Data Science teams. Multidisciplinary

Pawel Plaszczak February 17, 2020February 18, 2020 Articles 4 Comments Read more

Why Analysts need Data Lakes?

Why Analysts need Data Lakes?

With substantial analytical needs at Sopra Steria Apps, we are looking to expand our Data Science environment. My thoughts go towards a Data Lake architecture, from a concrete angle, having practical requirements and knowing quite precisely what we want. I’ve

Pawel Plaszczak October 14, 2019October 14, 2019 Articles No Comments Read more

Recent Posts

  • Porting PyTorch neural network to Amazon AWS June 30, 2022
  • Porting pyTorch cloud detection model to Amazon AWS S3 June 17, 2022
  • pushing data to AWS. SageMaker sucks. So does Anaconda June 14, 2022
  • Linear Regression: Killer App with 19-century maths January 19, 2022
  • Democratization of statistics: Chi2 for non-experts January 12, 2022
  • An approach to categorize multi-lingual phrases December 15, 2021
  • The implications of Scikit-learn bug #21455 November 29, 2021
  • Your model may be inaccurate November 25, 2021
  • Answering Why (with Chi-Square) November 19, 2021
  • What makes Data Quality so difficult November 8, 2021
Copyright © 2023 OnData.blog. Powered by WordPress. Theme: Spacious by ThemeGrill.