Skip to content
OnData.blog

OnData.blog

Menu

  • Articles
  • About & Contact
  • Linkedin
  • Facebook
  • twitter
  • RSS

data analytics

pushing data to AWS. SageMaker sucks. So does Anaconda

I did a lot of tech work on the infrastructure underlying my analytics over the past weeks. I am putting my notes here so they don’t get lost and maybe help someone. Here are three stories, unrelated to each other.

Pawel Plaszczak June 14, 2022June 14, 2022 Articles No Comments Read more

Linear Regression: Killer App with 19-century maths

I often feel the gap between the mainstream Data Science rhetoric and the true business needs is widening. When I hear of Hyperautomation, Edge AI, AutoML, or GANs, I challenge myself to take a leap back, understand our needs better.

Pawel Plaszczak January 19, 2022January 24, 2022 Articles No Comments Read more

What makes Data Quality so difficult

Garbage in, garbage out. Analysis of untrusted or poorly understood data will yield incorrect results. Hence the textbook approach is to clean the data first, and only then proceed with data analytics. For instance, in the data lakes, the data

Pawel Plaszczak November 8, 2021November 9, 2021 Articles No Comments Read more

Data Puzzle explained

For the Data Puzzle I posted last week, I received about a dozen of thoughtful and highly relevant answers. THANK YOU. I want to primarily thank to Luis Ruiz Santiago, Chetan Waman and anonymous J for comments under the previous

Pawel Plaszczak March 29, 2021March 30, 2021 Articles No Comments Read more

Recent Posts

  • Porting PyTorch neural network to Amazon AWS June 30, 2022
  • Porting pyTorch cloud detection model to Amazon AWS S3 June 17, 2022
  • pushing data to AWS. SageMaker sucks. So does Anaconda June 14, 2022
  • Linear Regression: Killer App with 19-century maths January 19, 2022
  • Democratization of statistics: Chi2 for non-experts January 12, 2022
  • An approach to categorize multi-lingual phrases December 15, 2021
  • The implications of Scikit-learn bug #21455 November 29, 2021
  • Your model may be inaccurate November 25, 2021
  • Answering Why (with Chi-Square) November 19, 2021
  • What makes Data Quality so difficult November 8, 2021
Copyright © 2023 OnData.blog. Powered by WordPress. Theme: Spacious by ThemeGrill.