OnData.blog

t-SNA

An approach to categorize multi-lingual phrases

I have 130,000 help desk tickets with multi-lingual descriptions. I need to divide this set into categories, such as “password reset”, “license expired”, or “storage failure”. Why? Users could then allocate a category to a new ticket they create. Then

Pawel Plaszczak, December 15, 2021, Articles

How to isolate data that constitutes a spike in histogram?

We would all love to spot business problems early on, to react before they become painful. You can learn a lot by looking at past problems. Hence, understanding the nature of anomalies in data can bring substantial operational benefits and

Pawel Plaszczak, October 1, 2019 (updated November 20, 2021), Articles

Recent Posts

  • Linear Regression: Killer App with 19-century maths January 19, 2022
  • Democratization of statistics: Chi2 for non-experts January 12, 2022
  • An approach to categorize multi-lingual phrases December 15, 2021
  • The implications of Scikit-learn bug #21455 November 29, 2021
  • Your model may be inaccurate November 25, 2021
  • Answering Why (with Chi-Square) November 19, 2021
  • What makes Data Quality so difficult November 8, 2021
  • Don’t trust Data Science. Ask the people October 24, 2021
  • Mistaken by factor of 100,000 October 14, 2021
  • Practical AIOps: 5 use cases June 8, 2021
Copyright © 2022 OnData.blog.