Thank you for your visit. Briefly about me:

I have 20+ years of experience of working with data. I like the terms Data Analytics and Data Engineering. In more flashy terms this would be Big Data, Data Science, or perhaps Data Art? I have been fortunate to combine the perspective of a business executive, a contractor for large corporation, an employee of a government lab and a university teacher. I wrote one book. I would love to tag myself as an expert. For the moment, I like to say that I am a learner. is a random record of my work, restarted fresh in 2018. The purpose? It sometimes takes hours, or days, to develop a beautiful solution and use it once only. I think it is a waste. So, time permitting, I sometimes invest an extra hour to publish the ideas here, so others can benefit, get inspired or replicate my solutions if they feel it makes sense. If I helped you with something, please contact me, I will enjoy it and will respond individually.

My previous blog Big Data Matters has been active for a decade. However, having stepped down from upper management role in my old company, I took it down respecting the IPR considerations.

I enjoy sharing knowledge. I occasionally provide consulting and training. I enjoy public speaking opportunities if only I can become useful. Email me at pp [at], or through linkedin below.


To remain up to date

Follow us on facebook, or email newsletter, or both.

Here is Facebook page associated with this blog. Follow it to get notified about future posts. Also, your comments are welcome there. Alternatively, use the form below to subscribe to a low-traffic newsletter. You will be notified only when new articles are created. Unsubscribe at any time.

4 thoughts on “About & Contact

  • December 20, 2018 at 9:58 pm

    Are the lectures recorded and available in video format for Big Data in 30 hours?

    • December 26, 2018 at 8:32 am

      Thanks for asking. No, we have not been recording the sessions. Taking aside the fact that it’s currently work-in-progress, as I’ve been adding material on the fly as the lectures progress, there is reason to not put the lectures online. As a matter of fact, much of the Data Engineering/Data Science material is already available online elsewhere (such as basic BI, Hadoop, Spark, Kafka, Scikit-learn, Keras tutorials). I suppose the difficulty for a remote learner lies not in the absence of online tutorials, but just the opposite, in the overwhelming quantity of those. Therefore instead of adding yet another similar resource, here I’d prefer to create a step-by-step guide helping you move forward and keep your study on target, using what’s already available elsewhere. I realize it’s not fully organized yet. Will do my best to improve with cleaning and organizing stuff. Critical comments are most welcome. This one is certainly much apppreciated, thank you.

  • January 5, 2019 at 8:31 pm

    Hello, my name is Amos Bunde and I am from Kenya. I am a data scientist in the making. Your materials look detailed and here I mostly do the training online.
    Please help me register for the course 30hrs.

    • January 8, 2019 at 2:44 pm

      hello Amos, I appreciate your interest. Big Data in 30 Hours is not precisely an online class so there is no online enrollment curently. I am teaching the class for various clients (most lately Krakow University of Technology), which means I travel onsite to be present physically. I am happy to teach locally worldwide if invited. However, I am also putting a lot of the materials online so hopefully the more motivated remote users can also benefit from that. Additionally, there is project discussion group at linkedin, where you can become a member and discuss with others. The links to the online material and to the discussion board are here, enjoy!


Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.