When Pytorch Datapipes Meets GCS

Posted on Wed 06 July 2022 in posts pipeline • Tagged with python pytorch GCP

The reality when using pipeline infrastructure


Continue reading

Reflection on 2021

Posted on Fri 28 January 2022 in posts • Tagged with career

Reflecting on the my 2021 journey


Continue reading

Design patterns in ML and MLops

Posted on Sun 24 January 2021 in posts • Tagged with python mlops

My thoughts on an interview with Sara Robinson about Design patterns in ML, especially on pipelines


Continue reading

NaN Bites

Posted on Tue 29 December 2020 in dataanalysis posts • Tagged with python dataanalysis

Cleaning data from NaN's requires additional inputs especially when trying to work with sparse matrix in classification problems.


Continue reading

My 2cents worth after reviewing an academic project

Posted on Mon 21 December 2020 in posts • Tagged with python data_analysis

My 2cents after reviewing an academic project


Continue reading

Dataclasses in ML Pipelines

Posted on Sun 26 July 2020 in pipelines posts • Tagged with python MLpipeline

Tidy your ML pipelines with dataclasses.


Continue reading

Panel Dashboard in action

Posted on Sun 26 July 2020 in posts • Tagged with python dataviz

How we used Panel to monitor Dagster's assets.


Continue reading

Implementing a graph network pipeline with Dagster

Posted on Thu 09 July 2020 in posts • Tagged with python neo4j dagster

How to created a graph (nodes and edges) from separate data sources and import them into Neo4j


Continue reading

Warming up - My First Blog Post

Posted on Wed 08 July 2020 in posts

Getting started with my online blogging

This is my first post on my new blog.
I hope to be able to share my thoughts and knowledge in the various tech stacks that I'm involved in.

I'm going run on Pelican (a python static web site generator). There are numerous resources …


Continue reading