Kubeflow is a standardized platform for building ML pipelines. It’s an open source ML toolkit, built on top of Kubernetes. Essentially, it translates Data Science work into Kubernetes jobs, and provides a cloud-native, multi-cloud, interface for ML libraries, frameworks, pipelines, and notebooks. This makes it simple to deploy ML pipelines via the containerized approach. The … Read more

Regularization and Early Stopping

Takes into account the model complexity when calculating the error. It’s a major field of ML research, but we are going to focus on L1 and L2 regularization. L2 vs L1 Regularization L1, and L2 regularization are so called parameter norm penalties. They both aim to penalize the loss function by introducing coefficients with different … Read more


Keras is a deep learning API developed by Google for implementing Neural Networks (NNs). It’s a high level Python API, meant to simplify NN implementation. Kerals calls itself “an API for human beings”, meaning that it aims to be intuitive to humans. More specifically, the Keras framework is implemented with the purpose of minimizing the … Read more

TensorFlow on Google Cloud

Introduction TensorFlow is an open-source, high performance library for numerical computation that uses directed acyclic graphs (DAGs). DAGs are simply graphs that flows in one direction, and can be used to represent various computations, such as addition, subtraction etc. A tensor is an N-dimensional array of data. It’s currently the most popular ML library on … Read more

Google Cloud Products

This is a summary of the list of products and services, provided by Google Cloud Provider (GCP). They will be categorized into different categories, depending on their use case. Storage Data Processing Development Machine Learning Products Notable ones. Bigquery, DataFlow, Pubsub.

ML Activation and Loss Functions

Common Activation Functions Placeholder Failure modes for Gradient Descent Problem Gradients can vanish Gradients can explode ReLu layers can die Insight Each Additional layer can reduce the signal vs. noise Learning rates are important here Monitor fraction of zero weights in TensorBoard Solution Using ReLu instead of sigmoid/tanh can help Batch normalization (useful knob) can … Read more

GCP MLE Study Material

Here I am listing the material needed to be able to complete the MLE exam. Most of these include an understanding of Google Cloud Products and Services, and when to use them. Hence, I will do concepts and product breakdowns. I hope this will help you on your way! NOTE, it is important understand the … Read more

Google Cloud Database Offerings

To understand the differences between the Database offerings, one has to have a basic understanding of how Databases work. I have narrowed it down to be able to grasp the following bullet points: What is SQL? Serverless vs. Managed Instance For Managed vs. Server Instance, I’ll just drop this image here:

Google Cloud Service Models

It is important to understand the differences between the service models, in order to figure out how the available software or cloud services work at their fundamentals and who will utilize them, and for what purposes. Basic Service Models There are four main service models: 1. Infrastructure as a Service (IaaS) : The Cloud Service … Read more