This article provides an overview of the MLflow Python API. It is intended for anyone who wants to become quickly familiar with the MLflow Python API and focuses more on giving a whirlwind tour focusing more on breadth, than depth. The MLflow API is so vast that I feel the best way to learn about […]
Getting started with MLflow
This tutorial focuses on getting you started with MLflow. For the purposes of this tutorial, we will assume that you will be working on some form of Jupyter notebook. In my case, I use Google Colab. Step 1. Install MLFlow Our first step is to install the mlflow library. It is available on PyPI or […]
ngrok is a tool that allows you to expose your local development server to the internet, making it accessible from anywhere. In this article, we’ll walk you through the steps to set up an ngrok tunnel for an API running in a Colab notebook. Here are the steps: Step 1. Create an ngrok account To get […]
Quick Intro to MLflow
What is MLflow? MLflow is an open source framework used to manage the machine learning lifecycle. It enables ML model development, deployment, tracking, and experimentation. It is part of the OpenML project. It is available in Python, Java, and Scala. Components of mlflow MLflow is composed of the following core components: Additional components include: Uses […]
Fine Tuning and RAG Explained
As I continue my LLM learning journey, I have encountered terms such as fine tuning and retrieval augmented generation aka RAG. Initially the distinction between the two wasn’t clear to me. In this article I explain how they are different. Fine Tuning First, lets start with the less grandiose term Fine Tuning. Fine tuning is […]
Isolating Conda Virtual Envs
Rationale For users and fans of Anaconda, conda provides a useful alternative to traditional virtualenv functionality in Python. However. there are some twists you need to be aware of in order to use it effectively. One of the features that have tripped me up is the fact that the virtual environment inherits the interpreter, binaries […]
ChatGPT Function Calling
Problem Statement There are many questions a user wishes to ask that ChatGPT may be unable to answer. Many of these questions have to do with real-time scenarios or scenarios that are later than the available data on which the model was trained. Examples of such questions are: If we submit the 1st question to […]
Introduction I have become an avid fan of Google Colab notebooks for many reasons many of which are outlined here. I use it for prototyping, esp. as I am doing more data science and generative AI learning, prototyping and showcasing projects. In order to access third party APIs such as OpenAI which hosts the GPT-3.5 and […]
Introduction Amazon Elastic Cloud Compute (EC2) instances form the backbone of compute resources in AWS. They are virtual machines that can be provisioned in the cloud and utilized to perform compute tasks, run services such as AWS EMR, AWS ECS etc. Terraform is an infrastructure as code (IAC) tool that lets users define both cloud […]