Categories
Machine Learning

MLflow Python API

This article provides an overview of the MLflow Python API. It is intended for anyone who wants to become quickly familiar with the MLflow Python API and focuses more on giving a whirlwind tour focusing more on breadth, than depth. The MLflow API is so vast that I feel the best way to learn about […]

Categories
Machine Learning

Getting started with MLflow

This tutorial focuses on getting you started with MLflow. For the purposes of this tutorial, we will assume that you will be working on some form of Jupyter notebook. In my case, I use Google Colab. Step 1. Install MLFlow Our first step is to install the mlflow library. It is available on PyPI or […]

Categories
Data Engineering

How to setup ngrok tunnel for API running in Colab notebook

ngrok is a tool that allows you to expose your local development server to the internet, making it accessible from anywhere. In this article, we’ll walk you through the steps to set up an ngrok tunnel for an API running in a Colab notebook. Here are the steps: Step 1. Create an ngrok account To get […]

Categories
Machine Learning

Quick Intro to MLflow

What is MLflow? MLflow is an open source framework used to manage the machine learning lifecycle. It enables ML model development, deployment, tracking, and experimentation. It is part of the OpenML project. It is available in Python, Java, and Scala. Components of mlflow MLflow is composed of the following core components: Additional components include: Uses […]

Categories
Machine Learning

Fine Tuning and RAG Explained

As I continue my LLM learning journey, I have encountered terms such as fine tuning and retrieval augmented generation aka RAG. Initially the distinction between the two wasn’t clear to me. In this article I explain how they are different. Fine Tuning First, lets start with the less grandiose term Fine Tuning. Fine tuning is […]

Categories
Software Engineering

Isolating Conda Virtual Envs

Rationale For users and fans of Anaconda, conda provides a useful alternative to traditional virtualenv functionality in Python. However. there are some twists you need to be aware of in order to use it effectively. One of the features that have tripped me up is the fact that the virtual environment inherits the interpreter, binaries […]

Categories
Machine Learning

ChatGPT Function Calling

Problem Statement There are many questions a user wishes to ask that ChatGPT may be unable to answer. Many of these questions have to do with real-time scenarios or scenarios that are later than the available data on which the model was trained. Examples of such questions are: If we submit the 1st question to […]

Categories
Cloud Computing

Secrets access in Google Colab Notebooks

Introduction I have become an avid fan of Google Colab notebooks for many reasons many of which are outlined here. I use it for prototyping, esp. as I am doing more data science and generative AI learning, prototyping and showcasing projects. In order to access third party APIs such as OpenAI which hosts the GPT-3.5 and […]

Categories
Cloud Computing

AWS EC2 instance creation using Terraform

Introduction Amazon Elastic Cloud Compute (EC2) instances form the backbone of compute resources in AWS. They are virtual machines that can be provisioned in the cloud and utilized to perform compute tasks, run services such as AWS EMR, AWS ECS etc. Terraform is an infrastructure as code (IAC) tool that lets users define both cloud […]