deepschool.ai
Home
Blog
Programming Tips
ML Consulting
DL Course
Categories
All
(37)
Bayesian
(2)
Deep Learning
(10)
Diffusion Models
(3)
Docker
(1)
LLM
(7)
Loss Function
(4)
MLOps
(1)
Machine Learning
(1)
Multimodal
(3)
Software Engineering
(1)
Statistics
(3)
Tabular Data
(1)
Unsupervised Learning
(2)
huggingface
(6)
politics
(1)
python
(1)
pytorch
(11)
Blog
Supercharging Structured Outputs with Open Source Models 🚀
LLM
This exploration compares three open-source models for extracting structured outputs, finding that while NuExtract performs best, larger models benefit most from KV caching for speed improvements. Despite some hallucinations in responses, optimized caching and structure flattening show promise for faster, more accurate outputs.
Nov 3, 2024
Sachin Abeywardana
Vison Language Models from Scratch
Deep Learning
LLM
huggingface
Multimodal
Using Small Language Models to with Small vision models to generate captions
Aug 11, 2024
Sachin Abeywardana
BuildKite + ArgoWF for ML Training Jobs
MLOps
MLOps: Leveraging ArgoWF and Buildkite to train models
Jul 31, 2024
Sachin Abeywardana
Prompt Caching: Poor man’s guide to zero shot vision-LLM classification
Deep Learning
LLM
huggingface
Using KV caching and logit ratios to speed up and control LLM/ VLM outputs.
Jun 29, 2024
Sachin Abeywardana
From DDPM to DDIM: A Mathematcal Deep Dive
Deep Learning
Diffusion Models
In this blog we will explore the DDIM paper with excruciating mathematical detail. In doing so, hopefully, bridge the gap between the two papers. While knowledge of DDPM is…
Feb 11, 2024
Why I am voting YES!
politics
Today walking past a voting booth I saw a person with a similar background to me wearing a “Vote No” shirt. Now I am well aware that there are around 20% of Aboriginal and…
Oct 14, 2023
Sachin Abeywardana
Dreambooth Tutorial
LLM
Multimodal
Diffusion Models
Dreambooth is a cool technique that lets you convert existing diffusion models to output personalized images. It’s one of the two main methods for fine-tuning diffusion…
Oct 1, 2023
Creating a Caption Model from Scratch
LLM
Multimodal
Before we dive into the details, I want to give you a heads up that this was just an experiment. The results leave a lot to be desired, and you can see for yourself by…
Aug 30, 2023
Annotated DDPM
Deep Learning
Diffusion Models
Training MNIST via DDPM
Apr 18, 2023
Sachin Abeywardana
Fine Tuning T5 for Grammar Correction
Deep Learning
LLM
Fine-tuning T5 for Sequence to Sequence tasks
Nov 7, 2022
Sachin Abeywardana
Fine Tuning GPT2 for Grammar Correction
Deep Learning
LLM
Fine-tuning GPT2 for Sequence to Sequence tasks
Sep 25, 2022
Sachin Abeywardana
Transformer Model Compression (Attempt)
Deep Learning
A failed attempt at model compression using student teacher learning
Aug 7, 2022
Sachin Abeywardana
Unit Testing for Data Science
python
Software Engineering
Python testing for Machine Learning
Jul 3, 2022
Sachin Abeywardana
Vector Database from Scratch
pytorch
Implementing Approximate Nearest Neighbours Oh Yeah (ANNOY)
Mar 22, 2022
Sachin Abeywardana
KMeans in PyTorch with Cosine Distance🥧🔦
pytorch
Implementing kmeans with cosine distance
Mar 20, 2022
Sachin Abeywardana
PyTorch prefetch or rather the lack of it
pytorch
How prefetch_factor did not help in streaming data
Feb 13, 2022
Sachin Abeywardana
Generating captions with ViT and GPT2 using 🤗 Transformers - Part 2
pytorch
huggingface
Using Encoder Decoder models in HF to combine vision and text
Jan 26, 2022
Sachin Abeywardana
Generating captions with ViT and GPT2 using 🤗 Transformers
pytorch
huggingface
Using Encoder Decoder models in HF to combine vision and text
Dec 28, 2021
Sachin Abeywardana
HuggingFace Tokenizers as Collate Functions Timing 🤗 🤖
pytorch
huggingface
Timing comparison of tokenizer as collate function and after batching
Nov 17, 2021
Sachin Abeywardana
Zero Shot Classification with Huggingface + Sentence Transformers 🤗 🤖
pytorch
huggingface
Fast Zero Shot classification of text
Oct 10, 2021
Sachin Abeywardana
DINO Self Supervised Vision Transformers
pytorch
Loss Function
Getting image embeddings with no negative samples
Aug 1, 2021
Sachin Abeywardana
PyTorch Image Patches
pytorch
Getting image patches for Visual Transformer
Jul 3, 2021
Collate function tutorial
pytorch
PyTorch Collate function tutorial
Jun 5, 2021
The Annotated TabNet
Tabular Data
Deep Learning
Creating TabNet from Scratch in Tensorflow 2.0
Apr 5, 2021
Sachin Abeywardana
Multilingual CLIP with Huggingface + PyTorch Lightning 🤗 ⚡
pytorch
Loss Function
Training OpenAI’s CLIP on google colab
Mar 7, 2021
Sachin Abeywardana
Tensorflow Learning Rate Finder
Deep Learning
Using Callbacks to get Optimal Learning Rate
Feb 15, 2021
Focal Loss for Multi-class Classification
Loss Function
Extending normal Focal Loss
Nov 28, 2020
Docker for Data Science
Docker
Docker is a tool that simplifies the installation process for software engineers. Coming from a statistics background I used to care very little about how to install…
Aug 24, 2017
DeepSchool.io
NodeSchool is one of the most inclusive software communities that I have come across. What I liked about it the most is its emphasis on writing code. There are so many…
Jul 4, 2017
Keras LSTMs
Deep Learning
Keras has been one of the really powerful Deep Learning libraries that allow you to have a Deep Net running in a few lines of codes. Best part, don’t worry about the math.…
Oct 20, 2016
Deep Learning Quantile Regression - Keras
Loss Function
The loss function is simple as doing the following. Which is simply the pin-ball loss function.
Oct 16, 2016
XgBoost - Machine Learning made EASY!
Machine Learning
One of the machine learning frameworks that has been exploding on the Kaggle scene has been Xgboost. In my personal experience it has been an extremely powerful machine…
Aug 8, 2016
Reversible jump MCMC
Bayesian
Unsupervised Learning
Reversible jump MCMC is a Bayesian algorithm to infer the number of components/ clusters from a set of data. For this illustration we shall consider a two component model at…
Oct 20, 2015
Chinese Restuarant Process
Bayesian
Unsupervised Learning
In this instance we generate the parameters
\[\theta_k\]
from
\[\mathcal{N}(\mathbf{0},3\mathbf{I})\]
. The data is generated from
\[\mathcal{N}(\theta_k,0.1\mathbf{I})\]
.…
Oct 9, 2015
von Mises-Fisher Distribution
Statistics
The von Mises Fisher Distribution is a multivariate distribution on a hyper sphere. I have decided to share the expectation and covariance of the vMF distribution. The…
Aug 10, 2015
Sample Variance
Statistics
People often question why is there a “n-1” term when I calculate the variance. Why not divide through by “n”. Most stats courses dismiss this question by saying, “oh, that’s…
Aug 6, 2015
Normal Distribution
Statistics
No stats blog would be complete without a discussion of the Gaussian distribution. In the following video I discuss how to obtain the mean and variance of a Gaussian. You do…
Aug 2, 2015
No matching items