Articles and thoughts on AI, Machine Learning, and Software Engineering
AI Researcher, Building Generative AI solutions at Scale
Building AI that actually works! Currently deep into Vision-Language Models and Agentic Systems, with hands-on experience taking AI projects from wild ideas to real products. Love tinkering with model fine-tuning and cloud deployments. Big open-source enthusiast - you'll find me contributing to projects that make AI more accessible to everyone.
The technical journey of creating a performant bilingual LLM for low-resource languages with limited training data.
Deep dive into the underlying architecture of LLama3 and the structural changes between 7B and 8B parameter models.
A low-code guide to fine-tuning LLMs efficiently using the Axolotl library.
A comprehensive guide to fine-tuning Google's Gemma model with practical examples and best practices.
Unleashing the power of Mixtral: A comprehensive guide to fine-tuning this powerful mixture-of-experts model.
A practical guide and colab notebook to quantize LLMs in GGUF format to run them efficiently on your local machine.
A guide to quantizing LLMs using Activation-Aware Quantization (AWQ) on a Google Colab notebook for optimal performance.
A step-by-step guide to deploying Mistral or Llama models on AWS in just three simple stages.
Fine-tuning the Mistral 7B model for code generation using a single Google Colab notebook.
A journey into fine-tuning LLama2 to create an AI companion with personality and conversational abilities.