Hi!

Tech Stack

Languages & Core

AI/ML Frameworks

🤗

DevOps & Infrastructure

Specializations

LLM Training & Fine-tuning • Multi-GPU/Multi-node Training • Computer Vision • AI Infrastructure • MLOps

Posts

13 Jan 2026

Making Compression a Habit with zstd

Practical guide to using zstd compression in Python and Linux for data processing, log management, and reducing storage and transfer costs.

7 Jan 2026

Rust File I/O with Compression

Rust utility functions for transparent zstd compression and decompression when working with files.

1 Jan 2026

2025 -> 2026

My plan for 2026

30 Dec 2025

Running Multiple LLM Models with llamacpp Router

Guide to setting up the llamacpp router to manage multiple GGUF models, including embedding models and LLMs.

15 Dec 2025

Benchmarking LLM performance backends with Rust

Guide to LLM performance benchmarking with Rust, covering TTFT, ITL, and throughput metrics, plus llmperf-rs’s approach to metrics aggregation.

21 Nov 2025

Using hf tokenizers in Rust

Example of how to use Hugging Face Tokenizers in Rust.

21 Oct 2025

Serverless Webhooks with Cerebrium

Processing large PDF files often leads to HTTP timeouts. You send a document, wait, and the connection dies before processing completes. Cerebrium’s serverless platform solves this with custom FastAPI webhooks and built-in security.

12 Aug 2025

Ansible at Home

Using Ansible to automate and manage home environments, including server setups and configurations.

3 Oct 2024

Re-Writing a Text Deduplication Tool in Rust

Exploratory project to rewrite a text deduplication tool in Rust, comparing its performance and accuracy with the original Python implementation.