Hi!
Tech Stack
Languages & Core
AI/ML Frameworks
DevOps & Infrastructure
Specializations
LLM Training & Fine-tuning • Multi-GPU/Multi-node Training • Computer Vision • AI Infrastructure • MLOps
Posts
13 Jan 2026
Making Compression a Habit with zstd
Practical guide to using zstd compression in Python and Linux for data processing, log management, and reducing storage and transfer costs.
7 Jan 2026
Rust File I/O with Compression
Rust utility functions for transparent zstd compression/decompression when working with files
1 Jan 2026
My plan for 2026
30 Dec 2025
Running Multiple LLM Models with llamacpp Router
Guide to setting up llamacpp router for managing multiple GGUF models including embeddings and LLMs
15 Dec 2025
Benchmarking LLM performance backends with rust
Guide to LLM performance benchmarking with Rust, covering TTFT, ITL, throughput metrics, and llmperf-rs’s approach to metrics aggregation
21 Nov 2025
Example of how to use Tokenizers from Huggingface in Rust
21 Oct 2025
Serverless Webhooks with Cerebrium
Processing large PDF files often leads to HTTP timeouts. You send a document, wait, and the connection dies before processing completes. Cerebrium’s serverless platform solves this with custom FastAPI webhooks and built-in security.
12 Aug 2025
Using Ansible to automate and manage home environments, including server setups and configurations.
3 Oct 2024
Re-Writing a Text Deduplication Tool in Rust
Exploratory project to re-write a text deduplication tool in Rust, comparing performance and accuracy with the original Python implementation.