Senior Machine Learning Engineer specializing in Computer Vision, Deep Learning, and scalable AI inference systems — building things that see and understand the world.
I'm a Senior ML Engineer with 7+ years of experience building and deploying machine learning systems at scale. My work sits at the intersection of computer vision, deep learning, and production engineering.
From real-time video pipelines processing thousands of camera feeds to inference servers handling high-throughput batched requests — I specialize in taking research-grade models and turning them into robust, production systems.
I've spoken at NVIDIA GTC twice, sharing work on object detection at 1770 FPS and geometric deep learning with TensorRT.
High-performance deep learning inference pipeline for real-time video processing. Deployed globally, monitoring 1000+ cameras with state-of-the-art object detection and classification.
Cross-camera person re-identification by matching feature vectors. Reduced execution time from 72 hours → 3 hours via multiprocessing and multithreading optimizations.
Video pipeline detecting unbilled goods at checkout. Deployed across 200+ registers in US and Canada, significantly reducing shrinkage and improving inventory accuracy.
Real-time vehicle number plate identification deployed across multiple sites in India, enhancing security through accurate vehicle logging in varied conditions.
Oriented bounding box detection pipeline monitoring elderly residents in care homes. Detects falls and unusual activity, alerting caregivers in real time.
Inference API server supporting multiple models on NVIDIA Triton with batched inference, optimizing GPU utilization for high-throughput production AI workloads.
RASA-based chatbot for tax-related queries deployed on AWS EC2, combining NLU model training with conversational AI for accurate, user-friendly assistance.
Pose estimation pipeline for retail analytics — estimates customer interactions and shopping behavior to help optimize store layouts and improve experience.