B.Tech Computer Science (AI & Data Science) · MIT-WPU, Pune · 2027
Building at the intersection of robotics, deep learning, and systems engineering.
Open to ML/AI research internships.
Zero-shot locomotion policy transfer across robot morphologies using a 31,582-parameter heterogeneous GNN (GATv2Conv + PPO), with YOLOv8 perception and LLM-guided navigation deployed on ROS2 Jazzy + Gazebo.
- Zero-shot quadruped → hexapod transfer (12-DOF → 18-DOF): 106 ± 25 reward, ~47 steps survival. MLP hard-fails with RuntimeError on any unseen morphology — fixed-input-dim structural limitation
- 500K-step fine-tuning: 3.8× reward gain (110 → 416 ± 114), survival 47 → 193 steps
- 85% fewer parameters than MLP baseline (31,582 vs 210,457), while MLP scores higher in-distribution only — GNN trades peak reward for architectural generalization
- Terrain robustness (zero-shot, no terrain training): 95% success at 5° slope, 0% at 10°
- Deployed at 200 Hz with yaw-rate PI correction (HAA joint offset) to eliminate circular drift from training bias
- LLM planning layer: Qwen 2.5 7B via Ollama, natural language → skill → GNN → joint commands, fully on-device
Polyp segmentation model combining attention gates and SE blocks with BCE-Dice loss. Evaluated on Kvasir-SEG. Dice: 0.9436.
Local-first Linux desktop AI daemon. Persistent daemon with UNIX socket IPC, X11/OCR context tracking, and streaming LLM inference via Ollama with online fallback.
- Intent router dispatches to typed executors (summarize, explain_error, search)
- Fault-tolerant streaming pipeline with full daemon-client lifecycle decoupling
- Internet-aware backend switching without daemon restart
STM32 firmware in C for Team Vegapod. Custom Bluetooth bootloader, FreeRTOS BMS/VCU braking, 3-phase inverter PWM, CAN bus. Represented MIT-WPU at European Hyperloop Week 2025.
ResNet-50 from scratch (86.63% on Imagenette, live ESP32-CAM inference) · Brain Tumor MRI Classification (98% accuracy, encoder-only Half-UNet)
metatensor/metatrain · Fixed zero-sized validation split edge case in ML training pipelines. PR #1003 merged January 2026.