Data Scientist · AI Builder · Researcher
Data scientist and ML researcher with 4 peer-reviewed publications. Currently building MindMirror, a personal AI that learns not just what you know, but how you think.
About
I'm a data scientist and AI builder with an MS in Computer Science (Data Science specialization) from Seattle University and a BE in Computer Engineering from the University of Pune. My research lives at the intersection of data quality and model behavior — the idea that better AI starts with better data, not just bigger models.
As part of a research team at Seattle University, I contributed to work on synthetic data generation for class-imbalanced medical datasets, published across IEEE Access, DaWaK, and DASFAA. Currently I'm building MindMirror — a personal AI memory system that captures how you think, not just what you know.
Outside of building, I create content on self-discovery and unlearning on Instagram, and I'm learning German — because apparently one language wasn't enough of a challenge.
Experience
May 2025 — Present
Data Analyst
Boston Financial Advisory Group
Building analytical systems and data pipelines for financial advisory workflows.
Feb. 2023 — Apr. 2025
Data Science Graduate Researcher
Seattle University
Contributed to the SDGnE research team, synthetic data generation for class-imbalanced medical datasets, published across IEEE Access, DaWaK, and DASFAA. Developed hands-on ML expertise on Jetstream2/NVIDIA A100 GPU infrastructure.
Jan. 2020 — Jan. 2022
Software Engineer
M.B.B. Consulting
Architected REST APIs and a Flask backend connecting client applications to PostgreSQL at 99.9% uptime, cutting quote generation time by 50%. Automated ETL pipelines processing 1M+ records with Python, SQL, and AWS — and ran A/B tests that contributed to $40K in projected annual gains.
Now — June 2026
Building
MindMirror: A personal AI memory system
Current obsession
Can an AI learn not what you know, but how you think?
Currently reading
The Mountain Is You by Brianna Wiest
Learning
German
Working toward
AI/ML engineering & applied research roles
Writing
First post — building MindMirror in public
Projects
MindMirror
A personal AI system that learns how you think, not just what you know. Uses RAG and structured personal context to make every interaction feel like working with someone who has known you for months — without re-explaining yourself every time.
Published · Open sourceSDGnE Python Package
Open source Python package stemming from the SDGnE research project — lets users generate synthetic data from our designed algorithm for rare event and imbalanced classification tasks. Published research, usable tool.
View docs ↗ Personal projectTrail Recommendation AI Agent
An end-to-end AI agent that monitors calendar events, retrieves real-time weather data, and reasons across a personal trail database to deliver context-aware hiking recommendations — demonstrating full agent orchestration with tool use, memory, and multi-API reasoning.
Personal projectRAG Chatbot with Agentic Pipeline
A context-aware RAG chatbot built with LangChain and LLaMA3 fine-tuned with LoRA. Focused on production readiness — evaluating outputs critically, not just getting something that runs.
Research · PublishedWalkExplorer
A cloud-hosted multimodal AI tool on GCP using CLIP transformers and OpenStreetMap data to assess urban walkability. Benchmarked against human ratings with automated test validation — published at DASFAA 2026.
Read paper ↗ Learning projectTransformer LLM from Scratch
Trained a GPT model on the Shakespeare dataset using nanoGPT with character-level tokenization and AdamW. Achieved validation loss ~1.8 — built to understand transformer architecture and ML math from first principles, not just use the API.
Research & Publications
IEEE Access · 2025
Synthetic Data Generation and Evaluation Techniques for Classifiers in Data Starved Medical Applications
DASFAA · 2026
Content-Based vs. Similarity-Based Deep Learning Approaches for Walkability Assessment
DaWaK · 2024
Incremental SMOTE with Control Coefficient for Classifiers in Data Starved Medical Applications
DASFAA · 2024
SDGnE: A Synthetic Data Generation and Evaluation System for Rare Event Prediction
Reading
Books, papers, essays — things that have shaped how I think about AI, building, and being human. Updated whenever something genuinely moves me.
Book
The Mountain Is You
On self-sabotage and why we get in our own way — uncomfortably relevant.
Book
Add a book that changed how you think
One sentence on what it gave you.
Paper
Add an Anthropic or AI paper that genuinely interested you
Why it stuck with you.
Essay
Add an essay or article that shaped your thinking
What it changed for you.
Writing
I write about building AI products, thinking in public, and the honest reality of transitioning into ML. No tutorials — just observations from someone figuring it out in real time.
Let's connect
I'm currently open to AI/ML engineering, applied research, and data science roles. If you're building something interesting or just want to talk AI — I'd love to hear from you.