About

I'm an AI/ML Engineer with 4+ years building production-grade intelligent systems. My work sits at the intersection of Generative AI, backend engineering, and data — designing pipelines and platforms that are technically sound and genuinely useful at scale.

Currently at Evernorth in Hartford, CT, I'm working on RAG-based AI platforms for healthcare prior authorization. Before that I built fraud detection and risk scoring infrastructure at Fiserv, and customer intelligence models at Jio Platforms in India.

I hold a Master's in Computer Science from the University of Massachusetts, Lowell. My technical focus spans LLM orchestration, retrieval systems, and the full MLOps lifecycle — with a strong commitment to responsible AI and compliance in regulated industries.

Generative AI & LLMs

OpenAI · Anthropic Claude · LangChain · LangGraph · RAG · Prompt Engineering

ML & Deep Learning

PyTorch · TensorFlow · XGBoost · HuggingFace Transformers · BERT · scikit-learn

Data Engineering

PySpark · Apache Kafka · Airflow · Databricks · Snowflake · BigQuery

MLOps & Cloud

AWS · Azure · GCP · Docker · Kubernetes · MLflow · GitHub Actions

Backend & APIs

Python · FastAPI · Flask · TypeScript · REST APIs · Terraform

Work

Evernorth

AI Engineer
Hartford, CT July 2025 — Present

Designed and deployed a production RAG platform integrating large language models for healthcare prior authorization workflows, enabling grounded retrieval from clinical policy and drug formulary data.

Architected LangGraph-based multi-agent systems to automate eligibility verification, formulary lookups, and coding validation at scale, backed by secure Python microservices with PHI compliance.

Led optimization efforts that reduced token consumption and latency by 35%, and established evaluation pipelines that improved grounded-answer quality by 20% across benchmarked sets.

Python Azure LLMs

Fiserv, Inc.

Software Engineer — ML
Berkeley Heights, NJ Apr 2024 — Jun 2025

Built and maintained ML serving infrastructure for real-time fraud detection and risk scoring, supporting high-throughput transaction analytics at fintech scale.

Developed end-to-end pipelines for model training, evaluation, deployment, and automated retraining — migrating legacy models to cloud-native infrastructure with a 35% reduction in prediction latency.

Collaborated with compliance and audit stakeholders to implement model explainability frameworks and maintain governance artifacts aligned to regulatory requirements.

Python AWS Machine Learning

Jio Platforms Limited

Data Scientist
Mumbai, India Mar 2021 — Aug 2023

Built predictive models and NLP pipelines for customer analytics — covering churn prediction, multilingual sentiment analysis, and subscriber segmentation across large telecom datasets.

Designed scalable feature engineering pipelines processing petabyte-scale network and usage data, reducing data preparation time significantly via distributed PySpark jobs.

Deployed ML models as REST APIs and maintained interactive dashboards surfacing subscriber health metrics and NLP-derived sentiment trends for weekly stakeholder reviews.

Python PySpark ML

Projects

Healthcare Prior Auth RAG

Production RAG system for clinical prior authorization enabling grounded retrieval from 20K+ indexed healthcare records using hybrid vector and BM25 keyword search.

Azure OpenAI LangGraph FastAPI Azure AI Search Docker / AKS

Real-Time Fraud Detection Pipeline

End-to-end ML pipeline for real-time transaction anomaly detection. Processes high-volume event streams with Apache Kafka and Spark, serving predictions at sub-100ms latency via a scalable cloud API.

Python Apache Kafka PySpark AWS SageMaker Docker

Multilingual Intent Classifier

Fine-tuned multilingual BERT model for classifying customer support queries across Hindi and regional Indian languages, deployed as a containerised REST API serving predictions at scale.

PyTorch HuggingFace Multilingual BERT FastAPI Kubernetes

LLM Document Intelligence API

FastAPI microservice that ingests, parses, and semantically indexes documents using LLMs, exposing a conversational Q&A interface over private knowledge bases with citation enforcement.

FastAPI OpenAI API Pinecone LangChain Docker

MLOps Pipeline Orchestrator

Self-contained MLOps framework for automated model retraining, data drift detection, and zero-downtime deployment — with experiment tracking and Slack alerting for production monitoring.

Apache Airflow MLflow Kubernetes GitHub Actions Python

AI Research Assistant

Multi-agent research tool built with LangGraph that autonomously searches, reads, and synthesizes academic papers into structured summaries with citation tracing and confidence scoring.

LangGraph Anthropic Claude ChromaDB FastAPI Streamlit

Schedule a call.

Let's talk. Pick a time that works for you.

Book a time