PRASAD K

Background

About

I'm an AI/ML Engineer with 4+ years building production-grade intelligent systems. My work sits at the intersection of Generative AI, backend engineering, and data — designing pipelines and platforms that are technically sound and genuinely useful at scale.

Currently at Evernorth in Hartford, CT, I'm working on RAG-based AI platforms for healthcare prior authorization. Before that I built fraud detection and risk scoring infrastructure at Fiserv, and customer intelligence models at Jio Platforms in India.

I hold a Master's in Computer Science from the University of Massachusetts, Lowell. My technical focus spans LLM orchestration, retrieval systems, and the full MLOps lifecycle — with a strong commitment to responsible AI and compliance in regulated industries.

Generative AI & LLMs

OpenAI · Anthropic Claude · LangChain · LangGraph · RAG · Prompt Engineering

ML & Deep Learning

PyTorch · TensorFlow · XGBoost · HuggingFace Transformers · BERT · scikit-learn

Data Engineering

PySpark · Apache Kafka · Airflow · Databricks · Snowflake · BigQuery

MLOps & Cloud

AWS · Azure · GCP · Docker · Kubernetes · MLflow · GitHub Actions

Backend & APIs

Python · FastAPI · Flask · TypeScript · REST APIs · Terraform

Experience

Work

Designed and deployed a production RAG platform integrating large language models for healthcare prior authorization workflows, enabling grounded retrieval from clinical policy and drug formulary data.

Architected LangGraph-based multi-agent systems to automate eligibility verification, formulary lookups, and coding validation at scale, backed by secure Python microservices with PHI compliance.

Led optimization efforts that reduced token consumption and latency by 35%, and established evaluation pipelines that improved grounded-answer quality by 20% across benchmarked sets.

Python Azure LLMs

Built and maintained ML serving infrastructure for real-time fraud detection and risk scoring, supporting high-throughput transaction analytics at fintech scale.

Developed end-to-end pipelines for model training, evaluation, deployment, and automated retraining — migrating legacy models to cloud-native infrastructure with a 35% reduction in prediction latency.

Collaborated with compliance and audit stakeholders to implement model explainability frameworks and maintain governance artifacts aligned to regulatory requirements.

Python AWS Machine Learning

Built predictive models and NLP pipelines for customer analytics — covering churn prediction, multilingual sentiment analysis, and subscriber segmentation across large telecom datasets.

Designed scalable feature engineering pipelines processing petabyte-scale network and usage data, reducing data preparation time significantly via distributed PySpark jobs.

Deployed ML models as REST APIs and maintained interactive dashboards surfacing subscriber health metrics and NLP-derived sentiment trends for weekly stakeholder reviews.

Python PySpark ML

Selected work

Projects

Production RAG system for clinical prior authorization enabling grounded retrieval from 20K+ indexed healthcare records using hybrid vector and BM25 keyword search.

Azure OpenAI LangGraph FastAPI Azure AI Search Docker / AKS

End-to-end ML pipeline for real-time transaction anomaly detection. Processes high-volume event streams with Apache Kafka and Spark, serving predictions at sub-100ms latency via a scalable cloud API.

Python Apache Kafka PySpark AWS SageMaker Docker

Fine-tuned multilingual BERT model for classifying customer support queries across Hindi and regional Indian languages, deployed as a containerised REST API serving predictions at scale.

PyTorch HuggingFace Multilingual BERT FastAPI Kubernetes

FastAPI microservice that ingests, parses, and semantically indexes documents using LLMs, exposing a conversational Q&A interface over private knowledge bases with citation enforcement.

FastAPI OpenAI API Pinecone LangChain Docker

Self-contained MLOps framework for automated model retraining, data drift detection, and zero-downtime deployment — with experiment tracking and Slack alerting for production monitoring.

Apache Airflow MLflow Kubernetes GitHub Actions Python

Multi-agent research tool built with LangGraph that autonomously searches, reads, and synthesizes academic papers into structured summaries with citation tracing and confidence scoring.

LangGraph Anthropic Claude ChromaDB FastAPI Streamlit

Let's connect

Schedule a call.

Let's talk. Pick a time that works for you.

Book a time →

prasadk5841@gmail.com LinkedIn ↗

PRASAD K

About

Generative AI & LLMs

ML & Deep Learning

Data Engineering

MLOps & Cloud

Backend & APIs

Work

Evernorth

Fiserv, Inc.

Jio Platforms Limited

Projects

Healthcare Prior Auth RAG

Real-Time Fraud Detection Pipeline

Multilingual Intent Classifier

LLM Document Intelligence API

MLOps Pipeline Orchestrator

AI Research Assistant

Schedule a call.