3 years at PwC building production AI/ML systems for Fortune 500 clients. Specializing in RAG pipelines, multi-agent systems, and LLM applications — from architecture to HuggingFace deployment.
3-agent orchestration system (ML Predictor · Emergency Responder · Route Analyzer) running in parallel via LangGraph. Predicts failures 48 hours in advance with 99.8% accuracy, executes automated 5-step emergency protocol, and maintains sub-500ms response latency end-to-end.
End-to-end multi-PDF RAG pipeline with FAISS vector search across 10K+ chunks. Features a 3-tier LLM fallback chain (Groq → OpenAI → FLAN-T5), source attribution, chat history, and local hosting with no API key required. 10× faster than baseline.
Benchmarked 3 GPT-2 variants across 5 NLP metrics with 40% faster inference via ThreadPoolExecutor parallelism. DistilBERT achieves 89% accuracy. All 100+ benchmark sessions tracked persistently in SQLite with interactive Plotly visualizations.
AI classification pipeline using LangChain + GPT-4o with a deterministic override layer to assess legacy ABAP codebases for SAP S/4HANA migration. Engineered a 4-level token-aware fallback system — reducing consultant assessment time from weeks to minutes.