Hi, Iβm Pravallika! π
ML Engineer | RAG Systems & LLM Applications
ML Engineer combining 2+ years enterprise data engineering with hands-on AI/ML development. Built production systems processing 50TB+ data for Fortune 500 clients at PwC, now applying this expertise to modern LLM applications and RAG architectures. Deployed 3 live ML applications (computer vision, NLP, RAG chatbots) using PyTorch, LangChain, and Transformers.
Unique strength: Understanding both messy real-world data AND cutting-edge AI models.
Currently seeking: Remote ML engineering role to build scalable AI systems.
π₯ Featured Projects
π€ RAG Chatbot System
Production document Q&A using retrieval-augmented generation
Built end-to-end RAG pipeline with LangChain, FAISS vector database, and FLAN-T5 LLM. Implements semantic search, intelligent chunking, and hallucination prevention through prompt engineering.
- π― Tech: LangChain, FAISS, FLAN-T5, Sentence-Transformers, PyPDF
-
π Live Demo GitHub Details β
πΌοΈ Image Classification System
Transfer learning with ResNet50 for real-time image classification
Leveraged pre-trained ResNet50 (ImageNet: 1.2M images, 1000 categories) achieving 95.4% top-5 accuracy. Optimized inference with torch.no_grad() for 50% memory reduction and 30% speed improvement.
- π― Tech: PyTorch, torchvision, ResNet50, Gradio
-
π Live Demo GitHub Details β
π° News QnA Pipeline
Multi-model NLP system: retrieval β summarization β Q&A
Three-stage pipeline combining NewsAPI integration, BART summarization, and DistilBERT question answering. Implements singleton pattern for efficient model loading and memory management.
- π― Tech: HuggingFace Transformers, BART, DistilBERT, NewsAPI
-
π Live Demo GitHub Details β
π οΈ Technical Skills
Production ML: LLM Applications (RAG, LangChain, Prompt Engineering, FAISS) β’ Deep Learning (PyTorch, CNNs, Transfer Learning) β’ NLP (Transformers: BART, FLAN-T5, DistilBERT) β’ Deployment (HuggingFace Spaces, Gradio, FastAPI, Docker)
Data Engineering: Enterprise Pipelines (2 years SAP migration: Syniti, BODS, LTMC) β’ ETL Workflows β’ Alteryx β’ SQL β’ Data Validation (50TB+ datasets)
Development: Python β’ Git/GitHub β’ Jupyter β’ VS Code β’ Linux β’ CI/CD β’ Agile
πΌ Professional Experience
| Data & ML Engineer @ PwC | Mar 2023 - Present |
- Architected data validation pipelines processing 50TB+ enterprise data for 3 Fortune 500 clients
- Automated reconciliation reporting with Alteryx, reducing manual validation time by 40%
- Built anomaly detection workflows catching 25% more data quality issues than manual review
- Designed ETL pipelines handling diverse data formats ensuring ML-ready data quality
π Education & Certifications
B.E. Computer Science & Engineering (Hons.) - Chandigarh University (2019-2023)
Certifications: Crash Course on Python (Google/Coursera) β’ Intermediate Machine Learning (Kaggle) β’ Associate Generative AI Engineer (SAP)
π« Letβs Connect
Iβm actively seeking remote ML engineering opportunities with international teams!
- πΌ LinkedIn: linkedin.com/in/pasala-pravallika
- π» GitHub: github.com/Prav-allika
- π€ HuggingFace: huggingface.co/Prav04
- π§ Email: pravallipasala@gmail.com
- π± Phone: +91 6305790358
π Remote Work Ready
β
2+ years remote collaboration with international teams at PwC
β
High-speed internet (100+ Mbps) and dedicated home office
β
Available 5+ hours daily overlap with US/EU timezones
β
Strong async communication: detailed documentation, clear Git commits
β
Fluent English, Hindi, Telugu
Built with β€οΈ using GitHub Pages
Last updated: December 2024