AI Engineer specializing in LLM systems, RAG pipelines, and Agentic AI. Building production-ready AI solutions with LangChain, FastAPI, and vector databases. Based in Nepal ๐ณ๐ต
I'm an AI Engineer from Nepal specializing in LLM systems, RAG pipelines, and Agentic AI.
I've built production-ready AI backends at Grinda AI, where I developed a hybrid OCR pipeline for Korean financial documents, and completed the AI Fellowship at Fuse Machine working on an NLP-based mental health chatbot.
I enjoy turning complex AI research into real-world systems that actually work.
Production-ready RAG backend that ingests documents, creates vector embeddings with OpenAI, and serves answers via REST API. Uses Qdrant as vector store, Redis for caching, and PyMuPDF + LlamaParse for parsing.
AI chatbot for emotional support using fine-tuned Qwen models for emotion and intent classification. Uses FAISS for semantic memory, SQLite for session persistence, and real-time crisis keyword detection for escalation.
Real-time facial emotion detection trained on FER2013 48x48 grayscale images. Built a CNN from scratch, reduced overfitting via augmentation and regularization, and integrated live webcam inference with avatar mirroring.
Built a hybrid OCR pipeline at Grinda AI combining PaddleOCR, Qwen-VL, and GLM-4V for processing Korean financial documents. Generated synthetic training samples and benchmarked against GPT-4o and Upstage using CER, WER, and TEDS metrics.
Interested in AI/ML collaboration, research opportunities, or building intelligent systems? Feel free to reach out. I'm always open to discussing new projects and ideas.