RAG-Based Conversational AI Platform
Production-grade RAG pipelines over 20k+ documents with re-ranking, score filtering, and sub-200ms latency using FastAPI and Qdrant.
Building production-grade systems across AI and backend
Microservices processing millions of records across telecom regions
API latency reduced via event-driven architecture with RabbitMQ
LangGraph-based agent workflows designed and deployed
I am a Software Engineer with 3+ years of experience and a strong focus on AI/ML. I build production-grade LLM systems, RAG pipelines, and scalable microservices that process millions of records daily. I architect end-to-end solutions on AWS and Kubernetes, from vector search backends to real-time observability frameworks. I care about reliability, observability, and system design: building systems that hold up under real-world constraints.
AI/ML, backend engineering, cloud infrastructure, and observability
Professional journey and key contributions
XNODE Inc.
•Jan 2025 - Present
📍 Westport, CT
Designing and building LangGraph-based AI agent workflows, GenAI backends, and production-ready RAG pipelines deployed on Kubernetes and AWS.
Reliance Jio Infocomm Ltd.
•May 2022 - Aug 2024
📍 Hyderabad, India
Architected Python/Flask microservices processing 8-9M+ records/day across 10+ telecom regions for real-time network monitoring.
Academic journey and achievements
Master of Science in Information Technology
Bachelor of Technology in Information Technology
Recent projects demonstrating expertise across different domains
Production-grade RAG pipelines over 20k+ documents with re-ranking, score filtering, and sub-200ms latency using FastAPI and Qdrant.
Full-stack reservation system handling 100+ concurrent sessions with geofencing-based validation and event-driven reassignment.
AIoT-based predictive maintenance using IoT sensors and Generative AI.
Autonomous multi-API outreach system scaling personalized emails to 150-700/day.
Interactive dashboard analyzing U.S. public school performance from 2009 to 2019 using the Stanford Education Data Archive.
Interactive Dash application for visualizing country population, life expectancy, and GDP data.
Have a project in mind or just want to chat? I'd love to hear from you.
Ready to bring your ideas to life? Drop me a message and let's start building something extraordinary together.
Tempe, Arizona
Fill out the form below and I'll get back to you within 24 hours.