
AI Application Engineer with 2+ years of experience building production-oriented Generative AI systems using Python and FastAPI. Strong understanding of Machine Learning (ML) and Deep Learning (DL) fundamentals, including transformer architectures and model evaluation techniques. Hands-on experience designing modular backend services integrating Large Language Models (LLMs),Retrieval-Augmented Generation (RAG), and agentic workflows into deployed applications. Focused on scalable architecture, LLM evaluation, CI/CD automation, and reliable AI system design.
RAG-Based Document Analysis Platform
Optimised the system by:
Containerised using Docker and deployed to AWS ECS with CI/CD automation via GitHub Actions for scalable and consistent deployments.
Multimodal Semantic Image Search System
Optimised the system by:
Developed a Streamlit interface for controlled testing and qualitative evaluation of retrieval performance.
Agentic AI Research Workflow System (LangGraph + FastAPI)
Designed and implemented a production-oriented Multi-Agent AI Orchestration System using LangGraph to manage structured, multi-step LLM workflows (analyst generation → interview → web search → section writing → report synthesis).
Developed modular backend services with FastAPI, ensuring clear separation between Core AI Logic and Production Concerns (structured logging, configuration management, exception handling).
Integrated Tool-Based Web Search (Tavily) using LLM-Driven Structured Query Generation, enabling controlled external data injection and reliable execution.
Implemented Token Usage Tracking (prompt, completion, total tokens) per node to monitor cost and optimise LLM Utilisation.
Optimised the system by:
Automated structured report generation with export to DOCX and PDF formats.
Full UK work authorisation – no sponsorship required