Second Brain: Conversational RAG Chatbot

Built a local-first RAG chatbot that ingests 4,954 personal conversations into a 312K-chunk pgvector knowledge base. Improved retrieval recall from 0.58 to 0.85 on a 50-question eval set through context engineering. Runs LLM ops on AWS Bedrock with model-tier routing to balance cost and quality. Evaluation pipeline uses LLM-as-Judge for automated regression testing across retrieval and generation quality.

PythonPyTorchHugging FacePostgreSQLpgvectorAWS BedrockLLM-as-Judge

Technical Deep-Dives

Blog posts coming soon.

Technical write-ups for this project are in progress.