Full Stack Developer Intern - Ann Arbor, MI
- Developed an end-to-end solution for parsing and extracting data from chemical safety data sheets using GPU-accelerated OCR (Tesseract) and clustering techniques, reducing processing time from 15 to 3 minutes per PDF.
- Deployed locally hosted LLMs (Mistral, Llama3 via Ollama) to ensure data privacy, orchestrated queries using LangChain, and integrated selective OpenAI API usage for enhanced accuracy.
- Implemented and optimized RabbitMQ message queues to distribute PDF parsing tasks across GPU-backed servers, incorporating retry logic and dead letter queues to handle failures and analyze problematic messages for reliability.
- Implemented an Excel export feature in a Spring Boot and Angular application, integrating a seamless UI button and leveraging Apache POI to generate well-structured spreadsheets for tracking workers' certificate statuses.
- Optimized legacy SQL queries by resolving the N+1 problem, consolidating multiple inefficient loops into a single query strategy, reducing data retrieval and file generation time from 1 minute to 5 seconds
- Deployed the service on GPU-backed servers using Docker containers and designed Nginx load balancers to ensure efficient distribution and scalability.
- Designed and implemented a Retrieval-Augmented Generation (RAG) chatbot using LangChain, incorporating a sliding window technique to deliver real-time, context-aware answers to chemical regulation queries.
- Scraped chemical regulation websites using Python, preprocessed data into multilingual vector spaces with Hugging Face's GPT-2 tokenizer, and employed Faiss for vector storage, similarity search, and automated index updates.