Projects

I love working on and building projects! Here are some my favorites below, including personal projects and some of the research I have worked on and am currently working on. Super excited about the evolutionary algorithms research I'm working on right now :) And would love to discuss more!

Personal Projects

EPA Natural Language Query System

A natural language query system for EPA (Environmental Protection Agency) datasets. This system allows you to query multiple JSON datasets using natural language, using a context layer and embeddings to select the most relevant dataset and extracting the requested ingredient information from the relevant dataset.

GitHub link here!

RevealTheIngredient: A Computer-Vision Based iOS Application

A computer vision iOS application with the ability to scan and recognize over 25,000 ingredients found in chemical products.

Check it out on the App Store!

User-Friendly Defective Solar Cell Detection using Artificial Intelligence

Created web application to recognize defects in 1,000+ solar cells in real-time, preventing solar panel degradation.

Quad Chart

Research Projects

Node Perturbation for Gradient-Free Fine-Tuning of Large Language Models I'm currently working as an AI researcher at the Brain and Cognitive Sciences Department at MIT, where I'm finetuning LLMs with mechanisms inspired by the brain! Typical fine-tuning techniques include supervised learning and reinforcement learning with human feedback (RLHF). These methods are effective but time intensive, especially with their reliance on back propagation. RL also often correlates with reward-hacking and can be unstable. Not to mention that the reliance on back propagation and partial derivates translates poorly to the processes that occur in the brain! My research explores how we can emulate the brain's own learning mechanisms to fine-tune LLMs. In particular, we are utilizing evolutionary algorithms (evolution strategies) to fine-tune LLMs. The results are promising, as we demonstrate that evolution strategies can scale efficiently and result in similar performance to RL policies like GRPO, but using mechanisms such as node and weight perturbation.

Computationally Emulating the Evolution of Eyesight using Reinforcement Learning I worked as an AI/ML undergraduate researcher at the Media Lab (Camera Culture group). I expanded an existing reinforcement learning (RL) framework in Python, PyTorch, and MuJoCo by training agents across diverse environments (multi-eye perception, adversarial setups) to test robustness and adaptability in navigation tasks. I also discovered an emergent dragonfly prediction strategy, where agents anticipated goal locations before arrival, demonstrating RL's ability to uncover novel optimization behaviors. I engineered a hybrid approach combining supervised models (CNNs, MLPs) with RL by passing distance-to-goal predictions as inputs, improving learning efficiency and enabling multisensory input modeling. I finally validated the approach through trajectory mapping and performance benchmarking, achieving competitive results with baseline RL while reducing training complexity

And some links to some other projects here...

Emulating the Evolution of Eyesight

CSAwesome Contributions